Direct Preference Optimization: Your Language Model is Secretly a Reward Model · #Large Language Models #Preferences #Reward #Training #Paper #PDF · arxiv.org · Jun 4, 2023
Large Language Models as Tool Makers · #Large Language Models #Tools #Machine Learning #Paper #PDF · arxiv.org · Jun 1, 2023
Scaling Data-Constrained Language Models · #Large Language Models #Scale #Paper #PDF · arxiv.org · Jun 1, 2023
Improving Factuality and Reasoning in Language Models through Multiagent Debate · #Reasoning #Large Language Models #Machine Learning #Computer Vision #Paper #PDF · arxiv.org · May 30, 2023
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text | Journal of Artificial Intelligence Research · #Generative Speech #Large Language Models #Paper #PDF #Google #Natural Language Processing · jair.org · May 30, 2023
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training · #Large Language Models #Pretrained Models #Stanford #Paper #PDF · arxiv.org · May 27, 2023
Language models can explain neurons in language models · #Interpretability #Large Language Models #Paper #PDF · openaipublic.blob.core.windows.net · May 27, 2023
The New World of LLM Functions: Integrating LLM Technology into the Wolfram Language · #Wolfram #Large Language Models #Function · writings.stephenwolfram.com · May 24, 2023
Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 · #Large Language Models #Opensource #Hugging Face #Ranking · huggingface.co · May 24, 2023
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling · #Chain of Thought #Large Language Models #Prompt Engineering #Paper #PDF #Microsoft #Algorithms · arxiv.org · May 22, 2023
When AI’s Large Language Models Shrink · #Large Language Models #Scale #Sparsity · spectrum.ieee.org · May 13, 2023
Enabling conversational interaction on mobile with LLMs · #Large Language Models #Mobile #User Interfaces #Paper #Google · ai.googleblog.com · May 12, 2023
Introducing MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs · #Large Language Models #Opensource #Instruction #Storytelling #Chatbot · mosaicml.com · May 8, 2023
Boosting Theory-of-Mind Performance in Large Language Models via Prompting · #Large Language Models #Prompt Engineering #Theory of Mind #Paper #PDF · arxiv.org · May 8, 2023
Science in the age of large language models - Nature Reviews Physics · #Large Language Models #Science #Paper · nature.com · May 7, 2023
Dissecting Recall of Factual Associations in Auto-Regressive Language Models · #Large Language Models #Paper #PDF · arxiv.org · May 6, 2023
Generative AI entails a credit–blame asymmetry · #Ethics #Generative Models #Large Language Models · nature.com · May 4, 2023
From Fear to Action: AI Governance and Opportunities for All · #Research #Ethics #Large Language Models #Opinion #GPT-4 · frontiersin.org · May 4, 2023
StarCoder_paper.pdf · #Large Language Models #Coding #Chatbot #Paper · drive.google.com · May 4, 2023
Generative Language Models and Automated Influence Operations: Emerging Threats and Potential Mitigations · #Large Language Models #Generative Models #Political Science #Paper #PDF · arxiv.org · May 1, 2023
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes · #Large Language Models #Training #Coding #Paper #PDF · arxiv.org · Apr 30, 2023
LlamaIndex 0.6.0: A New Query Interface Over your Data · #Large Language Models #API · medium.com · Apr 29, 2023
Low-code LLM: Visual Programming over LLMs · #Low Code #Large Language Models #Paper #PDF #Microsoft · arxiv.org · Apr 27, 2023
Hyena Hierarchy: Towards Larger Convolutional Language Models · #Large Language Models #Architecture #Training #Paper #PDF #Machine Learning #CNN · arxiv.org · Apr 25, 2023
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models · #Large Language Models #Paper #PDF · arxiv.org · Apr 24, 2023
Emergent and Predictable Memorization in Large Language Models · #Large Language Models #Paper #PDF · arxiv.org · Apr 24, 2023
Augmented Language Models: a Survey · #Large Language Models #Meta #Survey #Paper #PDF · arxiv.org · Apr 23, 2023
Learning to Compress Prompts with Gist Tokens · #Large Language Models #Prompt Engineering #Compression #Paper #PDF · arxiv.org · Apr 23, 2023