Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)#Prompt Engineering#Large Language Models#Paper#Mutation·youtube.com·Oct 18, 2023Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented ModelsDownload PDF#Large Language Models#Cohere#Paper#PDF·arxiv.org·Oct 17, 2023Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Llemma: An Open Language Model For Mathematics#Mathematics#Large Language Models#Paper#PDF#EleutherAI·arxiv.org·Oct 17, 2023Llemma: An Open Language Model For Mathematics
Efficient Streaming Language Models with Attention Sinks (Paper Explained)#Large Language Models#Paper#Review·youtube.com·Oct 16, 2023Efficient Streaming Language Models with Attention Sinks (Paper Explained)
Large Language Models can Learn RulesDownload PDF#Large Language Models#Reasoning#Paper#PDF·arxiv.org·Oct 14, 2023Large Language Models can Learn Rules
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsDownload PDF#Large Language Models#Health#Survey#Paper#PDF·arxiv.org·Oct 13, 2023A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Walking Down the Memory Maze: Beyond Context Limit through Interactive ReadingDownload PDF#Large Language Models#Questions and Answers#Paper#PDF·arxiv.org·Oct 10, 2023Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Large Language Models Cannot Self-Correct Reasoning YetDownload PDF#Large Language Models#Reasoning#Paper#PDF·arxiv.org·Oct 9, 2023Large Language Models Cannot Self-Correct Reasoning Yet
Enable Language Models to Implicitly Learn Self-Improvement From Data#Large Language Models#Paper#PDF·arxiv.org·Oct 7, 2023Enable Language Models to Implicitly Learn Self-Improvement From Data
Human Feedback is not Gold StandardDownload PDF#Large Language Models#Feedback#Criticism#RLHF#Paper#PDF·arxiv.org·Oct 4, 2023Human Feedback is not Gold Standard
Borges and AIDownload PDF#Large Language Models#Literature#Psychology#Paper#PDF·arxiv.org·Oct 4, 2023Borges and AI
Language Models Represent Space and TimeDownload PDF#Large Language Models#Paper#PDF·arxiv.org·Oct 4, 2023Language Models Represent Space and Time
NExT-GPT: Any-to-Any Multimodal LLM#Large Language Models#Multimodal#Paper#PDF·arxiv.org·Sep 27, 2023NExT-GPT: Any-to-Any Multimodal LLM
Large Language Models for Compiler OptimizationDownload PDF#Large Language Models#Compilers#Meta#Paper#PDF·arxiv.org·Sep 25, 2023Large Language Models for Compiler Optimization
Investigating Answerability of LLMs for Long-Form Question AnsweringDownload PDF#Large Language Models#QA#Paper#Salesforce·arxiv.org·Sep 25, 2023Investigating Answerability of LLMs for Long-Form Question Answering
Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual TasksPDF#Large Language Models#Performance#Paper#PDF·arxiv.org·Jul 31, 2023Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks
Llama 2: Open Foundation and Fine-Tuned Chat ModelsPDF#LLaMa#Large Language Models#Meta#Paper#PDF·arxiv.org·Jul 28, 2023Llama 2: Open Foundation and Fine-Tuned Chat Models
Language models and linguistic theories beyond words#Linguistics#Paper#PDF#Large Language Models·nature.com·Jul 24, 2023Language models and linguistic theories beyond words
Large language models encode clinical knowledge#Large Language Models#Medical#Google#Paper#PDF·nature.com·Jul 14, 2023Large language models encode clinical knowledge
OpenELM/OpenELM_Paper.pdf at paper · CarperAI/OpenELM · GitHub#Large Language Models#Algorithms#Feedback#Paper#PDF·github.com·Jul 11, 2023OpenELM/OpenELM_Paper.pdf at paper · CarperAI/OpenELM · GitHub
Large-scale Text-to-Image Generation Models for Visual Artists' Creative WorksPDF#Text-to-Image#Large Language Models#Generative Models#Paper#PDF·arxiv.org·Jul 5, 2023Large-scale Text-to-Image Generation Models for Visual Artists' Creative Works
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias#Large Language Models#Training#Machine Learning#Paper#PDF·arxiv.org·Jul 4, 2023Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
Towards Measuring the Representation of Subjective Global Opinions in Language Models#Large Language Models#Machine Learning#Opinion#Paper#PDF·arxiv.org·Jun 30, 2023Towards Measuring the Representation of Subjective Global Opinions in Language Models
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations#Large Language Models#Diffusion#GPU#Paper#PDF·arxiv.org·Jun 22, 2023Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Opportunities and Risks of LLMs for Scalable Deliberation with Polis#Large Language Models#Democracy#Paper#PDF·arxiv.org·Jun 22, 2023Opportunities and Risks of LLMs for Scalable Deliberation with Polis
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training - Microsoft Research#Large Language Models#Training#Microsoft#Paper#PDF·microsoft.com·Jun 22, 2023ZeRO++: Extremely Efficient Collective Communication for Giant Model Training - Microsoft Research
ChemCrow: Augmenting large-language models with chemistry tools#Large Language Models#Chemistry#Paper#PDF·arxiv.org·Jun 22, 2023ChemCrow: Augmenting large-language models with chemistry tools
Demystifying GPT Self-Repair for Code Generation#GPT-4#Debug#Large Language Models#Paper#PDF·arxiv.org·Jun 20, 2023Demystifying GPT Self-Repair for Code Generation
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative FusionPDF#Large Language Models#Machine Learning#Paper#PDF·arxiv.org·Jun 20, 2023LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
FinGPT: Open-Source Financial Large Language Models#Finance#Large Language Models#Economics#Paper#PDF·arxiv.org·Jun 17, 2023FinGPT: Open-Source Financial Large Language Models