Optimizing foundation models from Medprompt o1#Foundation Models#Research#Microsoft#Accuracy#Prompt Engineering·microsoft.com·Nov 27, 2024Optimizing foundation models from Medprompt o1
Training Language Models to Self-Correct via Reinforcement LearningView PDF#Large Language Models#Accuracy#Reinforcement Learning#DeepMind#Paper#PDF·arxiv.org·Sep 22, 2024Training Language Models to Self-Correct via Reinforcement Learning
DataGemma: Using real-world data to address AI hallucinationsresearch paper#Large Language Models#DeepMind#Google#Accuracy#Database#Paper#Blog·blog.google·Sep 13, 2024DataGemma: Using real-world data to address AI hallucinations
Long-form factuality in large language models#Large Language Models#Accuracy#Fact-checking#Paper#PDF·arxiv.org·Mar 29, 2024Long-form factuality in large language models