Training Language Models to Self-Correct via Reinforcement LearningView PDF#Large Language Models#Accuracy#Reinforcement Learning#DeepMind#Paper#PDF·arxiv.org·Sep 22, 2024Training Language Models to Self-Correct via Reinforcement Learning
Long-form factuality in large language models#Large Language Models#Accuracy#Fact-checking#Paper#PDF·arxiv.org·Mar 29, 2024Long-form factuality in large language models