Model-Based Transfer Learning for Contextual Reinforcement Learning#Machine Learning#Transfer Learning#Reinforcement Learning#Paper#PDF#Performance·arxiv.org·Nov 23, 2024Model-Based Transfer Learning for Contextual Reinforcement Learning
Training Language Models to Self-Correct via Reinforcement LearningView PDF#Large Language Models#Accuracy#Reinforcement Learning#DeepMind#Paper#PDF·arxiv.org·Sep 22, 2024Training Language Models to Self-Correct via Reinforcement Learning
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs#RLHF#Reinforcement Learning#Large Language Models#Paper#PDF·arxiv.org·Feb 26, 2024Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
Pearl: A Production-ready Reinforcement Learning AgentDownload PDF#Reinforcement Learning#Meta#Paper#PDF·arxiv.org·Dec 14, 2023Pearl: A Production-ready Reinforcement Learning Agent
Chip Placement with Deep Reinforcement Learning#Reinforcement Learning#Design#Automation#Google#Paper#PDF·arxiv.org·May 30, 2023Chip Placement with Deep Reinforcement Learning
An Overview of Environmental Features that Impact Deep Reinforcement Learning in Sparse-Reward Domains | Journal of Artificial Intelligence Research#Performance#Deep Learning#Reinforcement Learning#Paper#PDF·jair.org·Apr 26, 2023An Overview of Environmental Features that Impact Deep Reinforcement Learning in Sparse-Reward Domains | Journal of Artificial Intelligence Research
Pre-training generalist agents using offline reinforcement learning#Reinforcement Learning#Google#Paper#PDF·ai.googleblog.com·Feb 23, 2023Pre-training generalist agents using offline reinforcement learning