ActPC-Chem: Discrete Active Predictive Coding for Goal-Guided...
Iterative Reasoning Preference Optimization
OpenELM paper (OpenELM_Paper.pdf, paper branch, CarperAI/OpenELM on GitHub)
Faster sorting algorithms discovered using deep reinforcement learning
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling