OpenAI o1 Results on ARC-AGI-Pub
Why OpenAI's GPT-o1 is a Game-Changer: 10 Must-Know Uses (OpenAI Demo)
How OpenAI made o1 "think" – Here is what we think and already know about o1 reinforcement learning
DataGemma: Using real-world data to address AI hallucinations
research paper
Introducing OpenAI o1 - a new series of reasoning models for solving hard problems | OpenAI
AI as a Mirror Into the Self
Matt Shumer on X: "I'm excited to announce Reflection 70B, the world’s top open-source model. Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes. 405B coming next week - we expect it to be the best model in the world. Built w/ @GlaiveAI. Read on ⬇️: https://t.co/kZPW1plJuo" / X
Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution – Paper Explained
MIT researchers use large language models to flag problems in complex systems
Smaller, Safer, More Transparent: Advancing Responsible AI with Gemma
GPT-4o Long Output | OpenAI
[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations
Large Enough
Introducing Llama 3.1: Our most capable models to date
The Vision of Autonomic Computing: Can LLMs Make It a Reality?
Wolfram LLM Benchmarking Project
Prover-Verifier Games improve legibility of language model outputs | OpenAI
Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
GAVEL: Generating Games Via Evolution and Language Models
AI Chatbots Seem as Ethical as a New York Times Advice Columnist
One Thousand and One Pairs: A "novel" challenge for...
OpenAI Builds AI to Critique AI
Scalable MatMul-free Language Modeling
Empathic AI can’t get under the skin - Nature Machine Intelligence
Open Source LibreChat Offers More Than Just Extra LLMs
Human vs. Machine: Behavioral Differences Between Expert Humans...
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Advancing personal health and wellness insights with AI
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning