Detecting misbehavior in frontier reasoning models | OpenAI
Mitchell, M. (2010.) Biological Computation.
When Personalization Meets Reality: A Multi-Faceted Analysis of...
View PDF
The AI Agent Index
View PDF
Superintelligence_Strategy_Expert.pdf
The Widespread Adoption of Large Language Model-Assisted Writing...
Why do Experts Disagree on Existential Risk and P(doom)? A Survey...
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding...
View PDF
Gpt 4 5 system card
Towards an AI co-scientist
View PDF
Forecasting rare language model behaviors \ Anthropic
Deep-learning enabled generalized inverse design of multi-port radio-frequency and sub-terahertz passives and integrated circuits - Nature Communications
World and Human Action Models towards gameplay ideation - Nature
Interferometric single-shot parity measurement in InAs–Al hybrid devices - Nature
eXtended Reality and Artificial Intelligence in Medicine and Rehabilitation - Information Systems Frontiers
Download PDF
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World...
View PDF
Lee 2025 ai critical thinking survey
The Labor Market Effects of Generative Artificial Intelligence
Open PDF in Browser
Distillation Scaling Laws
View PDF
Competitive Programming with Large Reasoning Models
On-device Sora: Enabling Diffusion-Based Text-to-Video Generation...
Goku: Flow Based Video Generative Foundation Models
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2
View PDF
s1: Simple test-time scaling
Our Approach to Frontier AI | Meta
Constitutional Classifiers: Defending against Universal Jailbreaks...
On the Diagram of Thought
Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge
View PDF
International AI Safety Report 2025
International AI Safety Report 2025
Training Large Language Models to Reason in a Continuous Latent Space