Detecting misbehavior in frontier reasoning models | OpenAI
Test Information Space
GPT-4.5 System Card
Competitive Programming with Large Reasoning Models
OpenAI’s Economic Blueprint | OpenAI
Influence and cyber operations: an update, October 2024
Prover-Verifier Games improve legibility of language model outputs | OpenAI
Scaling and evaluating sparse autoencoders
Diving deep into OpenAI’s new study on LLMs and bioweapons
Building an early warning system for LLM-aided biological threat creation
Weak-to-strong generalization
Improving Mathematical Reasoning with Process Supervision
Language models can explain neurons in language models
GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models
GPT-4 Technical Report
ChatGPT vs Sparrow - Battle of Chatbots