Forecasting rare language model behaviors \ Anthropic
Bach, F. (2024). Learning theory from first principles. MIT press.
OpenEQA: From word models to world models
OpenEQA combines challenging open-vocabulary questions with the ability to answer in natural language. This results in a straightforward benchmark that demonstrates a strong understanding of the environment—and poses a considerable challenge to current foundational models. We hope this work motivates additional research into helping AI understand and communicate about the world it sees.
Reading papers is an active discovery process. A return of note taking in the margin.