Forecasting rare language model behaviors \ Anthropic
DataGemma: Using real-world data to address AI hallucinations
Advancing medical AI with Med-Gemini
OpenEQA: From word models to world models
OpenEQA combines challenging open-vocabulary questions with the ability to answer in natural language. The result is a straightforward benchmark on which strong performance demonstrates a strong understanding of the environment, and which poses a considerable challenge to current foundation models. We hope this work motivates additional research into helping AI understand and communicate about the world it sees.
V-JEPA: The next step toward advanced machine intelligence