Search Test Information Space

Found 36 bookmarks

Custom sorting

29/Nov/2023 - Google-proof, ultra-high-ceiling AI tests (BASIS, GAIA, GPQA) - LifeArchitect.ai LIVE

·youtube.com·Nov 29, 2023

Supporting benchmarks for AI safety with MLCommons

·blog.research.google·Oct 27, 2023

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

·arxiv.org·Jul 5, 2023

New Records for the Biggest and Smallest AI Computers

·spectrum.ieee.org·Nov 17, 2022

[Own work] VALSE 💃: Benchmark for Vision and Language Models Centered on Linguistic Phenomena

·youtube.com·May 9, 2022

Is AI Training Outstripping Moore’s Law?

·spectrum.ieee.org·Dec 2, 2021