Search Test Information Space

Found 582 bookmarks

Newest

Prover-Verifier Games improve legibility of language model outputs | OpenAI

#Legibility #Large Language Models #OpenAI #Paper #PDF

·openai.com·Jul 18, 2024

Prover-Verifier Games improve legibility of language model outputs | OpenAI

Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism

#Large Language Models #Scale #Forecasting

·arstechnica.com·Jul 16, 2024

Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

#Spreadsheet #Large Language Models #Microsoft #Paper #PDF

·arxiv.org·Jul 16, 2024

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

GAVEL: Generating Games Via Evolution and Language Models

View PDF

#Generative AI #Games #Paper #PDF #Large Language Models #Coding #Computer Science #Evolutionary Computation #Design #Automation

·arxiv.org·Jul 15, 2024

GAVEL: Generating Games Via Evolution and Language Models

AI Chatbots Seem as Ethical as a New York Times Advice Columnist

#Philosophy #Large Language Models #ChatGPT

·scientificamerican.com·Jul 14, 2024

AI Chatbots Seem as Ethical as a New York Times Advice Columnist

One Thousand and One Pairs: A "novel" challenge for...

View PDF

#Large Language Models #Reasoning #Paper #PDF

·arxiv.org·Jun 29, 2024

One Thousand and One Pairs: A "novel" challenge for...

OpenAI Builds AI to Critique AI

#OpenAI #ChatGPT #Coding #Verification #Large Language Models

·spectrum.ieee.org·Jun 27, 2024

OpenAI Builds AI to Critique AI

Scalable MatMul-free Language Modeling

View PDF

#Large Language Models #Performance #FPGA #Paper #PDF

·arxiv.org·Jun 26, 2024

Scalable MatMul-free Language Modeling

Empathic AI can’t get under the skin - Nature Machine Intelligence

#Empathy #Large Language Models

·nature.com·Jun 24, 2024

Empathic AI can’t get under the skin - Nature Machine Intelligence

Open Source LibreChat Offers More Than Just Extra LLMs

#Chatbot #Opensource #Large Language Models #Platforms

·thenewstack.io·Jun 22, 2024

Open Source LibreChat Offers More Than Just Extra LLMs

Human vs. Machine: Behavioral Differences Between Expert Humans...

#Simulation #Large Language Models #Defense #Paper #PDF

·arxiv.org·Jun 18, 2024

Human vs. Machine: Behavioral Differences Between Expert Humans...

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

View PDF

#Planning #Problem Set #Large Language Models #Paper #PDF

·arxiv.org·Jun 13, 2024

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

Advancing personal health and wellness insights with AI

#Health #Large Language Models #Google

·research.google·Jun 12, 2024

Advancing personal health and wellness insights with AI

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

View PDF

#Planning #Natural Language Processing #Large Language Models #DeepMind #Paper #PDF

·arxiv.org·Jun 10, 2024

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

How Game Theory Can Make AI More Reliable

#Game Theory #Questions and Answers #Large Language Models

·wired.com·Jun 9, 2024

How Game Theory Can Make AI More Reliable

Scaling and evaluating sparse autoencoders

View PDF

#Large Language Models #Visualization #OpenAI #Paper #PDF #Explainability

·arxiv.org·Jun 7, 2024

Scaling and evaluating sparse autoencoders

To Believe or Not to Believe Your LLM

View PDF

#Large Language Models #Trustworthy #DeepMind #Paper #PDF

·arxiv.org·Jun 5, 2024

To Believe or Not to Believe Your LLM

LLMs achieve adult human performance on higher-order theory of mind tasks

View PDF

#Theory of Mind #Large Language Models #Paper #PDF

·arxiv.org·May 31, 2024

LLMs achieve adult human performance on higher-order theory of mind tasks

Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

#RAG #Large Language Models #Legal #Paper #PDF

·dho.stanford.edu·May 31, 2024

Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

1-bit LLMs Could Solve AI’s Energy Demands

#Small Language Models #Energy #Large Language Models

·spectrum.ieee.org·May 30, 2024

1-bit LLMs Could Solve AI’s Energy Demands

Aya | Cohere For AI

Aya 23 - 8B is the newest.

#Large Language Models #Multilingual #Opensource #Cohere

·cohere.com·May 23, 2024

Aya | Cohere For AI

China’s latest answer to OpenAI is ‘Chat Xi PT’

#Large Language Models #China #Doctrine

·ft.com·May 22, 2024

China’s latest answer to OpenAI is ‘Chat Xi PT’

AI models can outperform humans in tests to identify mental states

#Large Language Models #Theory of Mind

·technologyreview.com·May 20, 2024

AI models can outperform humans in tests to identify mental states

SpeechVerse: A Large-scale Generalizable Audio Language Model

View PDF

#Large Language Models #Speech #Amazon #Paper #PDF

·arxiv.org·May 16, 2024

SpeechVerse: A Large-scale Generalizable Audio Language Model

Evaluating Large Language Models Using “Counterfactual Tasks”

#Performance #Reasoning #Large Language Models #Blog

·aiguide.substack.com·May 14, 2024

Evaluating Large Language Models Using “Counterfactual Tasks”

Large Language Models as Planning Domain Generators

View PDF

#Planning #Large Language Models #Paper #PDF #IBM

·arxiv.org·May 14, 2024

Large Language Models as Planning Domain Generators

LogoMotion: Visually Grounded Code Generation for Content-Aware Animation

#Logo #Animation #Adobe #Large Language Models #Paper #PDF

·arxiv.org·May 14, 2024

LogoMotion: Visually Grounded Code Generation for Content-Aware Animation

What matters when building vision-language models?

#Vision #Large Language Models #Paper #PDF

·arxiv.org·May 14, 2024

What matters when building vision-language models?

IBM open-sources its Granite AI models - and they mean business

#Opensource #Large Language Models #IBM

·zdnet.com·May 14, 2024

IBM open-sources its Granite AI models - and they mean business

U.S. plans AI export controls amid China and Russia tech advances

#AI #Regulation #Government #Large Language Models

·readwrite.com·May 9, 2024

U.S. plans AI export controls amid China and Russia tech advances