Prover-Verifier Games improve legibility of language model outputs | OpenAI · #Legibility #Large Language Models #OpenAI #Paper #PDF · openai.com · Jul 18, 2024
Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism · #Large Language Models #Scale #Forecasting · arstechnica.com · Jul 16, 2024
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models · #Spreadsheet #Large Language Models #Microsoft #Paper #PDF · arxiv.org · Jul 16, 2024
GAVEL: Generating Games Via Evolution and Language Models · #Generative AI #Games #Paper #PDF #Large Language Models #Coding #Computer Science #Evolutionary Computation #Design #Automation · arxiv.org · Jul 15, 2024
AI Chatbots Seem as Ethical as a New York Times Advice Columnist · #Philosophy #Large Language Models #ChatGPT · scientificamerican.com · Jul 14, 2024
One Thousand and One Pairs: A "novel" challenge for... · #Large Language Models #Reasoning #Paper #PDF · arxiv.org · Jun 29, 2024
OpenAI Builds AI to Critique AI · #OpenAI #ChatGPT #Coding #Verification #Large Language Models · spectrum.ieee.org · Jun 27, 2024
Scalable MatMul-free Language Modeling · #Large Language Models #Performance #FPGA #Paper #PDF · arxiv.org · Jun 26, 2024
Empathic AI can’t get under the skin - Nature Machine Intelligence · #Empathy #Large Language Models · nature.com · Jun 24, 2024
Open Source LibreChat Offers More Than Just Extra LLMs · #Chatbot #Opensource #Large Language Models #Platforms · thenewstack.io · Jun 22, 2024
Human vs. Machine: Behavioral Differences Between Expert Humans... · #Simulation #Large Language Models #Defense #Paper #PDF · arxiv.org · Jun 18, 2024
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks · #Planning #Problem Set #Large Language Models #Paper #PDF · arxiv.org · Jun 13, 2024
Advancing personal health and wellness insights with AI · #Health #Large Language Models #Google · research.google · Jun 12, 2024
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning · #Planning #Natural Language Processing #Large Language Models #DeepMind #Paper #PDF · arxiv.org · Jun 10, 2024
How Game Theory Can Make AI More Reliable · #Game Theory #Questions and Answers #Large Language Models · wired.com · Jun 9, 2024
Scaling and evaluating sparse autoencoders · #Large Language Models #Visualization #OpenAI #Paper #PDF #Explainability · arxiv.org · Jun 7, 2024
To Believe or Not to Believe Your LLM · #Large Language Models #Trustworthy #DeepMind #Paper #PDF · arxiv.org · Jun 5, 2024
LLMs achieve adult human performance on higher-order theory of mind tasks · #Theory of Mind #Large Language Models #Paper #PDF · arxiv.org · May 31, 2024
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools · #RAG #Large Language Models #Legal #Paper #PDF · dho.stanford.edu · May 31, 2024
1-bit LLMs Could Solve AI’s Energy Demands · #Small Language Models #Energy #Large Language Models · spectrum.ieee.org · May 30, 2024
Aya | Cohere For AI · Aya 23 - 8B is the newest. · #Large Language Models #Multilingual #Opensource #Cohere · cohere.com · May 23, 2024
China’s latest answer to OpenAI is ‘Chat Xi PT’ · #Large Language Models #China #Doctrine · ft.com · May 22, 2024
AI models can outperform humans in tests to identify mental states · #Large Language Models #Theory of Mind · technologyreview.com · May 20, 2024
SpeechVerse: A Large-scale Generalizable Audio Language Model · #Large Language Models #Speech #Amazon #Paper #PDF · arxiv.org · May 16, 2024
Evaluating Large Language Models Using “Counterfactual Tasks” · #Performance #Reasoning #Large Language Models #Blog · aiguide.substack.com · May 14, 2024
Large Language Models as Planning Domain Generators · #Planning #Large Language Models #Paper #PDF #IBM · arxiv.org · May 14, 2024
LogoMotion: Visually Grounded Code Generation for Content-Aware Animation · #Logo #Animation #Adobe #Large Language Models #Paper #PDF · arxiv.org · May 14, 2024
What matters when building vision-language models? · #Vision #Large Language Models #Paper #PDF · arxiv.org · May 14, 2024
IBM open-sources its Granite AI models - and they mean business · #Opensource #Large Language Models #IBM · zdnet.com · May 14, 2024
U.S. plans AI export controls amid China and Russia tech advances · #AI #Regulation #Government #Large Language Models · readwrite.com · May 9, 2024