Search Test Information Space

Found 651 bookmarks

Custom sorting

LLMs Can Plan Only If We Tell Them

View PDF

#Planning #Large Language Models #Paper #PDF

·arxiv.org·Jan 26, 2025

LLMs Can Plan Only If We Tell Them

Wolfram Blog: News, Views and Insights from Wolfram

#Thematic Analysis #Wolfram #Large Language Models

·blog.wolfram.com·Jan 21, 2025

Wolfram Blog: News, Views and Insights from Wolfram

Evolving Deeper LLM Thinking

View PDF

#Large Language Models #Google #Planning

·arxiv.org·Jan 20, 2025

Evolving Deeper LLM Thinking

NeurIPS Poster Large Language Models' Expert-level Global History Knowledge Benchmark (HiST-LLM)

#Benchmark #Large Language Models #History #Paper

·nips.cc·Jan 19, 2025

NeurIPS Poster Large Language Models' Expert-level Global History Knowledge Benchmark (HiST-LLM)

How is Google using AI for internal code migrations?

View PDF

#Software Engineering #Google #Performance #Paper #PDF #Large Language Models

·arxiv.org·Jan 18, 2025

How is Google using AI for internal code migrations?

Titans: Learning to Memorize at Test Time

#Large Language Models #Memory #Performance #Paper #PDF #Google

·arxiv.org·Jan 16, 2025

Titans: Learning to Memorize at Test Time

Meta’s new AI model can translate speech from more than 100 languages

#Language Translation #Machine Translation #Meta #Large Language Models

·technologyreview.com·Jan 15, 2025

Meta’s new AI model can translate speech from more than 100 languages

Alibaba slashes prices on large language models by up to 85% as China AI rivalry heats up

#Alibaba #Large Language Models #Pricing

·cnbc.com·Jan 1, 2025

Alibaba slashes prices on large language models by up to 85% as China AI rivalry heats up

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

#Opensource #Large Language Models

·venturebeat.com·Dec 26, 2024

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

Could Savannah be the Next San Jose? The Downstream Effects of Large Language Models

#Economics #Geography #Large Language Models #Paper #PDF

·papers.ssrn.com·Dec 26, 2024

Could Savannah be the Next San Jose? The Downstream Effects of Large Language Models

What just happened

#Review #Large Language Models #Small Language Models #Performance

·oneusefulthing.org·Dec 23, 2024

What just happened

12 Days of OpenAI | OpenAI

#Alignment #Reasoning #Large Language Models #OpenAI

·openai.com·Dec 20, 2024

12 Days of OpenAI | OpenAI

SciAgents: Automating Scientific Discovery Through Bioinspired Multi‐Agent Intelligent Graph Reasoning

#Hypothesis #Agents #Framework #Large Language Models #Paper #PDF #Knowledge Graph #Heuristics #Materials #Synthetic Biology

·onlinelibrary.wiley.com·Dec 20, 2024

SciAgents: Automating Scientific Discovery Through Bioinspired Multi‐Agent Intelligent Graph Reasoning

FACTS Grounding: A new benchmark for evaluating the factuality of large language models

#Benchmark #Large Language Models #Fact-checking

·deepmind.google·Dec 18, 2024

FACTS Grounding: A new benchmark for evaluating the factuality of large language models

Brilliant talk by , but he's wrong on one point.

#Criticism #Prediction #Large Language Models

·x.com·Dec 15, 2024

Brilliant talk by , but he's wrong on one point.

AI's Data Dilemma

#News #Research #Large Language Models

·meta.ai·Dec 15, 2024

AI's Data Dilemma

If You Can't Use Them, Recycle Them: Optimizing Merging at...

View PDF

#Large Language Models #Training #Merging #Cohere

·arxiv.org·Dec 10, 2024

If You Can't Use Them, Recycle Them: Optimizing Merging at...

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

#Large Language Models #Training #Testing #Paper #PDF

·arxiv.org·Dec 9, 2024

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

(10) I asked AI chatbots to analyze “Alice in Wonderland” | LinkedIn

#Large Language Models #Literature #Analysis

·linkedin.com·Dec 3, 2024

(10) I asked AI chatbots to analyze “Alice in Wonderland” | LinkedIn

Evaluating Character Understanding of Large Language Models via...

#Large Language Models #Literature #Profile #Paper #PDF

·arxiv.org·Dec 3, 2024

Evaluating Character Understanding of Large Language Models via...

Large Language Models Fall Short: Understanding Complex...

View PDF

#Large Language Models #Analysis #Literature #Paper #PDF

·arxiv.org·Dec 3, 2024

Large Language Models Fall Short: Understanding Complex...

Decoding LLMs: How to be visible in generative AI search results

#SEO #Search #Large Language Models

·searchengineland.com·Nov 30, 2024

Decoding LLMs: How to be visible in generative AI search results

Asai, A. and others. (2024). OPENSCHOLAR: SYNTHESIZING SCIENTIFIC LITERATURE WITH RETRIEVAL-AUGMENTED LMS.

#RAG #Large Language Models #Allen Institute #Paper #PDF #Literature Review #Benchmark #Search

·openscholar.allen.ai·Nov 21, 2024

Asai, A. and others. (2024). OPENSCHOLAR: SYNTHESIZING SCIENTIFIC LITERATURE WITH RETRIEVAL-AUGMENTED LMS.

DeepSeek

#Large Language Models #Chatbot #Reasoning #China #Opensource

·deepseek.com·Nov 20, 2024

DeepSeek

Hidden Persuaders: LLMs' Political Leaning and Their Influence...

#Political Science #Bias #Large Language Models #Influencers #Paper #PDF

·arxiv.org·Nov 19, 2024

Hidden Persuaders: LLMs' Political Leaning and Their Influence...

Why LLMs Within Software Development May Be a Dead End

#Software Engineering #Large Language Models #Criticism

·thenewstack.io·Nov 18, 2024

Why LLMs Within Software Development May Be a Dead End

Large Language Model Influence on Diagnostic Reasoning

"The availability of an LLM as a diagnostic aid did not improve physician performance compared with conventional resources in a diagnostic reasoning randomized clinical trial. The LLM alone outperformed physicians even when the LLM was available to them, indicating that further development in human-computer interactions is needed to realize the potential of AI in clinical decision support systems."

The availability of an LLM as a diagnostic aid did not improve physician performance compared with conventional resources in a diagnostic reasoning randomized clinical trial. The LLM alone outperformed physicians even when the LLM was available to them, indicating that further development in human-computer interactions is needed to realize the potential of AI in clinical decision support systems.

#Medical #Diagnostics #Large Language Models #Paper #PDF

·jamanetwork.com·Nov 18, 2024

Large Language Model Influence on Diagnostic Reasoning

A Benchmark for Long-Form Medical Question Answering

View PDF

#Medical #Large Language Models #Questions and Answers #Benchmark

·arxiv.org·Nov 18, 2024

A Benchmark for Long-Form Medical Question Answering