Test Information Space

Found 651 bookmarks

Does Transformer Interpretability Transfer to RNNs?

#RNN #Transformers #Large Language Models #Paper #PDF #EleutherAI

·arxiv.org·Apr 10, 2024

AI race heats up as OpenAI, Google and Mistral release new models

Instead, LeCun suggested, researchers needed to work on what he called “objective-driven” AI with the ability to reason and plan about the world, rather than just work on words alone.

#Large Language Models #Google #OpenAI #Mistral

·theguardian.com·Apr 10, 2024

Codegemma report

#Gemma #Coding #DeepMind #Google #Large Language Models #Paper #PDF #Opensource

·storage.googleapis.com·Apr 10, 2024

OpenAI’s GPT Store Is Triggering Copyright Complaints

#Legal #Copyright #Large Language Models #Chatbot #Training #Fine-Tuning #Literature

·wired.com·Apr 4, 2024

Introducing Command R+: A Scalable LLM Built for Business

#Cohere #Large Language Models

·txt.cohere.com·Apr 4, 2024

ReALM: Reference Resolution As Language Modeling

#Large Language Models #Paper #PDF #Apple

·arxiv.org·Apr 1, 2024

Jamba: A Hybrid Transformer-Mamba Language Model

#Large Language Models #Paper #PDF #Mixture of Experts

·arxiv.org·Apr 1, 2024

Nay, J. J., Karamardian, D., Lawsky, S. B., Tao, W., Bhat, M., Jain, R., ... & Kasai, J. (2024). Large language models as tax attorneys: a case study in legal capabilities emergence. Philosophical Transactions of the Royal Society A, 382(2270), 20230159.

#Legal #Taxation #Large Language Models #Paper #PDF

·royalsocietypublishing.org·Mar 31, 2024

Long-form factuality in large language models

#Large Language Models #Accuracy #Fact-checking #Paper #PDF

·arxiv.org·Mar 29, 2024

DBRX, the world’s most powerful open-source model, is now on You.com

#You com #Large Language Models #Chatbot

·us2.campaign-archive.com·Mar 29, 2024

Nexusflow on X: "Have we really squeezed out the capacity of a compact chat model? Thrilled to see our latest open model, Starling-7B, ranks 13th among all models in Chatbot Arena! 🚀 As a 7B model, Starling surpasses larger open and proprietary models, including Claude-2, GPT-3.5-Turbo, Gemini… https://t.co/Q6fWPj3b3z" / X

#Large Language Models #Opensource

·twitter.com·Mar 28, 2024

Introducing Jamba: AI21's Groundbreaking SSM-Transformer Model

#Large Language Models #AI21

·ai21.com·Mar 28, 2024

Language Models Can Reduce Asymmetry in Information Markets

#Economics #Paper #PDF #Large Language Models

·arxiv.org·Mar 27, 2024

Grady Booch on Twitter / X

#Planning #Large Language Models #Review #Criticism #Paper #PDF

·twitter.com·Mar 18, 2024

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

#Large Language Models #Multimodal #Apple #Paper #PDF

·arxiv.org·Mar 17, 2024

Apple Announces MM1: A Family of Multimodal LLMs Up To 30B Parameters that are SoTA in Pre-Training Metrics and Perform Competitively after Fine-Tuning

#Apple #Large Language Models #Multimodal

·marktechpost.com·Mar 17, 2024

Investigating Continual Pretraining in Large Language Models: Insights and Implications

#Machine Learning #Large Language Models #Paper #PDF #Cohere

·arxiv.org·Mar 15, 2024

Data Interpreter: An LLM Agent For Data Science

#Large Language Models #Data Science #Paper #PDF #Agents

·arxiv.org·Mar 14, 2024
