Does Transformer Interpretability Transfer to RNNs?
AI race heats up as OpenAI, Google and Mistral release new models
Instead, LeCun suggested, researchers needed to work on what he called “objective-driven” AI with the ability to reason and plan about the world, rather than just work on words alone.
Codegemma report
OpenAI’s GPT Store Is Triggering Copyright Complaints
Introducing Command R+: A Scalable LLM Built for Business
ReALM: Reference Resolution As Language Modeling
Jamba: A Hybrid Transformer-Mamba Language Model
Nay, J. J., Karamardian, D., Lawsky, S. B., Tao, W., Bhat, M., Jain, R., ... & Kasai, J. (2024). Large language models as tax attorneys: a case study in legal capabilities emergence. Philosophical Transactions of the Royal Society A, 382(2270), 20230159.
Long-form factuality in large language models
DBRX, the world’s most powerful open-source model, is now on You.com
Nexusflow on X: "Have we really squeezed out the capacity of a compact chat model? Thrilled to see our latest open model, Starling-7B, ranks 13th among all models in Chatbot Arena! 🚀 As a 7B model, Starling surpasses larger open and proprietary models, including Claude-2, GPT-3.5-Turbo, Gemini… https://t.co/Q6fWPj3b3z" / X
Introducing Jamba: AI21's Groundbreaking SSM-Transformer Model
Language Models Can Reduce Asymmetry in Information Markets
Grady Booch on Twitter / X
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Apple Announces MM1: A Family of Multimodal LLMs Up To 30B Parameters that are SoTA in Pre-Training Metrics and Perform Competitively after Fine-Tuning
Investigating Continual Pretraining in Large Language Models: Insights and Implications
Download PDF
Data Interpreter: An LLM Agent For Data Science
Command-R: Retrieval Augmented Generation at Production Scale
Wolfram|Alpha LLM API: Reference & Documentation
Example of style guide prompt .
AI chatbots 'think' in English, research finds
The GPT-4 barrier has finally been broken
Introducing the next generation of Claude \ Anthropic
Large language models can do jaw-dropping things. But nobody knows exactly why.
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Download PDF
Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization
Recently published in Nature, https://www.nature.com/articles/s41591-024-02855-5
The CEO who believes Africans must make their own AI tools
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
Introducing Mistral-Large on Azure in partnership with Mistral AI | Microsoft Azure Blog
War Games With Words: LLMs Show Escalation Risk
Artificial Intelligence played Wargames. The result isn't reassuring. By Sabine Hossenfelder.