Transformers

Deep Dive
A Visual Guide to Reasoning LLMs
How do we create LLMs that can reason? Exploring Test-Time Compute Techniques and DeepSeek-R1.
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning
We introduce THINK-AND-EXECUTE, a framework that performs reasoning with a pseudocode that contains the common logical structure of a given task.
DeepSeek R1
Fuck You, Show Me The Prompt. –
Quickly understand inscrutable LLM frameworks by intercepting API calls.
https://mitmproxy.org/
The Problem with Reasoners
A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team
What are LLMs?
The Most Dangerous Thing An AI Startup Can Do Is Build For Other AI Startups
How Codeium went from 0 to $10m in ten months, What enterpriseready.io got wrong. A comprehensive braindump on how to be Enterprise Infra Native!
2312.10997v5.pdf
Transformer Explainer: LLM Transformer Model Visually Explained
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
Hierarchical Navigable Small Worlds (HNSW) | Pinecone
New LLM Pre-training and Post-training Paradigms
New LLM Pre-training and Post-training Paradigms: A Look at How Moderns LLMs Are Trained
Alex Strick van Linschoten - How to think about creating a dataset for LLM finetuning evaluation
I summarise the kinds of evaluations that are needed for a structured data generation task.
Applied LLMs - What We’ve Learned From A Year of Building with LLMs
A practical guide to building successful LLM products, covering the tactical, operational, and strategic.
An interview with the most prolific jailbreaker of ChatGPT and other leading LLMs
Pliny the Prompter has been finding ways to jailbreak, or remove the prohibitions and restrictions on leading LLMs, since last year.
How LLMs Work, Explained Without Math
I'm sure you agree that it has become impossible to ignore Generative AI (GenAI), as we are constantly bombarded with mainstream news about Large Language Models (LLMs). Very likely you have tried…
What I've Learned Building Interactive Embedding Visualizations
How We Saved 10s of Thousands of Dollars Deploying Low Cost Open Source AI Technologies At Scale with Kubernetes
Scaling up generative AI operations can be costly. At OpenSauced, we faced this challenge while building StarSearch, until we found a low cost solution to deploy an OpenAI-compatible API using open source technology.
But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning
Unpacking how large language models work under the hoodEarly view of the next chapter for patrons: https://3b1b.co/early-attentionSpecial thanks to these sup...
How Intuit data analysts write SQL 2x faster with internal GenAI tool
Reporting on the productivity impact of SQL generation with generative AI.
How we built Text-to-SQL at Pinterest
Adam Obeng | Data Scientist, Data Platform Science; J.C. Zhong | Tech Lead, Analytics Platform; Charlie Gu | Sr. Manager, Engineering