Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models.
For example, you might observe that asking ChatGPT the same question multiple times provides different results. This by itself is not surprising, since getting a result from a language model involves “sampling”, a process that converts the language model’s output into a probability distribution and probabilistically selects a token.
What might be more surprising is that even when we adjust the temperature down to 0 (thus making the sampling theoretically deterministic, since the LLM always chooses the highest-probability token, a strategy known as greedy sampling), LLM APIs are still not deterministic in practice (see past discussions here, here, or here). Even when running inference on your own hardware with an OSS inference library like vLLM or SGLang, sampling still isn’t deterministic (see here or here).
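To make the terminology concrete, here is a minimal Python sketch (the function name is mine, not from the linked posts) of greedy sampling, together with the floating-point non-associativity that the linked discussions point to as an underlying source of the nondeterminism:

```python
import numpy as np

def greedy_sample(logits):
    """Temperature-0 ("greedy") sampling: always return the argmax token."""
    return int(np.argmax(logits))

# Greedy sampling itself is deterministic given identical logits:
logits = np.array([0.1, 2.5, -1.0, 1.3])
assert greedy_sample(logits) == 1

# But the logits feeding it may not be identical across runs:
# floating-point addition is not associative, so performing the same
# reduction in a different order can produce slightly different values.
a = (0.1 + 0.2) + 0.3
b = 0.1 + (0.2 + 0.3)
print(a == b)  # False: 0.6000000000000001 vs 0.6
```

If two otherwise-identical forward passes accumulate their sums in different orders (e.g. because of different batch sizes or kernel schedules), two near-tied logits can swap rank, and the "deterministic" argmax picks a different token.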
Agentic Design Patterns: A Hands-On Guide to Building Intelligent Systems, by Antonio Gulli — table of contents (424 pages total).
CaMeL offers a promising new direction for mitigating prompt injection attacks
In the two and a half years that we’ve been talking about prompt injection attacks I’ve seen alarmingly little progress towards a robust solution. The new paper Defeating Prompt Injections …
I may be late to the party, but LangGraph lets you build complex workflow architectures and codify them as powerful automations. You can bring LLMs into the mix if you want, but you don’t have to!
Posts in the 'LLM from scratch' category on Giles Thomas’s blog. Insights on AI, startups, software development, and technical projects, drawn from 30 years of experience.
Enhanced Agentic-RAG: What If Chatbots Could Deliver Near-Human Precision? | Uber Blog
Genie is Uber’s internal on-call copilot, designed to provide real-time support for thousands of queries across multiple help channels in Slack®. It enables users to receive prompt responses with proper citations from Uber’s internal documentation. It also improves the productivity of on-call engineers and subject matter experts (SMEs) by reducing the effort required to address common, ad-hoc queries. While Genie streamlines the development of an LLM-powered on-call Slack bot, ensuring the accuracy and relevance of its responses remains a significant challenge. This blog details our efforts to improve Genie’s answer quality to near-human precision, allowing SMEs to rely on it for most queries without concern over potential misinformation in the engineering security and privacy domain.
LLM function calls don't scale; code orchestration is simpler, more effective.
One common practice for working with MCP tool calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM for the next step. ...
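The loop described above can be sketched in a few lines of Python. This is a toy illustration, not a real MCP client: `call_llm` and `run_tool` are hypothetical stand-ins (here stubbed with a hard-coded "model" and a single `add` tool) so the control flow is runnable end to end.

```python
def run_tool(name, args):
    # Hypothetical tool registry with one arithmetic tool.
    tools = {"add": lambda a: a["x"] + a["y"]}
    return str(tools[name](args))

def call_llm(messages):
    # Toy stand-in for a model call: request the "add" tool if no tool
    # result is present yet, otherwise answer from the last tool output.
    if messages[-1]["role"] != "tool":
        return {"tool_call": {"name": "add", "args": {"x": 2, "y": 3}}}
    return {"tool_call": None,
            "content": f"The result is {messages[-1]['content']}"}

def agent_loop(user_request):
    """Feed each tool output back to the LLM as a message until it
    stops requesting tools and produces a final answer."""
    messages = [{"role": "user", "content": user_request}]
    while True:
        reply = call_llm(messages)
        if reply.get("tool_call") is None:
            return reply["content"]
        tool = reply["tool_call"]
        result = run_tool(tool["name"], tool["args"])
        messages.append({"role": "assistant", "content": "", "tool_call": tool})
        messages.append({"role": "tool", "content": result})

print(agent_loop("What is 2 + 3?"))  # -> The result is 5
```

The key property of the pattern is that every tool output re-enters the conversation as a message, which is exactly what the post above argues stops scaling once outputs get large.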
AI Agents are kinda crazy ngl.
Learn how to design, build, and ship AI agents in 2025 with this interactive step-by-step guide. Resources and short descriptions are attached to each roadmap item, so you can find everything you want to learn in one place.