Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models.
For example, you might observe that asking ChatGPT the same question multiple times provides different results. This by itself is not surprising, since getting a result from a language model involves “sampling”, a process that converts the language model’s output into a probability distribution and probabilistically selects a token.
What might be more surprising is that even when we adjust the temperature down to 0 (greedy sampling, where the LLM always chooses the highest-probability token, thus making the sampling theoretically deterministic), LLM APIs are still not deterministic in practice (see past discussions here, here, or here). Even when running inference on your own hardware with an OSS inference library like vLLM or SGLang, sampling still isn’t deterministic (see here or here).
A very common question I see about LLMs concerns why they can't be made to deliver the same response to the same prompt by setting a fixed random number seed. …
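For illustration only, a toy sample_token helper (my own sketch, not the sampler any of these APIs or libraries actually run) shows why temperature 0 is nominally deterministic, and why fixing a seed only pins down the random draw, not the logits it draws from:

```python
import numpy as np

def sample_token(logits, temperature=1.0, seed=None):
    """Toy next-token sampler: temperature 0 degenerates to greedy (argmax)."""
    if temperature == 0.0:
        # Greedy sampling: always pick the highest-probability token,
        # so no randomness is involved at all.
        return int(np.argmax(logits))
    # Softmax over temperature-scaled logits, then draw one token.
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())
    probs /= probs.sum()
    rng = np.random.default_rng(seed)
    return int(rng.choice(len(logits), p=probs))

logits = np.array([2.0, 1.0, 0.5])
print(sample_token(logits, temperature=0.0))           # always token 0
print(sample_token(logits, temperature=0.7, seed=42))  # seeded draw: reproducible only
                                                       # if the logits are bit-identical
```

The catch, and the point of the post above, is that in real serving stacks the logits themselves can differ run to run (batching, kernel scheduling, floating-point non-associativity), so neither temperature 0 nor a fixed seed is enough on its own.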
Will Amazon S3 Vectors Kill Vector Databases—or Save Them? - Zilliz blog
AWS S3 Vectors aims for 90% cost savings for vector storage. But will it kill vectordbs like Milvus? A deep dive into costs, limits, and the future of tiered storage.
GitHub - Varietyz/Disciplined-AI-Software-Development: This methodology provides a structured approach for collaborating with AI systems on software development projects. It addresses common issues like code bloat, architectural drift, and context dilution through systematic constraints and validation checkpoints.
GPT-5 Thinking in ChatGPT (aka Research Goblin) is shockingly good at search
“Don’t use chatbots as search engines” was great advice for several years... until it wasn’t. I wrote about how good OpenAI’s o3 was at using its Bing-backed search tool back …
Learn how to create custom chat modes in VS Code for GitHub Copilot to enhance your workflow in large, complex projects with specialized AI configurations.
Introducing the Awesome GitHub Copilot Customizations repo - Microsoft for Developers
Today we’re excited to announce the launch of the Awesome GitHub Copilot Customizations repo! The Awesome Copilot repo is a community-driven resource with custom instructions, reusable prompts, and custom chat modes that helps you get consistent AI assistance. In other words, Awesome Copilot helps you get the most out of GitHub Copilot by letting you tailor it […]
Testing VLMs and LLMs for robotics w/ the Jetson Thor devkit
Exploring the Jetson Thor devkit w/ some local LLMs and VLMs. More info on the Jetson Thor Devkit: https://nvda.ws/45xIU4B Neural Networks from Scratch book: h...
The GPT-5 launch was uh, rough. A lot went wrong here, and I want to talk about what really happened... Thank you Kilo Code for sponsoring! Check them out at:...
What makes Claude Code so damn good (and how to recreate that magic in your agent)!?
Claude Code is the most delightful AI agent/workflow I have used so far. Not only does it make targeted edits or vibe coding throwaway tools less annoying, ...
too many model context protocol servers and LLM allocations on the dance floor
This blog post intends to be a definitive guide to context engineering fundamentals from the perspective of an engineer who builds commercial coding assistants and harnesses for a living.
Just two weeks ago, I was back over in San Francisco, and there was a big event on Model Context Protocol
too many model context protocol servers and LLM allocations on the dance floor
Useful reminder from Geoffrey Huntley of the infrequently discussed but significant token cost of using MCP. Geoffrey estimates that the usable context window of something like Amp or Cursor is around …
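A back-of-envelope sketch of where that cost comes from (every number below is a placeholder I've assumed for illustration, not a figure from Geoffrey, Amp, or Cursor): each connected MCP server injects its tool names, descriptions, and JSON schemas into the prompt, so the overhead scales multiplicatively with servers and tools.

```python
# Back-of-envelope sketch of how MCP tool definitions eat into a context window.
# All numbers are hypothetical placeholders, not measurements from any real setup.
context_window = 200_000        # assumed total context window, in tokens
num_mcp_servers = 10            # assumed number of connected MCP servers
tools_per_server = 15           # assumed tools exposed by each server
tokens_per_tool_schema = 400    # assumed tokens per tool name + description + JSON schema

tool_overhead = num_mcp_servers * tools_per_server * tokens_per_tool_schema
print(f"Tool definitions: {tool_overhead:,} tokens "
      f"(~{tool_overhead / context_window:.0%} of a {context_window:,}-token window)")
# Tool definitions: 60,000 tokens (~30% of a 200,000-token window)
```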
AI Agents Need Data Integrity - Schneier on Security
Think of the Web as a digital territory with its own social contract. In 2014, Tim Berners-Lee called for a “Magna Carta for the Web” to restore the balance of power between individuals and institutions. This mirrors the original charter’s purpose: ensuring that those who occupy a territory have a meaningful stake in its governance. Web 3.0—the distributed, decentralized Web of tomorrow—is finally poised to change the Internet’s dynamic by returning ownership to data creators. This will change many things about what’s often described as the “CIA triad” of ...