What We Learned from a Year of Building with LLMs (Part I)
AI/ML
GitHub - simonw/llm-mlc: LLM plugin for running models using MLC
LLM plugin for running models using MLC.
GitHub - taketwo/llm-ollama: LLM plugin providing access to local Ollama models using HTTP API
LLM plugin providing access to local Ollama models using HTTP API - taketwo/llm-ollama
Tensor Labbet · A blog of deep learnings
What kind of music is this?
"This collection appears to be primarily alternati…" Go see Molmo's answer!
Hacker plants false memories in ChatGPT to steal user data in perpetuity
Emails, documents, and other untrusted content can plant malicious memories.
What I’ve Learned in the Past Year Spent Building an AI Video Editor - Make Art with Python
Lessons from An Unexpected Year in AI
How Chain-of-Thought Reasoning Helps Neural Networks Compute | Quanta Magazine
Large language models do better at solving problems when they show their work. Researchers are beginning to understand why.
EvolutionaryScale
ESM3. Enabling scientists to understand, imagine, and create proteins.
GitHub - divelab/DIG: A library for graph deep learning research
A library for graph deep learning research.
Taking a closer look at AI’s supposed energy apocalypse
AI is just one small part of data centers’ soaring energy use.
Curiosity - AI search for everything
The ultimate AI productivity app that protects your privacy. Bring all your apps and data into one AI-powered search and assistant. Get it for you and for your teams today.
GitHub - thiswillbeyourgithub/WDoc: Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable, under development
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable, under development - thiswillbeyourgithub/WDoc
How streaming LLM APIs work | Simon Willison’s TILs
I decided to have a poke around and see if I could figure out how the HTTP streaming APIs from the various hosted LLM providers actually worked. Here are my notes so far.
Building A GPT-Style LLM Classifier From Scratch
Finetuning a GPT Model for Spam Classification
Introducing Contextual Retrieval \ Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
GitHub - ictnlp/LLaMA-Omni: LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level. - ictnlp/LLaMA-Omni
Sourcetable | Your AI Data Analyst
Sourcetable is an AI spreadsheet that helps you analyze data and create reports. Chat with your data, create charts and graphs, build financial models, + more.
qhjqhj00/MemoRAG: Empowering RAG with a memory-based data interface for all-purpose applications!
Empowering RAG with a memory-based data interface for all-purpose applications! - qhjqhj00/MemoRAG
Introducing Contextual Retrieval
Here's an interesting new embedding/RAG technique, described by Anthropic but it should work for any embedding model against any other LLM. One of the big challenges in implementing semantic search …
Generative ML in chemistry is bottlenecked by synthesis — LessWrong
Introduction Every single time I design a protein — using ML or otherwise — I am confident that it is capable of being manufactured. I simply reach o…
Reverse engineering OpenAI’s o1
What productionizing test-time compute shows us about the future of AI. Exploration has landed in language model training.
dleemiller/WordLlama: Things you can do with the token embeddings of an LLM
Things you can do with the token embeddings of an LLM - dleemiller/WordLlama
Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
Reader-LM-0.5B and Reader-LM-1.5B are two novel small language models inspired by Jina Reader, designed to convert raw, noisy HTML from the open web into clean markdown.
AI chatbots might be better at swaying conspiracy theorists than humans
Co-author Gordon Pennycook: "The work overturns a lot of how we thought about conspiracies."
Notes on OpenAI’s new o1 chain-of-thought models
OpenAI released two major new preview models today: o1-preview and o1-mini (that mini one is also a preview, despite the name)—previously rumored as having the codename “strawberry”. There’s a lot …
files-to-prompt 0.3
New version of my `files-to-prompt` CLI tool for turning a bunch of files into a prompt suitable for piping to an LLM, [described here previously](https://simonwillison.net/2024/Apr/8/files-to-prompt/). It now has a `-c/--cxml` …
Announcing The Assistant | Kagi Blog
Yes, the rumours are true! Kagi has been thoughtfully integrating AI into our search experience, creating a smarter, faster, and more intuitive search.
Start at 80% done on any writing, thinking, or creative task.
Build Spirals with your voice, personality, and style to instantly start at 80% done on any repeat writing, thinking, or decision-making task.
Post-apocalyptic education
What comes after the Homework Apocalypse