Attention in transformers, visually explained | Chapter 6, Deep Learning
Demystifying attention, the key mechanism inside transformers and LLMs. Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3...
A solid pattern to build LLM Applications (feat. Claude)
The thing about modern AI development - both developing things with AI and developing AI things - is that you often need to know the right magic incantations...
Building files-to-prompt entirely using Claude 3 Opus
files-to-prompt is a new tool I built to help me pipe several files at once into prompts to LLMs such as Claude and GPT-4. When combined with my LLM command-line …
Command R is a conversational model that excels in language tasks and supports multiple languages, making it ideal for coding use cases that require instruction models. It responds well to preambles that follow a specific structure and format, enhancing its performance.
nilsherzig/LLocalSearch: LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres...
‘Lavender’: The AI machine directing Israel’s bombing spree in Gaza
The Israeli army has marked tens of thousands of Gazans as suspects for assassination, using an AI targeting system with little human oversight and a permissive policy for casualties, +972 and Local Call reveal.
As time progresses, AI models are achieving higher reasoning accuracy while their associated costs continue to drastically decrease. What does it mean for our future?
But what is a GPT? Visual intro to Transformers | Deep learning, chapter 5
An introduction to transformers and their prerequisites. Early view of the next chapter for patrons: https://3b1b.co/early-attention Other recommended resources...
In this video I go through an example of building a Custom Crew with CrewAI and compare the Sequential vs Hierarchical process methods. CODE: Sequential Colab: h...
Heads up, Bay Area guys have already ditched their AVP and are buzzing about DSPy now. Could DSPy be the fresh go-to framework for prompt engineering after LangChain and LlamaIndex?
Running OCR against PDFs and images directly in your browser
I attended the Story Discovery At Scale data journalism conference at Stanford this week. One of the perennial hot topics at any journalism conference concerns data extraction: how can we …