AI/ML

2274 bookmarks

Custom sorting

"I want Llama3 to perform 10x with my private knowledge" - Local Agentic RAG w/ llama3

Advanced RAG 101 - build agentic RAG with llama3Get free HubSpot report of how AI is redefining startup GTM strategy: https://clickhubspot.com/4hx🔗 Links- J...

#RAG #scraping #OCR

·youtube.com·Mar 24, 2025

"I want Llama3 to perform 10x with my private knowledge" - Local Agentic RAG w/ llama3

LlamaParse - LlamaIndex

#OCR #RAG

·docs.llamaindex.ai·Mar 24, 2025

LlamaParse - LlamaIndex

The Most Important Algorithm in Machine Learning

Shortform link: https://shortform.com/artem In this video we will talk about backpropagation – an algorithm powering the entire field of machine learning and try to derive it from first principles. OUTLINE: 00:00 Introduction 01:28 Historical background 02:50 Curve Fitting problem 06:26 Random vs guided adjustments 09:43 Derivatives 14:34 Gradient Descent 16:23 Higher dimensions 21:36 Chain Rule Intuition 27:01 Computational Graph and Autodiff 36:24 Summary 38:16 Shortform 39:20 Outro USEFUL RESOURCES: Andrej Karpathy's playlist: https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&si=zBUZW5kufVPLVy9E Jürgen Schmidhuber's blog on the history of backprop: https://people.idsia.ch/~juergen/who-invented-backpropagation.html CREDITS: Icons by https://www.freepik.com/

#learn #model training #math #tutorial

·youtube.com·Mar 22, 2025

The Most Important Algorithm in Machine Learning

Artificial Intelligence | Higher Education from Cambridge

Discover Artificial Intelligence, 3rd Edition, David L. Poole, HB ISBN: 9781009258197 on Higher Education from Cambridge

#book

·cambridge.org·Mar 21, 2025

Artificial Intelligence | Higher Education from Cambridge

MCP Package Registry | Model Context Protocol

A CLI that helps you easily install and manage Model Context Protocol Servers. Simple package management with comprehensive analytics and GitHub integration.

#agent

·mcp-get.com·Mar 20, 2025

MCP Package Registry | Model Context Protocol

punkpeye/awesome-mcp-servers: A collection of MCP servers.

A collection of MCP servers. Contribute to punkpeye/awesome-mcp-servers development by creating an account on GitHub.

#agent

·github.com·Mar 20, 2025

punkpeye/awesome-mcp-servers: A collection of MCP servers.

Claude MCP has Changed AI Forever - Here's What You NEED to Know

Everyone is starting to realize how big of a deal Claude’s Model Context Protocol (MCP) is - it’s the first ever “standard” for connecting LLMs with services like your database, Slack, GitHub, web search, etc. It’s VERY powerful and not well understood by many, so in this video I break down everything you need to know about MCP at a high level. I go quick here unlike my usual videos, but I call out a bunch of different resources you can use to dive into anything deeper that you’re curious about - MCP architecture, building your own MCP server, integrating your custom AI agent with MCP, etc. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Check out Stagehand, an incredible tool to crawl and scrape websites with natural language which I used in this video: https://github.com/browserbase/stagehand And here is the Stagehand MCP server that I showcased (you will need a Browserbase API key which is free to start!): https://github.com/browserbase/mcp-server-browserbase/blob/main/stagehand/README.md ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Documentation for Claude’s MCP: https://modelcontextprotocol.io/introduction List of MCP Servers on GitHub: https://github.com/modelcontextprotocol/servers Example n8n MCP Agent: https://github.com/coleam00/ottomator-agents/tree/main/n8n-mcp-agent n8n Community Node for MCP: https://github.com/nerding-io/n8n-nodes-mcp Example Pydantic AI MCP Agent: https://github.com/coleam00/ottomator-agents/tree/main/pydantic-ai-mcp-agent Dive deep into the architecture of MCP: https://modelcontextprotocol.io/docs/concepts/architecture ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 00:00 - MCP is Blowing Up 01:55 - What is MCP? 03:12 - Making MCP "Click" (Deep Dive with Diagrams) 05:33 - How Agents Work with MCP 07:16 - Word of Caution - What MCP Isn't 08:17 - Where You Can Use MCP 09:47 - MCP Servers You Can Use NOW 11:18 - How to Set Up MCP Servers 12:08 - Using MCP Servers in Claude Desktop 13:11 - MCP Demo in Claude Desktop (Brave + Stagehand) 14:09 - Building with MCP (Servers and Clients) 15:22 - Building Your Own MCP Server 18:09 - MCP with n8n AI Agents 20:10 - MCP with Python AI Agents 21:56 - The Future of MCP 23:51 - Outro ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Join me as I push the limits of what is possible with AI. I'll be uploading videos at least two times a week - Sundays and Wednesdays at 7:00 PM CDT!

#agent #scraping #tutorial

·youtube.com·Mar 20, 2025

Claude MCP has Changed AI Forever - Here's What You NEED to Know

modelcontextprotocol/create-python-server: Create a Python MCP server

Create a Python MCP server. Contribute to modelcontextprotocol/create-python-server development by creating an account on GitHub.

#server #agent

·github.com·Mar 20, 2025

modelcontextprotocol/create-python-server: Create a Python MCP server

MCP server: A step-by-step guide to building from scratch

In this comprehensive guide, the author discusses MCP, it's components, and takes a deep dive on how to build servers from scratch.

#agent #server #architecture #systems #tutorial

·composio.dev·Mar 20, 2025

MCP server: A step-by-step guide to building from scratch

My Thoughts on the Future of “AI”

Nicholas Carlini, previously deeply skeptical about the utility of LLMs, discusses at length his thoughts on where the technology might go. He presents compelling, detailed arguments for both ends of …

#philosophy

·simonwillison.net·Mar 19, 2025

My Thoughts on the Future of “AI”

How DeepSeek Rewrote the Transformer [MLA]

Thanks to KiwiCo for sponsoring today’s video! Go to https://www.kiwico.com/welchlabs and use code WELCHLABS for 50% off your first monthly club crate or for 20% off your first Panda Crate! MLA/DeepSeek Poster at 17:12 (Free shipping for a limited time with code DEEPSEEK): https://www.welchlabs.com/resources/mladeepseek-attention-poster-13x19 Limited edition MLA Poster and Signed Book: https://www.welchlabs.com/resources/deepseek-bundle-mla-poster-and-signed-book-limited-run Imaginary Numbers book is back in stock! https://www.welchlabs.com/resources/imaginary-numbers-book Special Thanks to Patrons https://www.patreon.com/c/welchlabs Juan Benet, Ross Hanson, Yan Babitski, AJ Englehardt, Alvin Khaled, Eduardo Barraza, Hitoshi Yamauchi, Jaewon Jung, Mrgoodlight, Shinichi Hayashi, Sid Sarasvati, Dominic Beaumont, Shannon Prater, Ubiquity Ventures, Matias Forti, Brian Henry, Tim Palade, Petar Vecutin, Nicolas baumann, Jason Singh, Robert Riley, vornska, Barry Silverman, Jake Ehrlich References DeepSeek-V2 paper: https://arxiv.org/pdf/2405.04434 DeepSeek-R1 paper: https://arxiv.org/abs/2501.12948 Great Article by Ege Erdil: https://epoch.ai/gradient-updates/how-has-deepseek-improved-the-transformer-architecture GPT-2 Visualizaiton: https://github.com/TransformerLensOrg/TransformerLens Manim Animations: https://github.com/stephencwelch/manim_videos Technical Notes 1. Note that DeepSeek-V2 paper claims a KV cache size reduction of 93.3%. They don’t exactly publish their methodology, but as far as I can tell it’s something likes this: start with Deepseek-v2 hyperparameters here: https://huggingface.co/deepseek-ai/DeepSeek-V2/blob/main/configuration_deepseek.py. num_hidden_layers=30, num_attention_heads=32, v_head_dim = 128. If DeepSeek-v2 was implemented with traditional MHA, then KV cache size would be 2*32*128*30*2=491,520 B/token. With MLA with a KV cache size of 576, we get a total cache size of 576*30=34,560 B/token. The percent reduction in KV cache size is then equal to (491,520-34,560)/492,520=92.8%. The numbers I present in this video follow the same approach but are for DeepSeek-v3/R1 architecture: https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/config.json. num_hidden_layers=61, num_attention_heads=128, v_head_dim = 128. So traditional MHA cache would be 2*128*128*61*2 = 3,997,696 B/token. MLA reduces this to 576*61*2=70,272 B/token. Tor the DeepSeek-V3/R1 architecture, MLA reduces the KV cache size by a factor of 3,997,696/70,272 =56.9X. 2. I claim a couple times that MLA allows DeepSeek to generate tokens more than 6x faster than a vanilla transformer. The DeepSeek-V2 paper claims a slightly less than 6x throughput improvement with MLA, but since the V3/R1 architecture is heavier, we expect a larger lift, which is why i claim “more than 6x faster than a vanilla transformer” - in reality it’s probably significantly more than 6x for the V3/R1 architecture. 3. In all attention patterns and walkthroughs, we’re ignoring the |beginning of sentence| token. “The American flag is red, white, and” actually maps to 10 tokens if we include this starting token, and may attention patterns do assign high values to this token. 4. We’re ignoring bias terms matrix equations. 5. We’re ignoring positional embeddings. These are fascinating. See DeepSeek papers and ROPE.

#tutorial

·youtube.com·Mar 19, 2025

How DeepSeek Rewrote the Transformer [MLA]

1.1 What is Artificial Intelligence? ‣ Chapter 1 Artificial Intelligence and Agents ‣ Artificial Intelligence: Foundations of Computational Agents, 3rd Edition

·artint.info·Mar 19, 2025

1.1 What is Artificial Intelligence? ‣ Chapter 1 Artificial Intelligence and Agents ‣ Artificial Intelligence: Foundations of Computational Agents, 3rd Edition

SmolDocling - The SmolOCR Solution?

In this video I look at SmolDocling and how it compares to the other OCR solutions that are out there, both open and proprietary. Blog: https://huggingface.c...

#OCR #vision #image

·youtube.com·Mar 18, 2025

SmolDocling - The SmolOCR Solution?

How to Build an In-N-Out Agent with OpenAI Agents SDK

In this video, I take a deeper dive look at the OpenAI Agents SDK and how it can be used to build a fast food agent. Colab: https://dripl.ink/MZw2R For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:11 Creating an In-N-Out Agent (Colab Demo) 00:40 In-N-Out Burger Agent 04:35 Streaming runs 05:40 Adding Tools 08:20 Websearch Tool 09:45 Agents as Tools 12:21 Giving it a Chat Memory

#agent #code #tutorial

·youtube.com·Mar 17, 2025

How to Build an In-N-Out Agent with OpenAI Agents SDK

Gemma 3: What You Need To Know - Gradient Flow

Gemma 3 represents Google’s approach to accessible AI, bridging the gap between cutting-edge research and practical application. While the Gemini family represents Google’s flagship, closed, and most powerful models, Gemma offers a lightweight, “open” counterpart designed for wider use and customization. Specifically, Gemma 3’s model weights are openly released, allowing developers to download, deploy, andContinue reading "Gemma 3: What You Need To Know"

#local model

·gradientflow.com·Mar 15, 2025

Gemma 3: What You Need To Know - Gradient Flow

Gemma 3 - The NEW Gemma Family Members Have Arrived!!!

In this video, I look at the release of the new Gemma 3 models, which come in four different flavors: a 1B, a 4B, a 12B, and the new Big 27B parameter model. Demo: https://huggingface.co/spaces/huggingface-projects/gemma-3-12b-it Blog: https://blog.google/technology/developers/gemma-3/?linkId=sam_witteveen Model Weights: https://huggingface.co/collections/google/gemma-3-release-67c6c6f89c4f76621268bb6d For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps:

#vision #OCR

·youtube.com·Mar 12, 2025

Gemma 3 - The NEW Gemma Family Members Have Arrived!!!

A Bear Case: My Predictions Regarding AI Progress — LessWrong

This isn't really a "timeline", as such – I don't know the timings – but this is my current, fairly optimistic take on where we're heading. …

·lesswrong.com·Mar 10, 2025

A Bear Case: My Predictions Regarding AI Progress — LessWrong

GetCyber - How to back up, downgrade, and restore Ollama on macOS without losing models or data

How to back up, downgrade, and restore Ollama on macOS without losing models or data

#local model #backup

·getcyber.me·Mar 9, 2025

GetCyber - How to back up, downgrade, and restore Ollama on macOS without losing models or data

DeepSeek-R1: Model Architecture

This article provides an in-depth exploration of the DeepSeek-R1 model architecture. Let’s trace DeepSeek-R1 model from input to the output…

#architecture

·shaktiwadekar.medium.com·Mar 7, 2025

DeepSeek-R1: Model Architecture

Mistral OCR - Multimodal & Multilingual OCR

In this video, I look at the latest release from Mistral AI, which is their Mistral OCR model. I look at how it works and how it compares to other models, as well as how you can get started using it with code. Colab: https://dripl.ink/Sr4Uk Blog: https://mistral.ai/news/mistral-ocr For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:17 Other models 00:35 Mistral OCR Blog 05:45 Mistral OCR Demo 13:47 Mistral OCR Batch inference

#OCR #vision

·youtube.com·Mar 7, 2025

Mistral OCR - Multimodal & Multilingual OCR

Can’t afford “Deep Research”? Me either. We don’t have to thanks to Ai2

I'm sure OpenAI's implementation of "deep research" is great, but I can't afford that. Ai2’s ScholarQA tool is FREE and open source!! Allen AI’s Scholar QA: https://scholarqa.allen.ai/ Please Like and Subscribe to support the channel! @LearnMetaAnalysis Access state of the art LLMs all in one place with ChatLLM – My 3 month review of ChatLLM: https://youtu.be/_Z3nLKvTbGc Tutorials and how-to guides: Connect a LLM to your Zotero (or any other local folder): https://youtu.be/b2BSZfOtD_w Conventional meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkEbYpBIgikgE0y9QR7QIgzs Three-level meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkHwRmu_TJXa7fSb6-WBXXoJ Three-level meta-analysis with correlated and hierarchical effects and robust variance estimation: https://www.youtube.com/playlist?list=PLXa5cTEormkEGenfcnp9X5dQUhmm7f9Jp Want free point and click (no coding required) meta-analysis software? Check out Simple Meta-Analysis: https://learnmeta-analysis.com/pages/simple-meta-analysis-software Tired of manually extracting data for systematic review and meta-analysis? Check out AI-Assisted Data Extraction, a free package for R! https://youtu.be/HuWXbe7hgFc Free ebook on meta-analysis in R (no download required): https://noah-schroeder.github.io/reviewbook/ Visit our website at https://learnmeta-analysis.com/ 0:00 OpenAI’s Deep Research 0:36 ScholarQA 1:26 First Test 11:49 Second Test 21:15 Debrief

#RAG #literature #knowledge base #deepresearch

·youtube.com·Mar 7, 2025

Can’t afford “Deep Research”? Me either. We don’t have to thanks to Ai2

SmolVLM2: Bringing Video Understanding to Every Device

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#vision #video

·huggingface.co·Mar 7, 2025

SmolVLM2: Bringing Video Understanding to Every Device

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#vision

·huggingface.co·Mar 7, 2025

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#vision

·huggingface.co·Mar 7, 2025

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

FastRTC: The Real-Time Communication Library for Python

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#voice #audio #python

·huggingface.co·Mar 7, 2025

FastRTC: The Real-Time Communication Library for Python

itsmostafa/inference-speed-tests: Local LLM inference speed tests on various devices

Local LLM inference speed tests on various devices - itsmostafa/inference-speed-tests

·github.com·Mar 7, 2025

itsmostafa/inference-speed-tests: Local LLM inference speed tests on various devices

Inference speed comparisons between M1 Pro and maxed-out M4 Max

I currently own a MacBook M1 Pro (32GB RAM, 16-core GPU) and now a maxed-out MacBook M4 Max (128GB RAM, 40-core GPU) and ran some inference speed...

·redditmedia.com·Mar 7, 2025

Inference speed comparisons between M1 Pro and maxed-out M4 Max

Hands on with Deep Research

Deep Research is the title of a new mode in several GenAI apps, including Google’s Gemini, OpenAI’s ChatGPT, and most recently, Perplexity. In this article, I will be focusing on the currently most hyped of these: OpenAI’s Deep Research. Although they weren’t first to release a product with this title (that was Google), they have […]

#search #deepresearch

·leonfurze.com·Mar 6, 2025

Hands on with Deep Research

Sumandora/remove-refusals-with-transformers: Implements harmful/harmless refusal removal using pure HF Transformers

Implements harmful/harmless refusal removal using pure HF Transformers - Sumandora/remove-refusals-with-transformers

#transformers #model training #fine tuning

·github.com·Mar 5, 2025

Sumandora/remove-refusals-with-transformers: Implements harmful/harmless refusal removal using pure HF Transformers

granite-snack-cookbook/recipes/RAG/Granite_Multimodal_RAG.ipynb at main · ibm-granite-community/granite-snack-cookbook

Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models - ibm-granite-community/granite-snack-cookbook

#RAG #vision

·github.com·Mar 5, 2025

granite-snack-cookbook/recipes/RAG/Granite_Multimodal_RAG.ipynb at main · ibm-granite-community/granite-snack-cookbook