AI/ML

AI/ML

2478 bookmarks
Custom sorting
Turn ANY Website into LLM Knowledge in SECONDS
Turn ANY Website into LLM Knowledge in SECONDS
One of the biggest challenges we face with LLMs is their knowledge is too general and limited for anything new. That’s why RAG is such a huge topic when it comes to AI right now - it’s a method for providing an LLM with external knowledge you curate so it can become an expert at something it wasn’t before - a specific AI framework, your ecommerce store, you name it. The problem is, that “curate” step can be very difficult and slow. That is where Crawl4AI comes in! Crawl4AI is an open source web crawling framework specifically designed for scraping websites and formatting the output in the BEST possible way for an LLM to understand. The best part is it solves a LOT of problems we typically have with systems that crawl websites - usually they are slow, resource intensive, and complicated. But Crawl4AI is VERY fast, intuitive, easy to set up, and extremely memory efficient. In this video, I show you how to use Crawl4AI to super easily crawl websites for LLMs in just seconds, and at the end I even show you a RAG AI agent I’ve built to be a “Pydantic AI” framework expert using Crawl4AI to build the knowledgebase. And you could really take this and use it for any website you want. Next video I'll do a deep dive into this agent! ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Register now for the oTTomator AI Agent Hackathon with a $6,000 prize pool! https://studio.ottomator.ai/hackathon/register All code for this Crawl4AI RAG Agent can be found here: https://github.com/coleam00/ottomator-agents/tree/main/crawl4AI-agent Crawl4AI GitHub: https://github.com/unclecode/crawl4ai ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 00:00 - The Beauty of Crawl4AI 02:16 - Why Crawl4AI? 05:25 - Basic Crawl4AI Example - Single Page Crawl 06:56 - Crawling Multiple Pages 08:58 - Ethics of Web Scraping 10:01 - Crawling Multiple Pages Continued 12:24 - FAST Parallel Page Crawling 15:19 - Crawl4AI RAG AI Agent 17:48 - Outro ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Join me as I push the limits of what is possible with AI. I'll be uploading videos at least two times a week - Sundays and Wednesdays at 7:00 PM CDT! Sundays and Wednesdays are for everything AI, focusing on providing insane and practical educational value. I will also post sometimes on Fridays at 7:00 PM CDT - specifically for platform showcases - sometimes sponsored, always creative in approach!
·youtube.com·
Turn ANY Website into LLM Knowledge in SECONDS
ChatGPT reveals the system prompt for ChatGPT Tasks
ChatGPT reveals the system prompt for ChatGPT Tasks
OpenAI just started rolling out [Scheduled tasks in ChatGPT](https://help.openai.com/en/articles/10291617-scheduled-tasks-in-chatgpt), a new feature where you can say things like "Remind me to write the tests in five minutes" and ChatGPT will …
·simonwillison.net·
ChatGPT reveals the system prompt for ChatGPT Tasks
Pydantic AI + DeepSeek V3 - The BEST AI Agent Combo
Pydantic AI + DeepSeek V3 - The BEST AI Agent Combo
I get asked a lot what my process looks like for building AI agents, so I recently kicked off a mini series showing my entire process! In this series, we’ll build AI agent that can consume entire GitHub repositories so you can ask it questions about all the code in the repo. In this video (3rd one in the series), I show you how to take an AI agent prototype built with n8n and turn it into a full custom coded agent EASILY with Pydantic AI. We’ll also use DeepSeek V3 for the LLM so it’s super powerful and still dirt cheap! Keep in mind that the n8n prototype is optional - this can very much be a standalone Pydantic AI guide. The best LLM or agent framework could change in a month. I keep this guide high level (while still covering technical details) so there is a lot to get out of this even if you aren't using Pydantic AI or DeepSeek V3 for your LLM. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Register now for the oTTomator AI Agent Hackathon with a $6,000 prize pool! https://studio.ottomator.ai/hackathon/register Try the n8n version of this GitHub agent now on the Live Agent Studio (Pydantic version coming soon): https://studio.ottomator.ai All code for this Pydantic GitHub agent can be found here: https://github.com/coleam00/ottomator-agents/tree/main/pydantic-github-agent And the n8n version of this agent: https://github.com/coleam00/ottomator-agents/tree/main/n8n-github-assistant Pydantic AI documentation: https://ai.pydantic.dev/ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 00:00 - Intro 02:04 - Where We are in the AI Agent Roadmap 04:11 - n8n Prototype - Our Blueprint 06:46 - Live Agent Studio GitHub Agent 07:24 - Pydantic AI's Beautiful Docs 10:28 - Agent Code Overview 11:23 - Building our Pydantic AI Agent 21:23 - Building the Agent Chat Tooling 25:17 - Testing our Agent 28:23 - Outro ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Join me as I push the limits of what is possible with AI. I'll be uploading videos at least two times a week - Sundays and Wednesdays at 7:00 PM CDT! Sundays and Wednesdays are for everything AI, focusing on providing insane and practical educational value. I will also post sometimes on Fridays at 7:00 PM CDT - specifically for platform showcases - sometimes sponsored, always creative in approach!
·youtube.com·
Pydantic AI + DeepSeek V3 - The BEST AI Agent Combo
LLM CLI Notes and Twitter Headless ScreenShot | John Maeda’s Blog
LLM CLI Notes and Twitter Headless ScreenShot | John Maeda’s Blog
To get going from: https://github.com/simonw/llm brew install llm llm keys set openai llm "count to three" # uses default of 3.5-turbo llm chat -m 4o llm models llm install llm-claude-3 llm keys set claude llm models llm -m claude-3-5-sonnet-latest "count to three" llm logs llm --system 'respond in json' "How…
·maeda.pm·
LLM CLI Notes and Twitter Headless ScreenShot | John Maeda’s Blog
Import AI 395: AI and energy demand; distributed training via DeMo; and Phi-4
Import AI 395: AI and energy demand; distributed training via DeMo; and Phi-4
Welcome to Import AI, a newsletter about AI research. Import AI runs on lattes, ramen, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now AI is driving a mass…
·jack-clark.net·
Import AI 395: AI and energy demand; distributed training via DeMo; and Phi-4
𝗼𝟯 “𝗔𝗥𝗖 𝗔𝗚𝗜” 𝗽𝗼𝘀𝘁𝗺𝗼𝗿𝘁𝗲𝗺 𝗺𝗲𝗴𝗮𝘁𝗵𝗿𝗲𝗮𝗱: 𝘄𝗵𝘆 𝘁𝗵𝗶𝗻𝗴𝘀 𝗴𝗼𝘁 𝗵𝗲𝗮𝘁𝗲𝗱, 𝘄𝗵𝗮𝘁 𝘄𝗲𝗻𝘁 𝘄𝗿𝗼𝗻𝗴, 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘁 𝗮𝗹𝗹 𝗺𝗲𝗮𝗻𝘀
𝗼𝟯 “𝗔𝗥𝗖 𝗔𝗚𝗜” 𝗽𝗼𝘀𝘁𝗺𝗼𝗿𝘁𝗲𝗺 𝗺𝗲𝗴𝗮𝘁𝗵𝗿𝗲𝗮𝗱: 𝘄𝗵𝘆 𝘁𝗵𝗶𝗻𝗴𝘀 𝗴𝗼𝘁 𝗵𝗲𝗮𝘁𝗲𝗱, 𝘄𝗵𝗮𝘁 𝘄𝗲𝗻𝘁 𝘄𝗿𝗼𝗻𝗴, 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘁 𝗮𝗹𝗹 𝗺𝗲𝗮𝗻𝘀
Kevin Roose, of Hard Fork and NYT, was so impressed with OpenAI’s rollout that he joked “of course they have to announce AGI the day my vacation starts”.
·garymarcus.substack.com·
𝗼𝟯 “𝗔𝗥𝗖 𝗔𝗚𝗜” 𝗽𝗼𝘀𝘁𝗺𝗼𝗿𝘁𝗲𝗺 𝗺𝗲𝗴𝗮𝘁𝗵𝗿𝗲𝗮𝗱: 𝘄𝗵𝘆 𝘁𝗵𝗶𝗻𝗴𝘀 𝗴𝗼𝘁 𝗵𝗲𝗮𝘁𝗲𝗱, 𝘄𝗵𝗮𝘁 𝘄𝗲𝗻𝘁 𝘄𝗿𝗼𝗻𝗴, 𝗮𝗻𝗱 𝘄𝗵𝗮𝘁 𝗶𝘁 𝗮𝗹𝗹 𝗺𝗲𝗮𝗻𝘀
Things we learned about LLMs in 2024
Things we learned about LLMs in 2024
A lot has happened in the world of Large Language Models over the course of 2024. Here’s a review of things we figured out about the field in the past …
·simonwillison.net·
Things we learned about LLMs in 2024
4. The Ollama Course - Using the CLI
4. The Ollama Course - Using the CLI
Welcome back to the Ollama course! In this video, we dive deep into the command line interface (CLI) of Ollama, exploring all the powerful options and comman...
·youtube.com·
4. The Ollama Course - Using the CLI
Excerpt from a message I just posted in a #diaspora team internal f...
Excerpt from a message I just posted in a #diaspora team internal f...
Excerpt from a message I just posted in a #diaspora team internal forum category. The context here is that I recently get pinged by slowness/load spikes on the diaspora* project web infrastructure (Discourse, Wiki, the project website, ...), and looking at the traffic logs makes me impressively angry. In the last 60 days, the diaspora* web assets received 11.3 million requests. That equals to 2.19 req/s - which honestly isn't that much. I mean, it's more than your average personal blog, but nothing that my infrastructure shouldn't be able to handle. However, here's what's grinding my fucking gears. Looking at the top user agent statistics, there are the leaders: 2.78 million requests - or 24.6% of all traffic - is coming from Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; GPTBot/1.2; +https://openai.com/gptbot). 1.69 million reuqests - 14.9% - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/600.2.5 (KHTML, like Gecko) Version/8.0.2 Safari/600.2.5 (Amazonb...
·pod.geraspora.de·
Excerpt from a message I just posted in a #diaspora team internal f...
Project update + 2 significant news stories
Project update + 2 significant news stories
Trump threatens 100% tariffs on Brics nations over dollar currency rivalry; Severe flooding displaces over 122,000 in Malaysia
·newsletter.newsminimalist.com·
Project update + 2 significant news stories
About | News Minimalist
About | News Minimalist
News Minimalist is the AI curator that finds the 1% of stories actually worth reading. Experience only important news without junk, clickbait, or ads.
·newsminimalist.com·
About | News Minimalist
Brandon-c-tech/RAG-logger: RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lightweight, open-source alternative to LangSmith, focusing on RAG-specific logging needs.
Brandon-c-tech/RAG-logger: RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lightweight, open-source alternative to LangSmith, focusing on RAG-specific logging needs.
RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lightweight, open-source alternative to LangSmith, focusing on ...
·github.com·
Brandon-c-tech/RAG-logger: RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lightweight, open-source alternative to LangSmith, focusing on RAG-specific logging needs.