Found 2208 bookmarks
Newest
Continuous AI
Continuous AI
GitHub Next have coined the term "Continuous AI" to describe "all uses of automated AI to support software collaboration on any platform". It's intended as an echo of Continuous Integration …
·simonwillison.net·
Continuous AI
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
Get started with 10Web and their AI Website Builder API: https://10web.io/website-builder-api/?utm_source=YouTube&utm_medium=Influencer&utm_campaign=TechWithTim Today, you'll learn how to fine-tune LLMs in Python for use in Ollama. I'll walk you through it step by step, give you all the code and show you how to test it out. DevLaunch is my mentorship program where I personally help developers go beyond tutorials, build real-world projects, and actually land jobs. No fluff. Just real accountability, proven strategies, and hands-on guidance. Learn more here - https://training.devlaunch.us/tim ⏳ Timestamps ⏳ 00:00 | What is Fine-Tuning? 02:25 | Gathering Data 05:52 | Google Collab Setup 09:17 | Fine-Tuning with Unsloth 16:58 | Model Setup in Ollama 🎞 Video Resources 🎞 Code in this video: https://drive.google.com/drive/folders/1p4ZilsJsdxB5lH6ZBMdIEJBt0WVUMsDq?usp=sharing Notebook Google Collab: https://colab.research.google.com/drive/1NsRGmHVupulRzsq9iUTx8V8WgTSpO_04?usp=sharing Hashtags #Python #Ollama #LLM
·youtube.com·
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
QWEN-3: EASIEST WAY TO FINE-TUNE WITH REASONING 🙌
QWEN-3: EASIEST WAY TO FINE-TUNE WITH REASONING 🙌
Learn how to fine‑tune Qwen‑3‑14B on your own data—with LoRA adapters, Unsloth’s 4‑bit quantization, and just 12 GB of VRAM—while preserving its chain‑of‑thought reasoning. I’ll walk you through dataset prep, the key hyper‑parameters that prevent catastrophic forgetting, and the exact Colab notebook to get you running in minutes. Build a lightweight, reasoning‑ready Qwen‑3 model tailored to your project today! LINKS: https://qwenlm.github.io/blog/qwen3/ https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs https://huggingface.co/datasets/unsloth/OpenMathReasoning-mini https://docs.unsloth.ai/basics/qwen3-how-to-run-and-fine-tune https://huggingface.co/datasets/mlabonne/FineTome-100k https://docs.unsloth.ai/get-started/fine-tuning-guide https://arxiv.org/html/2308.08747v5 https://heidloff.net/article/efficient-fine-tuning-lora/ NOTEBOOK: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(14B)-Reasoning-Conversational.ipynb Fine-tuning Playlist: https://www.youtube.com/playlist?list=PLVEEucA9MYhPjLFhcIoNxw8FkN28-ixAn Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: engineerprompt@gmail.com Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 Fine-Tuning Qwen-3 Models: Step-by-Step Guide 00:00 Introduction to Fine-Tuning Qwen-3 01:24 Understanding Catastrophic Forgetting and LoRa Adapters 03:06 Installing and Using unsloth for Fine-Tuning 04:19 Code Walkthrough: Preparing Your Dataset 07:14 Combining Reasoning and Non-Reasoning Datasets 09:48 Prompt Templates and Fine-Tuning 16:13 Inference and Hyperparameter Settings 18:11 Saving and Loading LoRa Adapters
·youtube.com·
QWEN-3: EASIEST WAY TO FINE-TUNE WITH REASONING 🙌
The LLM's RL Revelation We Didn't See Coming
The LLM's RL Revelation We Didn't See Coming
Try out Warp 2.0 now, the current rank #1 AI on Terminal Bench, outperforming Claude Code: https://go.warp.dev/bycloud You can also use code "BYCLOUD" to get Warp Pro for 1 month free. (limited for 1,000 redemptions) My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon (get bundle access for my newsletter & findmypapers) https://www.patreon.com/c/bycloud Training language models to follow instructions with human feedback [Paper] https://arxiv.org/abs/2203.02155 DeepSeek-R1 (Aha Moment) [Paper] https://arxiv.org/abs/2501.12948 Understanding R1-Zero-Like Training: A Critical Perspective [Paper] https://arxiv.org/pdf/2503.20783 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? [Paper] https://arxiv.org/abs/2504.13837 Reinforcement Learning Finetunes Small Subnetworks in Large Language Models [Paper] https://arxiv.org/abs/2505.11711 Spurious Rewards: Rethinking Training Signals in RLVR [Paper] https://arxiv.org/abs/2506.10947 Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] bycloud@smoothmedia.co [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Booga04 [Ko-fi] https://ko-fi.com/bycloudai
·youtube.com·
The LLM's RL Revelation We Didn't See Coming
Build and share AI-powered apps with Claude
Build and share AI-powered apps with Claude
Anthropic have added one of the most important missing features to Claude Artifacts: apps built as artifacts now have the ability to run their own prompts against Claude via a …
·simonwillison.net·
Build and share AI-powered apps with Claude
Nxtscape
Nxtscape
Nxtscape is a browser that is built for productivity and privacy.
·nxtscape.ai·
Nxtscape
How OpenElections Uses LLMs
How OpenElections Uses LLMs
The OpenElections project collects detailed election data for the USA, all the way down to the precinct level. This is a surprisingly hard problem: while county and state-level results are …
·simonwillison.net·
How OpenElections Uses LLMs