GitHub Next have coined the term "Continuous AI" to describe "all uses of automated AI to support software collaboration on any platform". It's intended as an echo of Continuous Integration …
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
Get started with 10Web and their AI Website Builder API: https://10web.io/website-builder-api/?utm_source=YouTube&utm_medium=Influencer&utm_campaign=TechWithTim
Today, you'll learn how to fine-tune LLMs in Python for use in Ollama. I'll walk you through it step by step, give you all the code and show you how to test it out.
DevLaunch is my mentorship program where I personally help developers go beyond tutorials, build real-world projects, and actually land jobs. No fluff. Just real accountability, proven strategies, and hands-on guidance. Learn more here - https://training.devlaunch.us/tim
⏳ Timestamps ⏳
00:00 | What is Fine-Tuning?
02:25 | Gathering Data
05:52 | Google Collab Setup
09:17 | Fine-Tuning with Unsloth
16:58 | Model Setup in Ollama
🎞 Video Resources 🎞
Code in this video: https://drive.google.com/drive/folders/1p4ZilsJsdxB5lH6ZBMdIEJBt0WVUMsDq?usp=sharing
Notebook Google Collab: https://colab.research.google.com/drive/1NsRGmHVupulRzsq9iUTx8V8WgTSpO_04?usp=sharing
Hashtags
#Python #Ollama #LLM
Learn how to fine‑tune Qwen‑3‑14B on your own data—with LoRA adapters, Unsloth’s 4‑bit quantization, and just 12 GB of VRAM—while preserving its chain‑of‑thought reasoning. I’ll walk you through dataset prep, the key hyper‑parameters that prevent catastrophic forgetting, and the exact Colab notebook to get you running in minutes. Build a lightweight, reasoning‑ready Qwen‑3 model tailored to your project today!
LINKS:
https://qwenlm.github.io/blog/qwen3/
https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs
https://huggingface.co/datasets/unsloth/OpenMathReasoning-mini
https://docs.unsloth.ai/basics/qwen3-how-to-run-and-fine-tune
https://huggingface.co/datasets/mlabonne/FineTome-100k
https://docs.unsloth.ai/get-started/fine-tuning-guide
https://arxiv.org/html/2308.08747v5
https://heidloff.net/article/efficient-fine-tuning-lora/
NOTEBOOK: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(14B)-Reasoning-Conversational.ipynb
Fine-tuning Playlist: https://www.youtube.com/playlist?list=PLVEEucA9MYhPjLFhcIoNxw8FkN28-ixAn
Website: https://engineerprompt.ai/
RAG Beyond Basics Course:
https://prompt-s-site.thinkific.com/courses/rag
Let's Connect:
🦾 Discord: https://discord.com/invite/t4eYQRUcXB
☕ Buy me a Coffee: https://ko-fi.com/promptengineering
|🔴 Patreon: https://www.patreon.com/PromptEngineering
💼Consulting: https://calendly.com/engineerprompt/consulting-call
📧 Business Contact: engineerprompt@gmail.com
Become Member: http://tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off).
Signup for Newsletter, localgpt:
https://tally.so/r/3y9bb0
Fine-Tuning Qwen-3 Models: Step-by-Step Guide
00:00 Introduction to Fine-Tuning Qwen-3
01:24 Understanding Catastrophic Forgetting and LoRa Adapters
03:06 Installing and Using unsloth for Fine-Tuning
04:19 Code Walkthrough: Preparing Your Dataset
07:14 Combining Reasoning and Non-Reasoning Datasets
09:48 Prompt Templates and Fine-Tuning
16:13 Inference and Hyperparameter Settings
18:11 Saving and Loading LoRa Adapters
Try out Warp 2.0 now, the current rank #1 AI on Terminal Bench, outperforming Claude Code: https://go.warp.dev/bycloud
You can also use code "BYCLOUD" to get Warp Pro for 1 month free. (limited for 1,000 redemptions)
My Newsletter
https://mail.bycloud.ai/
my project: find, discover & explain AI research semantically
https://findmypapers.ai/
My Patreon (get bundle access for my newsletter & findmypapers)
https://www.patreon.com/c/bycloud
Training language models to follow instructions with human feedback
[Paper] https://arxiv.org/abs/2203.02155
DeepSeek-R1 (Aha Moment)
[Paper] https://arxiv.org/abs/2501.12948
Understanding R1-Zero-Like Training: A Critical Perspective
[Paper] https://arxiv.org/pdf/2503.20783
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
[Paper] https://arxiv.org/abs/2504.13837
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
[Paper] https://arxiv.org/abs/2505.11711
Spurious Rewards: Rethinking Training Signals in RLVR
[Paper] https://arxiv.org/abs/2506.10947
Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI
This video is supported by the kind Patrons & YouTube Members:
🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa,
Toru Mon
[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] bycloud@smoothmedia.co
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Booga04
[Ko-fi] https://ko-fi.com/bycloudai
Gemma 3n + MLX-VLM: Run Deepmind's Game-Changing Open Source Multimodal Model on Your Mac!
🚀 GEMMA 3N + MLX-VLM: Run DeepMind's Revolutionary Multimodal Model on Your Mac!DeepMind just dropped Gemma 3n - their first multimodal model to achieve a 1...
My AI Workflow for Understanding Any Codebase | Peter Steinberger
A quick tip on how I use repo2txt and Google AI Studio to understand new codebases. Gemini's 1M token context window is perfect for asking questions about code.
Extremely consequential new open weights model release from Google today: Multimodal by design: Gemma 3n natively supports image, audio, video, and text inputs and text outputs. Optimized for on-device: Engineered …
Phoenix.new is Fly’s entry into the prompt-driven app development space
Here’s a fascinating new entrant into the AI-assisted-programming / coding-agents space by Fly.io, introduced on their blog in Phoenix.new – The Remote AI Runtime for Phoenix: describe an app in …
Anthropic have added one of the most important missing features to Claude Artifacts: apps built as artifacts now have the ability to run their own prompts against Claude via a …
The OpenElections project collects detailed election data for the USA, all the way down to the precinct level. This is a surprisingly hard problem: while county and state-level results are …
Agentic Misalignment: How LLMs could be insider threats
One of the most entertaining details in the Claude 4 system card concerned blackmail: We then provided it access to emails implying that (1) the model will soon be taken …