Found 1995 bookmarks
How DeepSeek Rewrote the Transformer [MLA]
Thanks to KiwiCo for sponsoring today’s video! Go to https://www.kiwico.com/welchlabs and use code WELCHLABS for 50% off your first monthly club crate or for 20% off your first Panda Crate! MLA/DeepSeek Poster at 17:12 (Free shipping for a limited time with code DEEPSEEK): https://www.welchlabs.com/resources/mladeepseek-attention-poster-13x19 Limited edition MLA Poster and Signed Book: https://www.welchlabs.com/resources/deepseek-bundle-mla-poster-and-signed-book-limited-run Imaginary Numbers book is back in stock! https://www.welchlabs.com/resources/imaginary-numbers-book Special Thanks to Patrons https://www.patreon.com/c/welchlabs Juan Benet, Ross Hanson, Yan Babitski, AJ Englehardt, Alvin Khaled, Eduardo Barraza, Hitoshi Yamauchi, Jaewon Jung, Mrgoodlight, Shinichi Hayashi, Sid Sarasvati, Dominic Beaumont, Shannon Prater, Ubiquity Ventures, Matias Forti, Brian Henry, Tim Palade, Petar Vecutin, Nicolas baumann, Jason Singh, Robert Riley, vornska, Barry Silverman, Jake Ehrlich References DeepSeek-V2 paper: https://arxiv.org/pdf/2405.04434 DeepSeek-R1 paper: https://arxiv.org/abs/2501.12948 Great Article by Ege Erdil: https://epoch.ai/gradient-updates/how-has-deepseek-improved-the-transformer-architecture GPT-2 Visualization: https://github.com/TransformerLensOrg/TransformerLens Manim Animations: https://github.com/stephencwelch/manim_videos
Technical Notes
1. The DeepSeek-V2 paper claims a KV cache size reduction of 93.3%. They don’t publish their exact methodology, but as far as I can tell it’s something like this: start with the DeepSeek-V2 hyperparameters here: https://huggingface.co/deepseek-ai/DeepSeek-V2/blob/main/configuration_deepseek.py (num_hidden_layers=30, num_attention_heads=32, v_head_dim=128). If DeepSeek-V2 were implemented with traditional MHA, the KV cache size would be 2*32*128*30*2 = 491,520 B/token. With MLA and a per-layer cache of 576 entries per token, the total cache size is 576*30*2 = 34,560 B/token. The percent reduction in KV cache size is then (491,520 - 34,560)/491,520 ≈ 93.0%. The numbers I present in this video follow the same approach but use the DeepSeek-V3/R1 architecture: https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/config.json (num_hidden_layers=61, num_attention_heads=128, v_head_dim=128). A traditional MHA cache would be 2*128*128*61*2 = 3,997,696 B/token; MLA reduces this to 576*61*2 = 70,272 B/token. For the DeepSeek-V3/R1 architecture, MLA therefore shrinks the KV cache by a factor of 3,997,696/70,272 ≈ 56.9X.
2. I claim a couple of times that MLA allows DeepSeek to generate tokens more than 6x faster than a vanilla transformer. The DeepSeek-V2 paper reports a slightly-less-than-6x throughput improvement with MLA, but since the V3/R1 architecture is heavier, we expect a larger lift; that is why I claim “more than 6x faster than a vanilla transformer.” In reality it’s probably significantly more than 6x for the V3/R1 architecture.
3. In all attention patterns and walkthroughs, we’re ignoring the beginning-of-sentence token. “The American flag is red, white, and” actually maps to 10 tokens if we include this starting token, and many attention patterns assign high values to it.
4. We’re ignoring bias terms in the matrix equations.
5. We’re ignoring positional embeddings. These are fascinating; see the DeepSeek papers and RoPE.
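The cache arithmetic in note 1 can be sketched in a few lines. This is a minimal sketch, not DeepSeek's published code: the helper names are mine, and it assumes 2 bytes per cached value (bf16/fp16) and a 576-entry MLA latent per layer, as in the notes above.

```python
BYTES_PER_VALUE = 2  # bf16/fp16

def mha_cache_bytes_per_token(layers, heads, head_dim):
    # Traditional MHA caches both K and V for every head in every layer.
    return 2 * heads * head_dim * layers * BYTES_PER_VALUE

def mla_cache_bytes_per_token(layers, latent_dim=576):
    # MLA caches one compressed latent vector per layer instead.
    return latent_dim * layers * BYTES_PER_VALUE

# DeepSeek-V2: 30 layers, 32 heads, head_dim 128
v2_mha = mha_cache_bytes_per_token(30, 32, 128)   # 491,520 B/token
v2_mla = mla_cache_bytes_per_token(30)            # 34,560 B/token
print(f"V2 reduction: {(v2_mha - v2_mla) / v2_mha:.1%}")

# DeepSeek-V3/R1: 61 layers, 128 heads, head_dim 128
v3_mha = mha_cache_bytes_per_token(61, 128, 128)  # 3,997,696 B/token
v3_mla = mla_cache_bytes_per_token(61)            # 70,272 B/token
print(f"V3/R1 compression factor: {v3_mha / v3_mla:.1f}x")
```

Running this reproduces the numbers above: roughly a 93.0% reduction for V2 and about a 56.9x smaller cache for V3/R1.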
·youtube.com·
SmolDocling - The SmolOCR Solution?
In this video I look at SmolDocling and how it compares to the other OCR solutions that are out there, both open and proprietary. Blog: https://huggingface.c...
·youtube.com·
How to Build an In-N-Out Agent with OpenAI Agents SDK
In this video, I take a deeper dive look at the OpenAI Agents SDK and how it can be used to build a fast food agent. Colab: https://dripl.ink/MZw2R For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:11 Creating an In-N-Out Agent (Colab Demo) 00:40 In-N-Out Burger Agent 04:35 Streaming runs 05:40 Adding Tools 08:20 Websearch Tool 09:45 Agents as Tools 12:21 Giving it a Chat Memory
·youtube.com·
Gemma 3: What You Need To Know - Gradient Flow
Gemma 3 represents Google’s approach to accessible AI, bridging the gap between cutting-edge research and practical application. While the Gemini family represents Google’s flagship, closed, and most powerful models, Gemma offers a lightweight, “open” counterpart designed for wider use and customization. Specifically, Gemma 3’s model weights are openly released, allowing developers to download, deploy, and…
·gradientflow.com·
Gemma 3 - The NEW Gemma Family Members Have Arrived!!!
In this video, I look at the release of the new Gemma 3 models, which come in four different flavors: a 1B, a 4B, a 12B, and the new Big 27B parameter model. Demo: https://huggingface.co/spaces/huggingface-projects/gemma-3-12b-it Blog: https://blog.google/technology/developers/gemma-3/?linkId=sam_witteveen Model Weights: https://huggingface.co/collections/google/gemma-3-release-67c6c6f89c4f76621268bb6d For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps:
·youtube.com·
DeepSeek-R1: Model Architecture
This article provides an in-depth exploration of the DeepSeek-R1 model architecture. Let’s trace DeepSeek-R1 model from input to the output…
·shaktiwadekar.medium.com·
Mistral OCR - Multimodal & Multilingual OCR
In this video, I look at the latest release from Mistral AI, which is their Mistral OCR model. I look at how it works and how it compares to other models, as well as how you can get started using it with code. Colab: https://dripl.ink/Sr4Uk Blog: https://mistral.ai/news/mistral-ocr For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:17 Other models 00:35 Mistral OCR Blog 05:45 Mistral OCR Demo 13:47 Mistral OCR Batch inference
·youtube.com·
Can’t afford “Deep Research”? Me either. We don’t have to thanks to Ai2
I'm sure OpenAI's implementation of "deep research" is great, but I can't afford that. Ai2’s ScholarQA tool is FREE and open source!! Allen AI’s Scholar QA: https://scholarqa.allen.ai/ Please Like and Subscribe to support the channel! @LearnMetaAnalysis Access state of the art LLMs all in one place with ChatLLM – My 3 month review of ChatLLM: https://youtu.be/_Z3nLKvTbGc Tutorials and how-to guides: Connect a LLM to your Zotero (or any other local folder): https://youtu.be/b2BSZfOtD_w Conventional meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkEbYpBIgikgE0y9QR7QIgzs Three-level meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkHwRmu_TJXa7fSb6-WBXXoJ Three-level meta-analysis with correlated and hierarchical effects and robust variance estimation: https://www.youtube.com/playlist?list=PLXa5cTEormkEGenfcnp9X5dQUhmm7f9Jp Want free point and click (no coding required) meta-analysis software? Check out Simple Meta-Analysis: https://learnmeta-analysis.com/pages/simple-meta-analysis-software Tired of manually extracting data for systematic review and meta-analysis? Check out AI-Assisted Data Extraction, a free package for R! https://youtu.be/HuWXbe7hgFc Free ebook on meta-analysis in R (no download required): https://noah-schroeder.github.io/reviewbook/ Visit our website at https://learnmeta-analysis.com/ 0:00 OpenAI’s Deep Research 0:36 ScholarQA 1:26 First Test 11:49 Second Test 21:15 Debrief
·youtube.com·
Hands on with Deep Research
Deep Research is the title of a new mode in several GenAI apps, including Google’s Gemini, OpenAI’s ChatGPT, and most recently, Perplexity. In this article, I will be focusing on the currently most hyped of these: OpenAI’s Deep Research. Although they weren’t first to release a product with this title (that was Google), they have […]
·leonfurze.com·
DeepSeek R1 + Sonnet
I’m a big fan of Claude Sonnet. I’m ashamed to admit it’s mostly vibes-based. It’s friendlier and writes code in a way that I like. The…
·medium.com·
Wolfram LLM Benchmarking Project
Results from Wolfram's ongoing tracking of LLM performance. The benchmark is based on a Wolfram Language code generation task.
·wolfram.com·
olmOCR - The Open OCR System
In this video, I look at olmOCR, the open OCR system from Allen AI. Colab: https://dripl.ink/HpaK4 Blog: https://olmocr.allenai.org/blog macOS ver: https://jonathansoma.com/words/olmocr-on-macos-with-lm-studio.html For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes 👨‍💻Github: https://github.com/samwit/llm-tutorials ⏱️Time Stamps: 00:00 Intro 00:31 Allen AI Blog 01:20 olmOCR Blog 02:08 olmOCR Hugging Face 04:52 olmOCR GitHub 05:41 Demo 05:59 Running olmOCR on macOS with LM Studio
·youtube.com·