Search AI/ML

Found 21 bookmarks

From shiny object to sober reality: The vector database story, two years later

#RAG #database #search #embedding #architecture

·venturebeat.com·Nov 19, 2025

Understanding Transformers Using A Minimal Example

Visualizing the internal state of a Transformer model

#transformers #tutorial #embedding

·rti.github.io·Sep 4, 2025

GitHub - apple/embedding-atlas: Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata. - apple/embedding-atlas

#embedding #search #visualization #data science

·github.com·Aug 12, 2025

LLM Embeddings Explained: A Visual and Intuitive Guide - a Hugging Face Space by hesamation

This app explains how language models transform text into meaningful representations through embeddings. It provides a visual guide to help you understand traditional and modern language model tech...

#embedding #transformers #tutorial

·huggingface.co·Jul 30, 2025

RetrievalTutorials/tutorials/LevelsOfTextSplitting/5_Levels_Of_Text_Splitting.ipynb at main · FullStackRetrieval-com/RetrievalTutorials

Contribute to FullStackRetrieval-com/RetrievalTutorials development by creating an account on GitHub.

#RAG #search #embedding

·github.com·Jul 9, 2025

Evaluating Chunking Strategies for Retrieval | Chroma Research

#RAG #embedding #search #benchmark

·research.trychroma.com·Jul 9, 2025

Vector Search RAG Tutorial – Combine Your Data with LLMs with Advanced Search

Learn how to use vector search and embeddings to easily combine your data with large language models like GPT-4. You will first learn the concepts and then create three projects. ✏️ Course developed by Beau Carnes. 💻 Code: https://github.com/beaucarnes/vector-search-tutorial 🔗 Access MongoDB Atlas: https://cloud.mongodb.com/ 🏗️ MongoDB provided a grant to make this course possible. ⭐️ Contents ⭐️ ⌨️ (00:00) Introduction ⌨️ (01:18) What are vector embeddings? ⌨️ (02:39) What is vector search? ⌨️ (03:40) MongoDB Atlas vector search ⌨️ (04:30) Project 1: Semantic search for movie database ⌨️ (32:55) Project 2: RAG with Atlas Vector Search, LangChain, OpenAI ⌨️ (54:36) Project 3: Chatbot connected to your documentation 🎉 Thanks to our Champion and Sponsor supporters: 👾 davthecoder 👾 jedi-or-sith 👾 南宮千影 👾 Agustín Kussrow 👾 Nattira Maneerat 👾 Heather Wcislo 👾 Serhiy Kalinets 👾 Justin Hual 👾 Otis Morgan 👾 Oscar Rahnama -- Learn to code for free and get a developer job: https://www.freecodecamp.org Read hundreds of articles on programming: https://freecodecamp.org/news ❤️ Support for this channel comes from our friends at Scrimba – the coding platform that's reinvented interactive learning: https://scrimba.com/freecodecamp

#RAG #embedding #search #tutorial

·youtube.com·Jun 9, 2025

The Best Embedding Models for Information Retrieval in 2025 | DataStax

Learn how the latest and greatest embedding models stack up against each other, as well as against some open source competition.

#embedding #search #text

·datastax.com·Jun 9, 2025

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

We release the Qwen3 Embedding series, a new set of models in the Qwen model family. These models are specifically designed for text embedding, retrieval, and reranking tasks, built on the Qwen3 foundation model. Leveraging Qwen3’s robust multilingual text understanding capabilities, the series achieves state-of-the-art performance across multiple benchmarks for text embedding and reranking tasks. We have open-sourced this series of text embedding and reranking models under the Apache 2.

#embedding #text #search

·qwenlm.github.io·Jun 9, 2025

Qwen3 Embedding

New family of embedding models from Qwen, in three sizes: 0.6B, 4B, 8B - and two categories: Text Embedding and Text Reranking. The full collection can be browsed on Hugging …

#embedding #transformers #text

·simonwillison.net·Jun 8, 2025

An Intro to RAG with sqlite-vec & llamafile!

A brief introduction to using llamafile (a single-file tool for working with large language models) and sqlite-vec (a SQLite extension for vector search) to build a Retrieval-Augmented Generation (RAG) application. This was a live online event hosted on Dec 17th 2024 in the Mozilla AI Discord; join us for the next event at https://discord.gg/Ve7WeCJFXk LINKS: - Doc w/ links to all mentioned projects/blog posts: https://docs.google.com/document/d/17GYLzlGUyJF9EDeaa1P-dFFZnkwxATnBcg5KnNgpvPE/edit?usp=sharing - Slides: https://docs.google.com/presentation/d/14Szda-VnZzepL-1U9Nb7sXQg_TTf56OQ-KtUIMQ5xug/edit?usp=sharing

#RAG #database #search #embedding #local model

·youtube.com·Jun 8, 2025

Olow304/memvid: Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. - Olow304/memvid

#video #embedding #search #local model

·github.com·Jun 7, 2025

Qwen 3 Embeddings & Rerankers

In this video I look at the new release from Qwen of their new Embedding and Reranking models, which are state of the art and most importantly open weights mo...

#embedding #RAG

·youtube.com·Jun 6, 2025

asg017/sqlite-vec: A vector search SQLite extension that runs anywhere!

A vector search SQLite extension that runs anywhere! - asg017/sqlite-vec

#database #RAG #embedding #search

·github.com·Jun 5, 2025

How sqlite-vec Works for Storing and Querying Vector Embeddings

Learn how `sqlite-vec` turns SQLite into a fast, embedded vector search engine. With support for float32, int8, and bit vectors, optimized distance metrics, and native SQL integration, it's ideal for offline AI, semantic search, and lightweight ML apps. This post walks through how it works and why it's surprisingly powerful.

#RAG #tutorial #embedding

·dev.to·May 20, 2025

Finding the Best Open-Source Embedding Model for RAG

Looking for the best open-source embedding model for your RAG app? We share a comparison workflow so you can stop paying the OpenAI tax.

#RAG #embedding #tutorial

·medium.com·May 20, 2025

How to improve the local LLM connected to Zotero for stunning results. So easy even I can do it.

Learn how to make simple changes that help your LLM chat with Zotero like a pro! I’m getting well-written, well-cited results from a 2b parameter LLM. Please Like and Subscribe to support the channel! @LearnMetaAnalysis Embedding result testing: https://docs.google.com/spreadsheets/d/1P3rOLEO_NtCUYxaFIVaVZfMv4BOkQb3w/edit?usp=sharing&ouid=111617079417577058774&rtpof=true&sd=true Granite 3.1 Dense is my favorite LLM for this setup right now; it's available in 2b and 8b versions for ollama - https://ollama.com/library/granite3.1-dense:2b Snowflake Arctic Embed 2 has performed well for me so far as an embedding model: https://ollama.com/library/snowflake-arctic-embed2 MTEB leaderboard to see what embedding models perform well at different tasks: https://huggingface.co/spaces/mteb/leaderboard How to connect an LLM to Zotero - https://youtu.be/b2BSZfOtD_w I generally prefer local, private LLMs, but if you need large SOTA models like ChatGPT, Claude, Deepseek, Gemini, or Grok, check out ChatLLM - My 3 month review of ChatLLM: https://youtu.be/_Z3nLKvTbGc Tutorials and how-to guides: Conventional meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkEbYpBIgikgE0y9QR7QIgzs Three-level meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkHwRmu_TJXa7fSb6-WBXXoJ Three-level meta-analysis with correlated and hierarchical effects and robust variance estimation: https://www.youtube.com/playlist?list=PLXa5cTEormkEGenfcnp9X5dQUhmm7f9Jp Want free point and click (no coding required) meta-analysis software? Check out Simple Meta-Analysis: https://learnmeta-analysis.com/pages/simple-meta-analysis-software Tired of manually extracting data for systematic review and meta-analysis? Check out AI-Assisted Data Extraction, a free package for R! https://youtu.be/HuWXbe7hgFc Free ebook on meta-analysis in R (no download required): https://noah-schroeder.github.io/reviewbook/ Visit our website at https://learnmeta-analysis.com/ 0:15 Knowledge 0:59 Help make this better 1:32 Modify ‘knowledge’ settings 5:46 Demo of results 7:22 Top K 11:25 Testing Different embeddings 13:25 Use # not models 14:45 Impatient people (like me!) start here 21:38 Example Results

#RAG #local model #zotero #embedding

·youtube.com·Feb 27, 2025

Introducing Contextual Retrieval

Here's an interesting new embedding/RAG technique, described by Anthropic but it should work for any embedding model against any other LLM. One of the big challenges in implementing semantic search …

#RAG #embedding

·simonwillison.net·Sep 20, 2024

Finetuning Open-Source LLMs

This video offers a quick dive into the world of finetuning Large Language Models (LLMs). This video covers - common usage scenarios for pretrained LLMs- par...

#fine tuning #model training #embedding

·youtube.com·Jan 20, 2024

Execute Jina embeddings with a CLI using llm-embed-jina

Berlin-based Jina AI just released a new family of embedding models, boasting that they are the “world’s first open-source 8K text embedding model” and that they rival OpenAI’s text-embedding-ada-002 in …

#embedding

·simonwillison.net·Oct 26, 2023

GitHub - Dicklesworthstone/llama_embeddings_fastapi_service

Contribute to Dicklesworthstone/llama_embeddings_fastapi_service development by creating an account on GitHub.

#embedding

·github.com·Aug 17, 2023
