From shiny object to sober reality: The vector database story, two years later
Understanding Transformers Using A Minimal Example
Visualizing the internal state of a Transformer model
GitHub - apple/embedding-atlas: Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata. - apple/embedding-atlas
LLM Embeddings Explained: A Visual and Intuitive Guide - a Hugging Face Space by hesamation
This app explains how language models transform text into meaningful representations through embeddings. It provides a visual guide to help you understand traditional and modern language model tech...
RetrievalTutorials/tutorials/LevelsOfTextSplitting/5_Levels_Of_Text_Splitting.ipynb at main · FullStackRetrieval-com/RetrievalTutorials
Contribute to FullStackRetrieval-com/RetrievalTutorials development by creating an account on GitHub.
Evaluating Chunking Strategies for Retrieval | Chroma Research
Vector Search RAG Tutorial – Combine Your Data with LLMs with Advanced Search
Learn how to use vector search and embeddings to easily combine your data with large language models like GPT-4. You will first learn the concepts and then create three projects.
✏️ Course developed by Beau Carnes.
💻 Code: https://github.com/beaucarnes/vector-search-tutorial
🔗 Access MongoDB Atlas: https://cloud.mongodb.com/
🏗️ MongoDB provided a grant to make this course possible.
⭐️ Contents ⭐️
⌨️ (00:00) Introduction
⌨️ (01:18) What are vector embeddings?
⌨️ (02:39) What is vector search?
⌨️ (03:40) MongoDB Atlas vector search
⌨️ (04:30) Project 1: Semantic search for movie database
⌨️ (32:55) Project 2: RAG with Atlas Vector Search, LangChain, OpenAI
⌨️ (54:36) Project 3: Chatbot connected to your documentation
🎉 Thanks to our Champion and Sponsor supporters:
👾 davthecoder
👾 jedi-or-sith
👾 南宮千影
👾 Agustín Kussrow
👾 Nattira Maneerat
👾 Heather Wcislo
👾 Serhiy Kalinets
👾 Justin Hual
👾 Otis Morgan
👾 Oscar Rahnama
--
Learn to code for free and get a developer job: https://www.freecodecamp.org
Read hundreds of articles on programming: https://freecodecamp.org/news
❤️ Support for this channel comes from our friends at Scrimba – the coding platform that's reinvented interactive learning: https://scrimba.com/freecodecamp
The Best Embedding Models for Information Retrieval in 2025 | DataStax
Learn how the latest and greatest embedding models stack up against each other, as well as against some open source competition.
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
GITHUB HUGGING FACE MODELSCOPE DISCORD
We release Qwen3 Embedding series, a new proprietary model of the Qwen model family. These models are specifically designed for text embedding, retrieval, and reranking tasks, built on the Qwen3 foundation model. Leveraging Qwen3’s robust multilingual text understanding capabilities, the series achieves state-of-the-art performance across multiple benchmarks for text embedding and reranking tasks. We have open-sourced this series of text embedding and reranking models under the Apache 2.
Qwen3 Embedding
New family of embedding models from Qwen, in three sizes: 0.6B, 4B, 8B - and two categories: Text Embedding and Text Reranking. The full collection can be browsed on Hugging …
An Intro to RAG with sqlite-vec & llamafile!
A brief introduction to using llamafile (a single-file tool for working with large language models) and sqlite-vec (A SQLite extension for vector search) to build a Retrival Augmentation Generation (RAG) application.
This was a live online event hosted on Dec 17th 2024 in the Mozilla AI Discord, join us for the next event at at https://discord.gg/Ve7WeCJFXk
LINKS:
- Doc w/ links to all mentioned projects/blog posts: https://docs.google.com/document/d/17GYLzlGUyJF9EDeaa1P-dFFZnkwxATnBcg5KnNgpvPE/edit?usp=sharing
- Slides: https://docs.google.com/presentation/d/14Szda-VnZzepL-1U9Nb7sXQg_TTf56OQ-KtUIMQ5xug/edit?usp=sharing
Olow304/memvid: Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. - Olow304/memvid
Qwen 3 Embeddings & Rerankers
In this video I look at the new release from Qwen of their new Embedding and Reranking models which are start of the art and most importantly open weights mo...
asg017/sqlite-vec: A vector search SQLite extension that runs anywhere!
A vector search SQLite extension that runs anywhere! - asg017/sqlite-vec
How sqlite-vec Works for Storing and Querying Vector Embeddings
Learn how `sqlite-vec` turns SQLite into a fast, embedded vector search engine. With support for float32, int8, and bit vectors, optimized distance metrics, and native SQL integration, it's ideal for offline AI, semantic search, and lightweight ML apps. This post walks through how it works and why it's surprisingly powerful.
Finding the Best Open-Source Embedding Model for RAG
Looking for the best open-source embedding model for your RAG app? We share a comparison workflow so you can stop paying the OpenAI tax.
How to improve the local LLM connected to Zotero for stunning results. So easy even I can do it.
Learn how to make simple changes that help your LLM chat with Zotero like a pro! I’m getting well written, well-cited results from a 2b parameter LLM.
Please Like and Subscribe to support the channel! @LearnMetaAnalysis
Embedding result testing: https://docs.google.com/spreadsheets/d/1P3rOLEO_NtCUYxaFIVaVZfMv4BOkQb3w/edit?usp=sharing&ouid=111617079417577058774&rtpof=true&sd=true
Granite 3.1 Dense is my favorite LLM for this setup right now, it's available in 2b and 8b versions for ollama - https://ollama.com/library/granite3.1-dense:2b
Snowflake Arctic Embed 2 has performed well for me so far as an embedding model: https://ollama.com/library/snowflake-arctic-embed2
MTEB leaderboard to see what embedding models perform well at different tasks: https://huggingface.co/spaces/mteb/leaderboard
How to connect a LLM to Zotero - https://youtu.be/b2BSZfOtD_w
I generally prefer local, private LLMs, but if you need large SOTA models like ChatGPT, Claude, Deepseek, Gemini, or Grok, check out ChatLLM - My 3 month review of ChatLLM: https://youtu.be/_Z3nLKvTbGc
Tutorials and how-to guides:
Conventional meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkEbYpBIgikgE0y9QR7QIgzs
Three-level meta-analysis: https://www.youtube.com/playlist?list=PLXa5cTEormkHwRmu_TJXa7fSb6-WBXXoJ
Three-level meta-analysis with correlated and hierarchical effects and robust variance estimation: https://www.youtube.com/playlist?list=PLXa5cTEormkEGenfcnp9X5dQUhmm7f9Jp
Want free point and click (no coding required) meta-analysis software? Check out Simple Meta-Analysis: https://learnmeta-analysis.com/pages/simple-meta-analysis-software
Tired of manually extracting data for systematic review and meta-analysis? Check out AI-Assisted Data Extraction, a free package for R! https://youtu.be/HuWXbe7hgFc
Free ebook on meta-analysis in R (no download required): https://noah-schroeder.github.io/reviewbook/
Visit our website at https://learnmeta-analysis.com/
0:15 Knowledge
0:59 Help make this better
1:32 Modify ‘knowledge’ settings
5:46 Demo of results
7:22 Top K
11:25 Testing Different embeddings
13:25 Use # not models
14:45 Impatient people (like me!) start here
21:38 Example Results
Introducing Contextual Retrieval
Here's an interesting new embedding/RAG technique, described by Anthropic but it should work for any embedding model against any other LLM. One of the big challenges in implementing semantic search …
Finetuning Open-Source LLMs
This video offers a quick dive into the world of finetuning Large Language Models (LLMs). This video covers - common usage scenarios for pretrained LLMs- par...
Execute Jina embeddings with a CLI using llm-embed-jina
Berlin-based Jina AI just released a new family of embedding models, boasting that they are the “world’s first open-source 8K text embedding model” and that they rival OpenAI’s text-embedding-ada-002 in …
GitHub - Dicklesworthstone/llama_embeddings_fastapi_service
Contribute to Dicklesworthstone/llama_embeddings_fastapi_service development by creating an account on GitHub.