Apple Intelligence On Device LLM Details
Learn AI
The AI Backend
The AI Backend * work in progress, please provide feedback so we can improve Just like in 1995 it was obvious that every business needs an internet presence to stay competitive, in 2024 it's obvious that every software needs intelligence to stay competitive. Software products generally have 3 c...
Applied LLMs - What We’ve Learned From A Year of Building with LLMs
A practical guide to building successful LLM products.
Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 · karpathy/llm.c · Discussion #481
Let's reproduce the GPT-2 (124M) in llm.c (~4,000 lines of C/CUDA) in 90 minutes for $20. The 124M model is the smallest model in the GPT-2 series released by OpenAI in 2019, and is actually quite ...
The Surprising Power of Next Word Prediction: Large Language Models Explained, Part 1 | Center for Security and Emerging Technology
Large language models (LLMs), the technology that powers generative artificial intelligence (AI) products like ChatGPT or Google Gemini, are often thought of as chatbots that predict the next word. But that isn't the full story of what LLMs are and how they work. This is the first blog post in a three-part series explaining some key elements of how LLMs function. This blog post covers pre-training—the process by which LLMs learn to predict the next word—and why it’s so surprisingly powerful.
A Survey of Techniques for Maximizing LLM Performance
Join us for a comprehensive survey of techniques designed to unlock the full potential of Language Model Models (LLMs). Explore strategies such as fine-tunin...
OpenAI Platform
Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.
neuml/txtai: 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows - neuml/txtai
A High-level Overview of Large Language Models - Borealis AI
Gain valuable insights into essential topics such as LLM training, prompt engineering, concerns, applications, and more.
The Expanding Dark Forest and Generative AI
Proving you're a human on a web flooded with generative AI content
AI Engineering: From Agents to LLM OS (plus demos from AI Engineer Singapore meetup)
i gave a talk at the recent AI Eng Singapore meetup (https://www.latent.space/p/community, scroll down) about the past 1 year in agents thinking and the bui...
metaskills/experts at labnotes.org
Experts.js is the easiest way to create and deploy OpenAI's Assistants and link them together as Tools to create advanced Multi AI Agent Systems with expanded memory and attention to detail. - ...
RAG - jxnl.co
Notes about my hobbies and other things I find interesting.
How to build your own Perplexity for any dataset - Learnings from building “Ask Hacker Search”
How does something like Perplexity work, and how do we make our own? And having done that, what turned out to be the most interesting or challenging parts?
karpathy/minbpe: Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. - karpathy/minbpe
naklecha/llama3-from-scratch: llama3 implementation one matrix multiplication at a time
llama3 implementation one matrix multiplication at a time - naklecha/llama3-from-scratch
VRSEN/agency-swarm: An opensource agent orchestration framework built on top of the latest OpenAI Assistants API.
An opensource agent orchestration framework built on top of the latest OpenAI Assistants API. - VRSEN/agency-swarm
Why I'm Staying Away from Crew AI: My Honest Opinion
Crew AI is not suitable for production use cases. I’ll be going through why I believe this is the case and what you should do instead when building your own ...
LLMs vs. SLMs: The Differences in Large & Small Language Models | Splunk
Today’s language models are powering ChatGPT and other popular AIs. Learn the differences between LLMs and SLMs in this detailed article.
Beginner's Guide to RAG
SingleStore Webinar
Machine Unlearning in 2024
Practical Introduction to Natural Language Processing
Transform yourself from a Python Developer to a Junior NLP Data Scientist with practical projects.
How to Build High-Accuracy Serverless RAG Using Amazon Bedrock and Kendra on AWS
Serverless RAG on AWS — Amazon Bedrock, Amazon Kendra, AWS Lambda, Claude-2, LangChain, and Streamlit.
AmazonKendraRetriever
lyzr/lyzr/utils/chat_utils.py at a3325f58a086ef9610b9c08995c8bf86b777d408 · LyzrCore/lyzr
Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes. - LyzrCore/lyzr
Building RAG at 5 different levels
The unreasonable effectiveness of embeddingsOr how I learned to stop worrying and love the hallucinations.This week I dived deep into vector databases. My go...
Musings on Building a Generative AI Product
When evaluating RAG, it's not just the final answer that is worth evaluating
Evaluating intermediate steps (like query rephrasing and retrieved documents) is also super important when trying to make sure RAG works
Goo tutorial by on how to do this!
— Harrison Chase (@hwchase17)
Building Agentic RAG with LlamaIndex - DeepLearning.AI
Learn how to build an agent that can reason over your documents and answer complex questions. Learn from the co-founder and CEO of LlamaIndex
Mervin Praison
Site Reliability Engineer
I'm writing a new vector search SQLite Extension
sqlite-vec is an new vector search SQLite extension, coming soon!
Blob<96>