Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 · karpathy/llm.c · Discussion #481
Let's reproduce the GPT-2 (124M) in llm.c (~4,000 lines of C/CUDA) in 90 minutes for $20. The 124M model is the smallest model in the GPT-2 series released by OpenAI in 2019, and is actually quite ...
The Surprising Power of Next Word Prediction: Large Language Models Explained, Part 1 | Center for Security and Emerging Technology
Large language models (LLMs), the technology that powers generative artificial intelligence (AI) products like ChatGPT or Google Gemini, are often thought of as chatbots that predict the next word. But that isn't the full story of what LLMs are and how they work. This is the first blog post in a three-part series explaining some key elements of how LLMs function. This blog post covers pre-training—the process by which LLMs learn to predict the next word—and why it’s so surprisingly powerful.
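For readers who want the mechanics behind "predicting the next word", here is a minimal sketch of the next-token cross-entropy objective used in pre-training. This is a generic illustration in plain PyTorch, not code from the post; the shapes and toy data are placeholders.

```python
# Minimal sketch of the next-token prediction objective used in pre-training.
# The model is assumed to be autoregressive; logits and tokens here are toy data.
import torch
import torch.nn.functional as F

def next_token_loss(logits, token_ids):
    """Cross-entropy between the prediction at position t and the actual
    token at position t+1, averaged over the batch and sequence."""
    # logits: (batch, seq_len, vocab_size); token_ids: (batch, seq_len)
    shifted_logits = logits[:, :-1, :]   # predictions for positions 0..T-2
    targets = token_ids[:, 1:]           # the "next word" at each position
    return F.cross_entropy(
        shifted_logits.reshape(-1, shifted_logits.size(-1)),
        targets.reshape(-1),
    )

# Toy example: batch of 2 sequences, 5 tokens each, vocabulary of 100.
logits = torch.randn(2, 5, 100)
tokens = torch.randint(0, 100, (2, 5))
print(next_token_loss(logits, tokens))
```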
A Survey of Techniques for Maximizing LLM Performance
Join us for a comprehensive survey of techniques designed to unlock the full potential of Large Language Models (LLMs). Explore strategies such as fine-tuning...
AI Engineering: From Agents to LLM OS (plus demos from AI Engineer Singapore meetup)
I gave a talk at the recent AI Eng Singapore meetup (https://www.latent.space/p/community, scroll down) about the past year in agents thinking and the bui...
Experts.js is the easiest way to create and deploy OpenAI's Assistants and link them together as Tools to create advanced Multi AI Agent Systems with expanded memory and attention to detail.
How to build your own Perplexity for any dataset - Learnings from building “Ask Hacker Search”
How does something like Perplexity work, and how do we make our own? And having done that, what turned out to be the most interesting or challenging parts?
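As a rough illustration of the retrieve-then-answer pattern such systems use (not the article's actual implementation), here is a sketch where `search_index()` is a hypothetical helper standing in for whatever search backend you build, and the answering step uses the OpenAI chat API.

```python
# Sketch of the retrieve-then-answer loop behind a Perplexity-style tool.
# `search_index` is a hypothetical placeholder for your own keyword or
# vector index (e.g. over Hacker News posts).
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def search_index(query: str, k: int = 5) -> list[dict]:
    """Hypothetical: return the top-k documents for the query."""
    raise NotImplementedError("plug in your own search backend here")

def answer(question: str) -> str:
    docs = search_index(question)
    context = "\n\n".join(
        f"[{i + 1}] {d['title']}\n{d['text']}" for i, d in enumerate(docs)
    )
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system",
             "content": "Answer using only the sources below, and cite them as [n]."},
            {"role": "user",
             "content": f"Sources:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content
```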
Why I'm Staying Away from Crew AI: My Honest Opinion
Crew AI is not suitable for production use cases. I’ll be going through why I believe this is the case and what you should do instead when building your own ...
The unreasonable effectiveness of embeddings
Or how I learned to stop worrying and love the hallucinations. This week I dived deep into vector databases. My go...
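A minimal sketch of the core idea the video covers (embed texts once, then rank by cosine similarity against an embedded query). This uses sentence-transformers; the model name and example documents are just placeholders, not choices from the video.

```python
# Sketch of semantic search with embeddings: embed documents, embed the query,
# rank by cosine similarity. Model name and documents are illustrative only.
from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer("all-MiniLM-L6-v2")

docs = [
    "Vector databases store embeddings and answer nearest-neighbour queries.",
    "GPT-2 was released by OpenAI in 2019.",
    "Embeddings map text to points in a high-dimensional space.",
]
doc_vecs = model.encode(docs, normalize_embeddings=True)

query_vec = model.encode("how do vector databases work?", normalize_embeddings=True)
scores = doc_vecs @ query_vec            # cosine similarity (vectors are unit-normalized)
print(docs[int(np.argmax(scores))])      # best-matching document
```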
When evaluating RAG, it's not just the final answer that is worth evaluating
Evaluating intermediate steps (like query rephrasing and retrieved documents) is also super important when trying to make sure RAG works
Good tutorial on how to do this!
— Harrison Chase (@hwchase17)
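A minimal sketch of what evaluating an intermediate step can look like: scoring the retrieval stage on its own with recall@k over a small hand-labelled set. The `retrieve` function and the example data are hypothetical placeholders, not from the tutorial.

```python
# Sketch: evaluate the retrieval step of a RAG pipeline separately from the
# final answer, using recall@k over a small hand-labelled eval set.
# `retrieve` and the eval data are hypothetical placeholders.

def retrieve(query: str, k: int = 5) -> list[str]:
    """Hypothetical: return the ids of the top-k documents for the query."""
    raise NotImplementedError("plug in your retriever here")

eval_set = [
    {"query": "how do I rotate API keys?", "relevant_ids": {"doc_17"}},
    {"query": "which regions support GPU instances?", "relevant_ids": {"doc_42", "doc_43"}},
]

def recall_at_k(k: int = 5) -> float:
    """Fraction of eval queries for which at least one relevant doc is retrieved."""
    hits = 0
    for example in eval_set:
        retrieved = set(retrieve(example["query"], k=k))
        if retrieved & example["relevant_ids"]:
            hits += 1
    return hits / len(eval_set)

print(f"recall@5 = {recall_at_k(5):.2f}")
```

The same idea extends to the other intermediate steps mentioned in the tweet, e.g. checking rephrased queries against the originals before they ever reach the retriever.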