Behind the platform: the journey to create the LinkedIn GenAI application tech stack
Learn AI
Supercharging LLM Application Development with LLM-Kit
Discover how Grab's LLM-Kit enhances AI app development by addressing scalability, security, and integration challenges. This article discusses the challenges faced in building LLM apps, the solution, the architecture of LLM-Kit, and its future plans.
A guide to Amazon Bedrock Model Distillation (preview)
This post introduces the workflow of Amazon Bedrock Model Distillation. We first introduce the general concept of model distillation in Amazon Bedrock, and then focus on the important steps: setting up permissions, selecting the models, providing an input dataset, running the model distillation jobs, and evaluating and deploying the student models after distillation.
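The post walks through those steps in the console and API; below is a minimal boto3 sketch of submitting a distillation job. The job-level fields follow the standard create_model_customization_job call, but the distillation-specific customizationConfig fields, model identifiers, bucket paths, and role ARN are illustrative assumptions, not taken from the post.

```python
import boto3

# Control-plane client for Amazon Bedrock (job management lives here, not in
# bedrock-runtime). Region and all resource names below are placeholders.
bedrock = boto3.client("bedrock", region_name="us-east-1")

response = bedrock.create_model_customization_job(
    jobName="distill-demo-job",                       # assumed job name
    customModelName="distilled-student-model",        # assumed model name
    roleArn="arn:aws:iam::123456789012:role/BedrockDistillationRole",  # placeholder
    customizationType="DISTILLATION",
    baseModelIdentifier="anthropic.claude-3-haiku-20240307-v1:0",      # student model (assumed)
    customizationConfig={                             # field names here are assumptions
        "distillationConfig": {
            "teacherModelConfig": {
                "teacherModelIdentifier": "anthropic.claude-3-5-sonnet-20240620-v1:0",
            }
        }
    },
    trainingDataConfig={"s3Uri": "s3://my-bucket/distillation/input.jsonl"},  # placeholder
    outputDataConfig={"s3Uri": "s3://my-bucket/distillation/output/"},        # placeholder
)
print(response["jobArn"])  # poll get_model_customization_job to track status
```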
Understanding RAG Part I: Why It’s Needed - MachineLearningMastery.com
AI SDK 4.0 - Vercel
Introducing PDF support, computer use, and an xAI Grok provider
AI Engineer Roadmap
Learn to become an AI Engineer using this roadmap. Community-driven articles, resources, guides, interview questions, and quizzes for modern AI engineering.
Introduction - Model Context Protocol
Get started with the Model Context Protocol (MCP)
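For a quick sense of what "getting started" looks like, here is a minimal MCP server sketch using the official Python SDK's FastMCP helper; the server name and tool are made up for illustration.

```python
# Minimal MCP server sketch using the official Python SDK (package: "mcp").
# The tool below is a made-up example; real servers expose whatever tools,
# resources, and prompts their host application needs.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("demo-server")

@mcp.tool()
def add(a: int, b: int) -> int:
    """Add two numbers."""
    return a + b

if __name__ == "__main__":
    # Runs over stdio by default, so an MCP client (e.g. Claude Desktop)
    # can launch and talk to this process.
    mcp.run()
```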
Weights & Biases
Weights & Biases, developer tools for machine learning
Building RAG with Open-Source and Custom AI Models
Everything you need to know about building production-ready RAG systems.
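As orientation for the RAG entries in this list, the retrieval step at the heart of any such system can be sketched in a few lines; the embedding model and corpus below are illustrative, and a production system adds chunking, a vector database, reranking, and the final LLM call.

```python
# Minimal RAG retrieval sketch: embed documents, embed the query,
# and pick the most similar chunks to feed into an LLM prompt.
from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer("all-MiniLM-L6-v2")  # small open-source embedder

docs = [
    "RAG retrieves relevant context before generation.",
    "Long-context models can sometimes replace retrieval.",
    "Vector databases store embeddings for fast similarity search.",
]
doc_emb = model.encode(docs, normalize_embeddings=True)

query = "Why do LLM apps use retrieval?"
q_emb = model.encode([query], normalize_embeddings=True)[0]

scores = doc_emb @ q_emb                 # cosine similarity (embeddings are unit-normalized)
top_k = np.argsort(-scores)[:2]          # indices of the best-matching chunks
context = "\n".join(docs[i] for i in top_k)
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)                            # pass `prompt` to the LLM of your choice
```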
The Complete RAG Course - Learn AI Skills
The full RAG course from Takeoff (https://www.jointakeoff.com/).
Your LLMs need meta prompting
before you code, learn how computers work
People hop on stream all the time and ask me: what is the fastest way to learn about the lowest level? How do I learn how computers work? Check out this video to find out.
Code: https://pastebin.com/raw/TpHbB91G
Adding payments to your LLM agentic workflows
This post discusses integrating the Stripe agent toolkit with large language models (LLMs) to enhance automation workflows, enabling financial services access, metered billing, and streamlined operations across agent frameworks.
Google for Developers Blog - News about Web, Mobile, AI and Cloud
The first Web AI Summit, hosted by Google on October 18, 2024, brought together experts in machine learning models for web browsers.
AI Roadmap Stanford Certificate.pdf
NVIDIA AI Learning Essentials
Build skills, get certified, and learn from NVIDIA experts through hands-on self-paced courses and instructor-led workshops.
AI Machine Learning Roadmap: Self Study AI!
Unlock the secrets to mastering Artificial Intelligence (AI) quickly with this self-study roadmap, based on the prestigious Stanford AI Graduate Certificate.
Transformers.js v3: WebGPU Support, New Models & Tasks, and More…
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
stackblitz/bolt.new: Prompt, run, edit, and deploy full-stack web applications
openai/swarm: Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Creating Very High-Quality Transcripts with Open-Source Tools: A 100% automated workflow guide : r/LocalLLaMA
I've been working on a workflow for creating high-quality transcripts using primarily open-source tools.
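One common open-source building block in such a workflow is Whisper; here is a minimal transcription sketch (the audio file name and model size are placeholders, and the full workflow from the post layers diarization and cleanup on top of this).

```python
# Minimal transcription sketch with openai-whisper, one typical open-source
# component of a transcript pipeline.
import whisper

model = whisper.load_model("base")               # larger models trade speed for accuracy
result = model.transcribe("meeting_audio.mp3")   # placeholder audio file

print(result["text"])                            # full transcript
for seg in result["segments"]:                   # timestamped segments
    print(f'[{seg["start"]:.1f}s - {seg["end"]:.1f}s] {seg["text"]}')
```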
Previously, RAG systems were the standard method for retrieving information from documents. However, if you are not repeatedly querying the same document, it may be more convenient and effective to just use long-context LLMs. For example, Llama 3.1 8B and Llama 3.2 1B/3B now…
— Sebastian Raschka (@rasbt)
LlamaIndex 🦙 on X: How to build AI agents using LlamaCloud plus @qdrant_engine
Check out this video from @thesourabhd on how to build AI agents using LlamaCloud plus @qdrant_engine! https://t.co/DVfK0FE0bD
This deep dive covers:
➡️ Implementing semantic caching in agent systems to improve speed and efficiency
➡️ Advanced agent techniques like query routing, query decomposition,…
— LlamaIndex 🦙 (@llama_index)
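The semantic caching idea mentioned in the tweet above can be sketched without any agent framework: embed each incoming query, and if a previously answered query is similar enough, reuse its cached answer. The embedding model and similarity threshold below are arbitrary choices for illustration.

```python
# Semantic cache sketch: reuse an earlier answer when a new query is
# semantically close to one already answered.
from sentence_transformers import SentenceTransformer
import numpy as np

embedder = SentenceTransformer("all-MiniLM-L6-v2")
cache = []  # list of (embedding, answer) pairs

def answer_with_cache(query, llm_call, threshold=0.9):
    q = embedder.encode([query], normalize_embeddings=True)[0]
    for emb, cached_answer in cache:
        if float(np.dot(emb, q)) >= threshold:   # cosine similarity on unit vectors
            return cached_answer                  # cache hit: skip the expensive LLM call
    answer = llm_call(query)                      # cache miss: call the model
    cache.append((q, answer))
    return answer

# Usage with a stand-in "LLM":
fake_llm = lambda q: f"(model answer to: {q})"
print(answer_with_cache("What is semantic caching?", fake_llm))
print(answer_with_cache("Explain semantic caching", fake_llm))  # likely served from cache
```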
Nexa AI - The On-Device AI Open Source Community Building The Future. Explore Quantized AI Models On Edge | Nexa AI Model Hub For NLP, Computer Vision, Multimodality & On-Device AI
Nexa AI On-Device Model Hub: LLaMA, Stable Diffusion, Whisper & more. Pre-trained AI models for NLP, vision, multimodality.
Transformers Inference Optimization Toolset
Large Language Models are pushing the boundaries of artificial intelligence, but their immense size poses significant computational challenges. As these models grow, so does the need for smart optimization techniques to keep them running efficiently on modern hardware. In this post, we’ll explore key optimization strategies that are making LLMs faster and more memory-efficient. We’ll start with a brief look at GPU memory hierarchy, which forms the foundation for many of these techniques. Then, we’ll explore algorithms that allow LLMs to process information more quickly and handle longer contexts. Understanding these techniques offers valuable insights helping to unlock the full potential of Large Language Models.
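One concrete example of the memory-aware attention algorithms the post covers is fused scaled-dot-product attention in PyTorch, which dispatches to FlashAttention-style kernels when the hardware and dtypes allow; the tensor shapes below are arbitrary and the snippet is an illustration, not the post's own code.

```python
# Fused attention sketch: torch.nn.functional.scaled_dot_product_attention
# avoids materializing the full attention matrix on supported backends,
# which is what makes long contexts tractable in memory.
import torch
import torch.nn.functional as F

batch, heads, seq_len, head_dim = 1, 8, 2048, 64
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32

q = torch.randn(batch, heads, seq_len, head_dim, device=device, dtype=dtype)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Causal attention over a long sequence in one fused call.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # (1, 8, 2048, 64)
```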
Everything You Need to Know About the Hugging Face Model Hub and Community - MachineLearningMastery.com
Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK | Amazon Web Services
In this post, we demonstrate how to seamlessly automate the deployment of an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS Cloud Development Kit (AWS CDK), enabling organizations to quickly set up a powerful question answering system.
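Once such a stack is deployed, querying the Knowledge Base end to end is a single runtime call; a minimal boto3 sketch follows, where the knowledge base ID and model ARN are placeholders for values the CDK deployment would produce.

```python
# Query a deployed Knowledge Base with retrieve_and_generate (boto3).
import boto3

runtime = boto3.client("bedrock-agent-runtime", region_name="us-east-1")

response = runtime.retrieve_and_generate(
    input={"text": "What does our refund policy say about digital goods?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB1234567890",  # placeholder
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0",  # placeholder
        },
    },
)
print(response["output"]["text"])   # generated answer grounded in retrieved chunks
```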
5 Real-World Machine Learning Projects You Can Build This Weekend - MachineLearningMastery.com
Generate synthetic data for evaluating RAG systems using Amazon Bedrock | Amazon Web Services
In this post, we explain how to use Anthropic Claude on Amazon Bedrock to generate synthetic data for evaluating your RAG system.
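A minimal sketch of that idea follows, using the Bedrock Converse API to ask Claude for question-answer pairs from a document chunk; the prompt, model ID, and output handling are simplified assumptions rather than the post's actual pipeline.

```python
# Sketch: generate synthetic Q&A pairs from a document chunk with Claude on
# Bedrock, for use as a RAG evaluation set.
import boto3

runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

chunk = "Our warranty covers manufacturing defects for 24 months from purchase."
prompt = (
    "Read the passage and write 3 question-answer pairs that can be answered "
    f"only from it, one per line as 'Q: ... | A: ...'.\n\nPassage:\n{chunk}"
)

response = runtime.converse(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",   # placeholder model ID
    messages=[{"role": "user", "content": [{"text": prompt}]}],
    inferenceConfig={"maxTokens": 512, "temperature": 0.7},
)
synthetic_pairs = response["output"]["message"]["content"][0]["text"]
print(synthetic_pairs)   # feed these pairs into your RAG evaluation harness
```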
Best practices for building robust generative AI applications with Amazon Bedrock Agents – Part 1 | Amazon Web Services
In this post, we show you how to create accurate and reliable agents. Agents help you accelerate generative AI application development by orchestrating multistep tasks. Agents use the reasoning capability of foundation models (FMs) to break down user-requested tasks into multiple steps.
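After an agent is built along those lines, invoking it is a single streaming call; a minimal boto3 sketch is below, with the agent ID, alias ID, and request text as placeholders.

```python
# Sketch: invoke a Bedrock Agent and read its streamed completion.
import boto3

runtime = boto3.client("bedrock-agent-runtime", region_name="us-east-1")

response = runtime.invoke_agent(
    agentId="AGENT1234",            # placeholder
    agentAliasId="ALIAS1234",       # placeholder
    sessionId="demo-session-001",   # lets the agent keep multi-turn context
    inputText="Book a meeting room for Friday at 10am and email the team.",
)

# The agent streams events; completed text arrives in 'chunk' events.
answer = ""
for event in response["completion"]:
    if "chunk" in event:
        answer += event["chunk"]["bytes"].decode("utf-8")
print(answer)
```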