Supercharging LLM Application Development with LLM-Kit
Discover how Grab's LLM-Kit accelerates AI app development by addressing scalability, security, and integration challenges. This article covers the problems teams face when building LLM apps, the solution Grab arrived at, the architecture of the LLM-Kit, and its future roadmap.
A guide to Amazon Bedrock Model Distillation (preview)
This post introduces the Amazon Bedrock Model Distillation workflow. We first cover the general concept of model distillation in Amazon Bedrock, then walk through the key steps: setting up permissions, selecting the teacher and student models, providing an input dataset, launching the distillation job, and evaluating and deploying the student model afterwards.
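The steps above can be sketched with boto3. This is a hypothetical outline, not the post's own code: the model IDs, role ARN, S3 paths, and `maxResponseLengthForInference` value are placeholder assumptions, and the exact request shape should be checked against the current boto3 `bedrock` documentation before use.

```python
# Sketch of assembling a Bedrock model distillation job request (assumed shape).
# The actual call, bedrock.create_model_customization_job(**params), requires
# AWS credentials plus an IAM role with S3 and Bedrock permissions.

def build_distillation_job(job_name, teacher_model, student_model,
                           role_arn, input_s3, output_s3):
    """Assemble the request body for create_model_customization_job."""
    return {
        "jobName": job_name,
        "customModelName": f"{job_name}-distilled",
        "roleArn": role_arn,                   # IAM role with S3 + Bedrock access
        "baseModelIdentifier": student_model,  # the smaller student model
        "customizationType": "DISTILLATION",
        "customizationConfig": {
            "distillationConfig": {
                "teacherModelConfig": {
                    "teacherModelIdentifier": teacher_model,
                    "maxResponseLengthForInference": 1000,
                }
            }
        },
        "trainingDataConfig": {"s3Uri": input_s3},    # prompts for the teacher
        "outputDataConfig": {"s3Uri": output_s3},
    }

params = build_distillation_job(
    "my-distill-job",
    teacher_model="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    student_model="anthropic.claude-3-haiku-20240307-v1:0",
    role_arn="arn:aws:iam::123456789012:role/BedrockDistillationRole",
    input_s3="s3://my-bucket/distillation/input/",
    output_s3="s3://my-bucket/distillation/output/",
)
# import boto3
# boto3.client("bedrock").create_model_customization_job(**params)
```

After the job completes, the distilled student model appears as a custom model that can be evaluated and deployed like any other Bedrock model.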
Learn to become an AI Engineer using this roadmap. Community-driven articles, resources, guides, interview questions, and quizzes for modern AI engineering.
Use code YOUTUBE to get an extra 20% off my AI courses here: https://www.jointakeoff.com/ This is the RAG course from Takeoff. We're making the full videos fro...
People hop on stream all the time and ask me: what is the fastest way to learn about the lowest level? How do I learn how computers work? Check out this video to find out.
Code: https://pastebin.com/raw/TpHbB91G
🛒 GREAT BOOKS FOR THE LOWEST LEVEL🛒
Blue Fox: Arm Assembly Internals and Reverse Engineering: https://amzn.to/4394t87
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation: https://amzn.to/3C1z4sk
Practical Malware Analysis: The Hands-On Guide to Dissecting Malicious Software: https://amzn.to/3C1daFy
The Ghidra Book: The Definitive Guide: https://amzn.to/3WC2Vkg
This post discusses integrating the Stripe agent toolkit with large language models (LLMs) to enhance automation workflows, enabling financial services access, metered billing, and streamlined operations across agent frameworks.
Unlock the secrets to mastering Artificial Intelligence (AI) quickly with this self-study roadmap, based on the prestigious Stanford AI Graduate Certificate ...
Previously, RAG systems were the standard method for retrieving information from documents. However, if you are not repeatedly querying the same document, it may be more convenient and effective to just use long-context LLMs. For example, Llama 3.1 8B and Llama 3.2 1B/3B now…
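The trade-off described above can be made concrete with a toy decision sketch. This is an illustration, not a benchmark: the 128k context window matches what Llama 3.1 models advertise, but the 4-characters-per-token estimate and the query-count cutoff are rough assumptions.

```python
# Toy heuristic: stuff the document into a long-context LLM when it fits and
# you only query it once or twice; otherwise index it once and use RAG.

CONTEXT_WINDOW_TOKENS = 128_000  # e.g. Llama 3.1 supports a 128k context

def estimate_tokens(text: str) -> int:
    # crude heuristic: roughly 4 characters per token for English text
    return len(text) // 4

def choose_strategy(document: str, expected_queries: int) -> str:
    """Pick long-context stuffing vs. RAG for a document and query workload."""
    fits = estimate_tokens(document) < CONTEXT_WINDOW_TOKENS
    if fits and expected_queries <= 2:
        return "long-context"  # one-off question: just pass the document in
    return "rag"               # repeated queries or oversized doc: index once

print(choose_strategy("short report " * 100, expected_queries=1))    # long-context
print(choose_strategy("huge corpus " * 200_000, expected_queries=1)) # rag
```

The point is the cost structure: RAG pays an indexing cost up front that only amortizes over repeated queries against the same document.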
LlamaIndex 🦙 on X: "Check out this video from @thesourabhd on how to build AI agents using LlamaCloud plus @qdrant_engine! https://t.co/DVfK0FE0bD"
This deep dive covers:
➡️ Implementing semantic caching in agent systems to improve speed and efficiency
➡️ Advanced agent techniques like query routing, query decomposition,…
— LlamaIndex 🦙 (@llama_index)
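Semantic caching, the first technique the video covers, can be sketched in a few lines. This is a self-contained toy, not LlamaCloud or Qdrant code: the bag-of-words "embedding" stands in for a real embedding model, and the 0.8 similarity threshold is an assumption you would tune.

```python
# Minimal semantic cache for an agent: before calling the LLM, embed the query
# and check whether a sufficiently similar query was already answered.
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    # toy embedding: lowercase bag-of-words counts (a real system would
    # use an embedding model and a vector store such as Qdrant)
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold
        self.entries = []  # list of (embedding, answer) pairs

    def lookup(self, query: str):
        q = embed(query)
        best = max(self.entries, key=lambda e: cosine(q, e[0]), default=None)
        if best and cosine(q, best[0]) >= self.threshold:
            return best[1]  # cache hit: skip the expensive LLM call
        return None

    def store(self, query: str, answer: str):
        self.entries.append((embed(query), answer))

cache = SemanticCache()
cache.store("what is semantic caching", "Reusing answers for similar queries.")
print(cache.lookup("What is semantic caching?"))  # near-duplicate: cache hit
print(cache.lookup("how do transformers work"))   # unrelated: cache miss
```

The speed-up comes from answering near-duplicate queries from the cache instead of re-running the agent's full LLM pipeline.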
Nexa AI: the on-device AI open source community building the future. Explore quantized models on the edge with the Nexa AI Model Hub: LLaMA, Stable Diffusion, Whisper, and more pre-trained models for NLP, computer vision, multimodality, and on-device AI.
Large Language Models are pushing the boundaries of artificial intelligence, but their immense size poses significant computational challenges. As these models grow, so does the need for smart optimization techniques to keep them running efficiently on modern hardware. In this post, we'll explore key optimization strategies that are making LLMs faster and more memory-efficient. We'll start with a brief look at the GPU memory hierarchy, which forms the foundation for many of these techniques. Then, we'll explore algorithms that allow LLMs to process information more quickly and handle longer contexts. Understanding these techniques offers valuable insights that help unlock the full potential of Large Language Models.
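To see why GPU memory is the central constraint, it helps to estimate the key-value (KV) cache that attention layers keep for every token in the context. The sketch below uses Llama-2-7B-style dimensions (32 layers, 32 heads, head dimension 128) as an illustrative assumption; the formula itself is the standard one for decoder-only transformers.

```python
# Estimate KV-cache memory for a decoder-only transformer. Each layer stores
# one key and one value vector (hence the factor of 2) per token, per head.

def kv_cache_bytes(n_layers, n_heads, head_dim, seq_len, batch, bytes_per_el=2):
    # bytes_per_el=2 corresponds to fp16/bf16 storage
    return 2 * n_layers * n_heads * head_dim * seq_len * batch * bytes_per_el

size = kv_cache_bytes(n_layers=32, n_heads=32, head_dim=128,
                      seq_len=4096, batch=1)
print(f"{size / 2**30:.1f} GiB")  # 2.0 GiB for one 4096-token sequence
```

At 512 KiB per token, the cache grows linearly with context length and batch size, which is exactly what techniques like paged attention and grouped-query attention target.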
Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS CDK
In this post, we demonstrate how to seamlessly automate the deployment of an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and the AWS Cloud Development Kit (AWS CDK), enabling organizations to quickly set up a powerful question answering system.