GenAI

346 bookmarks

Transformers

Deep Dive

·e2eml.school·yesterday at 6:56 PM

The State of Reinforcement Learning for LLM Reasoning

Understanding GRPO and New Insights from Reasoning Model Papers

Reinforcement Learning

·magazine.sebastianraschka.com·Jul 30, 2025

Six Principles for Production AI Agents

Practical lessons from building production agentic systems

Agents

·app.build·Jul 29, 2025

Why agent infrastructure matters

Learn why agent infrastructure is essential to handling stateful, long-running tasks — and how LangGraph Platform provides the runtime support needed to build and scale reliable agents.

Infrastructure

·blog.langchain.com·Jul 29, 2025

Context Engineering for AI Agents: Lessons from Building Manus

This post shares the local optima Manus arrived at through our own "SGD". If you're building your own AI agent, we hope these principles help you converge faster.

Context Engineering #AI Agents #Context Engineering

·manus.im·Jul 24, 2025

Understanding RAG vs Fine-Tuning

Discover the key differences between RAG and fine-tuning, what each approach can bring, and how to choose the right AI approach for your business goals.

Fine-Tuning #Fine-tuning #RAG

·cohere.com·Jul 24, 2025

Docs for AI agents

Agents #AI Agents

·technicalwriting.dev·Jul 24, 2025

Urn:li:ugc post:7351284834956185600

Document Parsers #Document Parsing

·linkedin.com·Jul 22, 2025

Reinforcement Learning (RL) Guide | Unsloth Documentation

Learn all about Reinforcement Learning (RL) and how to train your own DeepSeek-R1 reasoning model with Unsloth using GRPO. A complete guide from beginner to advanced.

Reinforcement Learning

·docs.unsloth.ai·Jul 22, 2025

Advanced: Reinforcement Learning, Kernels, Reasoning, Quantization & Agents AIE 2025

➤ Check out our updated Reinforcement Learning guide!

Reinforcement Learning

·docs.google.com·Jul 22, 2025

Utkarsh Kanwat - AI Engineer

AI Engineer at ANZ Bank working on intelligent systems, LLM optimization, and scalable ML platforms.

Agents #AI Agents #AI Engineering #LLM

·utkarshkanwat.com·Jul 22, 2025

The Hitchhiker's Guide to Vector Search

A Qdrant Star shares her hard-won lessons from her extensive open-source building

Vector Search

·qdrant.tech·Jul 17, 2025

a2a-community/a2a-ui

Contribute to a2a-community/a2a-ui development by creating an account on GitHub.

Agent2Agent

·github.com·Jul 16, 2025

egor-baranov/a2a-ui: Repo migrated to a2a-community organization https://github.com/a2a-community/a2a-ui

Repo migrated to a2a-community organization https://github.com/a2a-community/a2a-ui - egor-baranov/a2a-ui

Agent2Agent

·github.com·Jul 16, 2025

Turbocharging Customer Support Chatbot Development with LLM-Based Automated Evaluation

Key Contributors: Lily Sierra, Nour Alkhatib, Steven Gross, Jacquelene Obeid, Kyle Swint, Monta Shen, Gary Song, Riddhima Sejpal, Jatin…

Evaluation

·tech.instacart.com·Jul 15, 2025

GitHub - AdemBoukhris457/Docs_Parsing_Techniques: Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)

Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...) - AdemBoukhris457/Docs_Parsing_Techniques

Document Parsers #Document Parsing #Sample-Code #OCR

·github.com·Jul 13, 2025

A2A vs ACP Protocol Comparison Analysis Report

A2A (Agent2Agent Protocol) and ACP (Agent Communication Protocol) represent two mainstream technical approaches in AI multi-agent system communication: 'cross-platform interoperability' and 'local/edge autonomy' respectively. A2A, with its powerful cross-vendor interconnection capabilities and rich task collaboration mechanisms, has become the preferred choice for cloud-based and distributed multi-agent scenarios; while ACP, with its low-latency, local-first, cloud-independent characteristics, is suitable for privacy-sensitive, bandwidth-constrained, or edge computing environments. Both protocols have their own focus in protocol design, ecosystem construction, and standardization governance, and are expected to further converge in openness in the future. Developers are advised to choose the most suitable protocol stack based on actual business needs.

Agent2Agent

·a2aprotocol.ai·Jul 11, 2025

Anthropic Academy: Claude API Development Guide \ Anthropic

Learn to build applications with Claude's API. Find detailed documentation, integration guides, code examples, and best practices for developing with our AI capabilities.

Tutorial #Generative AI #GenAI #Claude #Anthropic

·anthropic.com·Jul 9, 2025

GitHub - TencentQQGYLab/AppAgent: AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. - TencentQQGYLab/AppAgent

Computer Use

·github.com·Jul 9, 2025

Weights & Biases

Weights & Biases, developer tools for machine learning

Agent2Agent

·wandb.ai·Jul 7, 2025

LangGraph Rollout: Evolving VeRL’s Multi-Turn Capabilities for Agent RL

After completing our multi-turn tokenization and masking refactoring, we eliminated a critical bottleneck that was preventing us from building a more consistent and flexible rollout system for our Agent RL research. This breakthrough enabled us to implement a LangGraph-based rollout for VeRL in just a few days, which we’ve already successfully deployed in our Agent RL experiments. In this article, I’ll share our journey from VeRL’s native multi-turn implementation to our new LangGraph-based solution, explaining both the motivations driving this evolution and the technical details of our implementation.

Reinforcement Learning

·jybsuper.github.io·Jul 6, 2025

Context Engineering - What it is, and techniques to consider — LlamaIndex - Build Knowledge Assistants over your Enterprise Data

LlamaIndex is a simple, flexible framework for building knowledge assistants using LLMs connected to your enterprise data.

Context Engineering #LlamaIndex #Prompt Engineering

·llamaindex.ai·Jul 6, 2025

Context Engineering Guide

Context Engineering Guide By DAIR.AI Academy Table of Contents What is Context Engineering? Context Engineering is Action System Prompt Instructions User Input Structured Inputs and Outputs Tool Calling RAG & Memory State & Historical Context Advanced Context Engineering Resources What is Co...

Context Engineering #Generative AI #Prompt Engineering

·docs.google.com·Jul 5, 2025

CS294/194-196 Large Language Model Agents

Fall 2024

Tutorial #GenAI #AI Agents #Tutorial #Academic

·rdi.berkeley.edu·Jul 4, 2025

37 Things I Learned About Information Retrieval in Two Years at a Vector Database Company – Leonie Monigatti

From BM25 to RAG: Everything I learned about vector databases, embedding models, and vector search - and everything in between.

Vector Search #Semantic Search #Embeddings

·leoniemonigatti.com·Jul 3, 2025

Fine-tune ModernBERT for RAG with Synthetic Data

A Blog post by Sara Han Díaz on Hugging Face

Fine-Tuning #Contradiction Detection #Fine-tuning

·huggingface.co·Jul 3, 2025

The New Skill in AI is Not Prompting, It's Context Engineering

Context Engineering is the new skill in AI. It is about providing the right information and tools, in the right format, at the right time.

Context Engineering

·philschmid.de·Jul 2, 2025

Context Engineering

TL;DR: Agents need context to perform tasks. Context engineering is the art and science of filling the context window with just the right information at each step of an agent’s trajectory. In this post, we break down some common strategies for context engineering: write, select, compress, and isolate.

Context Engineering #LangGraph

·blog.langchain.com·Jul 2, 2025
