Transformers

GenAI
The State of Reinforcement Learning for LLM Reasoning
Understanding GRPO and New Insights from Reasoning Model Papers
Six Principles for Production AI Agents
Practical lessons from building production agentic systems
Why agent infrastructure matters
Learn why agent infrastructure is essential to handling stateful, long-running tasks — and how LangGraph Platform provides the runtime support needed to build and scale reliable agents.
Context Engineering for AI Agents: Lessons from Building Manus
This post shares the local optima Manus arrived at through our own "SGD". If you're building your own AI agent, we hope these principles help you converge faster.
Understanding RAG vs Fine-Tuning
Discover the key differences between RAG and fine-tuning, what each approach can bring, and how to choose the right AI approach for your business goals.
Docs for AI agents
Urn:li:ugc post:7351284834956185600
Reinforcement Learning (RL) Guide | Unsloth Documentation
Learn all about Reinforcement Learning (RL) and how to train your own DeepSeek-R1 reasoning model with Unsloth using GRPO. A complete guide from beginner to advanced.
Advanced: Reinforcement Learning, Kernels, Reasoning, Quantization & Agents AIE 2025
➤ Check out our updated Reinforcement Learning guide!
Utkarsh Kanwat - AI Engineer
AI Engineer at ANZ Bank working on intelligent systems, LLM optimization, and scalable ML platforms.
The Hitchhiker's Guide to Vector Search
A Qdrant Star shares her hardwon lessons from her extensive opensource building
a2a-community/a2a-ui
Contribute to a2a-community/a2a-ui development by creating an account on GitHub.
egor-baranov/a2a-ui: Repo migrated to a2a-community ogranization https://github.com/a2a-community/a2a-ui
Repo migrated to a2a-community ogranization https://github.com/a2a-community/a2a-ui - egor-baranov/a2a-ui
Turbocharging Customer Support Chatbot Development with LLM-Based Automated Evaluation
Key Contributors: Lily Sierra, Nour Alkhatib, Steven Gross, Jacquelene Obeid, Kyle Swint, Monta Shen, Gary Song, Riddhima Sejpal, Jatin…
GitHub - AdemBoukhris457/Docs_Parsing_Techniques: Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)
Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...) - AdemBoukhris457/Docs_Parsing_Techniques
A2A vs ACP Protocol Comparison Analysis Report
A2A (Agent2Agent Protocol) and ACP (Agent Communication Protocol) represent two mainstream technical approaches in AI multi-agent system communication: 'cross-platform interoperability' and 'local/edge autonomy' respectively. A2A, with its powerful cross-vendor interconnection capabilities and rich task collaboration mechanisms, has become the preferred choice for cloud-based and distributed multi-agent scenarios; while ACP, with its low-latency, local-first, cloud-independent characteristics, is suitable for privacy-sensitive, bandwidth-constrained, or edge computing environments. Both protocols have their own focus in protocol design, ecosystem construction, and standardization governance, and are expected to further converge in openness in the future. Developers are advised to choose the most suitable protocol stack based on actual business needs.
Anthropic Academy: Claude API Development Guide \ Anthropic
Learn to build applications with Claude's API. Find detailed documentation, integration guides, code examples, and best practices for developing with our AI capabilities.
GitHub - TencentQQGYLab/AppAgent: AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. - TencentQQGYLab/AppAgent
Weights & Biases
Weights & Biases, developer tools for machine learning
LangGraph Rollout: Evolving VeRL’s Multi-Turn Capabilities for Agent RL
After completing our multi-turn tokenization and masking refactoring, we eliminated a critical bottleneck that was preventing us from building a more consistent and flexible rollout system for our Agent RL research. This breakthrough enabled us to implement a LangGraph-based rollout for VeRL in just a few days, which we’ve already successfully deployed in our Agent RL experiments. In this article, I’ll share our journey from VeRL’s native multi-turn implementation to our new LangGraph-based solution, explaining both the motivations driving this evolution and the technical details of our implementation.
Context Engineering - What it is, and techniques to consider — LlamaIndex - Build Knowledge Assistants over your Enterprise Data
LlamaIndex is a simple, flexible framework for building knowledge assistants using LLMs connected to your enterprise data.
Context Engineering Guide
Context Engineering Guide By DAIR.AI Academy Table of Contents What is Context Engineering? Context Engineering is Action System Prompt Instructions User Input Structured Inputs and Outputs Tool Calling RAG & Memory State & Historical Context Advanced Context Engineering Resources What is Co...
CS294/194-196 Large Language Model Agents
Fall 2024
37 Things I Learned About Information Retrieval in Two Years at a Vector Database Company – Leonie Monigatti
From BM25 to RAG: Everything I learned about vector databases, embedding models, and vector search - and everything in between.
Fine-tune ModernBERT for RAG with Synthetic Data
A Blog post by Sara Han Díaz on Hugging Face
The New Skill in AI is Not Prompting, It's Context Engineering
Context Engineering is the new skill in AI. It is about providing the right information and tools, in the right format, at the right time.
Context Engineering
TL;DR
Agents need context to perform tasks. Context engineering is the art and science of filling the context window with just the right information at each step of an agent’s trajectory. In this post, we break down some common strategies — write, select, compress, and isolate — for context engineering
Training and Finetuning Sparse Embedding Models with Sentence Transformers v5
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
How Exa built a Web Research Multi-Agent System with LangGraph and LangSmith
See how Exa used LangGraph and LangSmith to build a multi-agent web research system to process research queries