GenAI

GenAI

416 bookmarks
Newest
Agent Engineering: A New Discipline
Agent Engineering: A New Discipline
If you’ve built an agent, you know that the delta between “it works on my machine” and “it works in production” can be huge. Traditional software assumes you mostly know the inputs and can define the outputs. Agents give you neither: users can say literally anything, and the space
·blog.langchain.com·
Agent Engineering: A New Discipline
Evaluating Deep Agents: Our Learnings
Evaluating Deep Agents: Our Learnings
Over the past month at LangChain, we shipped four applications on top of the Deep Agents harness: * DeepAgents CLI: a coding agent * LangSmith Assist: an in-app agent to help with various things in LangSmith * Personal Email Assistant: an email assistant that learns from interactions with each user * Agent Builder: a no-code agent building platform powered by meta deep agents Building and shipping these agents meant adding evals for each of them, and we learned a lot along the way! In this
·blog.langchain.com·
Evaluating Deep Agents: Our Learnings
Parloa's Bayesian Framework to A/B Test AI Agents
Parloa's Bayesian Framework to A/B Test AI Agents
Learn about our hierarchical Bayesian model for A/B testing AI agents. It combines deterministic binary metrics and LLM-judge scores into a single framework that accounts for variation across different groups
·parloa.com·
Parloa's Bayesian Framework to A/B Test AI Agents
Agents Should Be More Opinionated | vtrivedy
Agents Should Be More Opinionated | vtrivedy
The best agent products aren't the most flexible, they're the most opinionated. Learn why agents need fewer knobs, not more, and how to design around model intelligence spikes.
·vtrivedy.com·
Agents Should Be More Opinionated | vtrivedy
Deploy bidirectional streaming agents with Vertex AI Agent Engine and Live API - Build with AI / Agents - Google Developer forums
Deploy bidirectional streaming agents with Vertex AI Agent Engine and Live API - Build with AI / Agents - Google Developer forums
This blog has been co-authored by Hanfei Sun, Vertex AI Agent Engine, Software Engineer, and Huang Xia, Vertex AI Agent Engine, Software Engineer. TL;DR: Vertex AI Agent Engine now integrates with the Live API to enable real-time, bidirectional streaming agents. This allows for low-latency, human-like conversations using text and audio. This post demonstrates how to quickly build a streaming agent with the Agent Development Kit (ADK), leveraging a fully managed, serverless platform that hand...
·discuss.google.dev·
Deploy bidirectional streaming agents with Vertex AI Agent Engine and Live API - Build with AI / Agents - Google Developer forums
Design Patterns for Securing LLM Agents against Prompt Injections
Design Patterns for Securing LLM Agents against Prompt Injections
As AI agents powered by Large Language Models (LLMs) become increasingly versatile and capable of addressing a broad spectrum of tasks, ensuring their security has become a critical challenge. Among the most pressing threats are prompt injection attacks, which exploit the agent's resilience on natural language inputs -- an especially dangerous threat when agents are granted tool access or handle sensitive information. In this work, we propose a set of principled design patterns for building AI agents with provable resistance to prompt injection. We systematically analyze these patterns, discuss their trade-offs in terms of utility and security, and illustrate their real-world applicability through a series of case studies.
·arxiv.org·
Design Patterns for Securing LLM Agents against Prompt Injections