# LazyBridge

> Zero-boilerplate multi-provider LLM composition runtime, not an agent framework. Engine + Tools + State, everything is a tool.

# Category guides

# Best Python AI Agent Frameworks in 2026

No Python AI agent framework is best for every project.

The right choice depends on the shape of the system you are building: a graph, a team of agents, a typed application, a RAG pipeline, an enterprise workflow, a multi-agent conversation, or a composition-first runtime that grows from a function call into a governed service.

This guide compares the main Python AI agent frameworks by their primary design metaphor.

Looking for a focused 1-to-1 decision?

This page is the **category hub** — a broad comparison across eight frameworks. For a deeper, decision-tree-style head-to-head of just three, see [LazyBridge vs LangGraph vs CrewAI](https://core.lazybridge.com/comparison/index.md).

## Quick comparison

| Framework                 | Primary metaphor   | Best for                                                                                                                                         | Avoid if                                                                            |
| ------------------------- | ------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------- |
| LangGraph                 | Graph-first        | Long-running, stateful graph workflows with persistence, human-in-the-loop and durable execution                                                 | You want the smallest possible mental model for simple workflows                    |
| CrewAI                    | Crew-first         | Role-based teams of agents, tasks, processes and business automation                                                                             | Your workflow does not map naturally to roles and tasks                             |
| Pydantic AI               | Type-first         | Type-safe Python agents, structured outputs, dependency injection and validation                                                                 | Orchestration is more important than typed application structure                    |
| LlamaIndex                | Data-first         | RAG-heavy agents, private data, indexes, query engines and retrieval workflows                                                                   | Your core problem is not retrieval or knowledge grounding                           |
| Haystack                  | Pipeline-first     | Production RAG and search-heavy pipelines                                                                                                        | You want a compact agent runtime rather than a broader retrieval pipeline framework |
| Microsoft Agent Framework | Enterprise-first   | Microsoft, Azure, Semantic Kernel and enterprise workflow environments                                                                           | You want a small independent Python-first runtime                                   |
| Google ADK                | Google-first       | Gemini and Google Cloud-oriented agent systems                                                                                                   | You want provider-neutral primitives from the start                                 |
| AutoGen                   | Conversation-first | Multi-agent conversations and collaborative agent interactions                                                                                   | You want deterministic orchestration and tool composition as the base layer         |
| LazyBridge                | Composition-first  | Governed Python LLM workflows where agents, functions, plans, humans, MCP servers and external systems should compose through one tool interface | You need the largest mature ecosystem and community today                           |

## How to choose

Choose **LangGraph** if your workflow is truly a graph: long-running state, branching, loops, human review, checkpointing and fine-grained runtime control.

Choose **CrewAI** if your workflow reads naturally as a team: researcher, analyst, writer, reviewer, manager.

Choose **Pydantic AI** if your main problem is reliable typed outputs, dependency injection, validation and IDE-friendly Python application structure.

Choose **LlamaIndex** if the agent mostly works over documents, indexes, retrieval pipelines or private knowledge.

Choose **Microsoft Agent Framework** if your environment is already centered on Microsoft, Azure, Semantic Kernel or enterprise workflow tooling.

Choose **LazyBridge** if your workflow is composition-heavy and you want one Python-first model where functions, agents, deterministic plans, humans, MCP servers and external tools share the same runtime surface.

## The core distinction: framework metaphor

Most AI agent frameworks organize systems around one dominant metaphor.

| Metaphor           | Framework examples        | What it optimizes                                                      |
| ------------------ | ------------------------- | ---------------------------------------------------------------------- |
| Graph-first        | LangGraph                 | explicit topology, durable execution, graph state                      |
| Crew-first         | CrewAI                    | agent roles, teams, tasks, business processes                          |
| Type-first         | Pydantic AI               | typed dependencies, validated outputs, IDE help                        |
| Data-first         | LlamaIndex                | retrieval, indexes, private knowledge                                  |
| Pipeline-first     | Haystack                  | search, RAG, production document pipelines                             |
| Conversation-first | AutoGen                   | multi-agent dialogue                                                   |
| Enterprise-first   | Microsoft Agent Framework | platform workflows, deployment, governance                             |
| Composition-first  | LazyBridge                | recursive composition across agents, tools, plans, humans and services |

LazyBridge belongs in the last category: **composition-first**.

## A different category: composition-first runtimes

Most frameworks start with agents, graphs, crews, chains or workflows.

LazyBridge starts with a smaller model:

```text
Agent = Engine + Tools + State
```

The **Engine** decides what happens next. The **Tools** define what the agent can do. The **State** carries continuity, memory, persistence, events and observability.

That split matters because LazyBridge does not treat an agent as “an LLM wrapper”.

An Agent can be driven by:

- an `LLMEngine`;
- a deterministic `Plan`;
- a `HumanEngine`;
- a `SupervisorEngine`;
- a custom `Engine` (the [engine protocol](https://core.lazybridge.com/guides/advanced/engine-protocol/index.md)).

This means the same `Agent` surface can represent a one-shot model call, a deterministic pipeline, a human approval step, a supervised workflow or a larger nested system.

See the [mental model](https://core.lazybridge.com/concepts/mental-model/index.md) for the full picture.

## LazyBridge in one sentence

LazyBridge is a composition-first Python runtime for governed LLM workflows: Core defines how work composes, LazyTools defines what agents can safely touch, and LazyPulse turns workflows into always-on policy-gated services.

## The LazyBridge ecosystem

LazyBridge is split into three focused packages.

| Package         | Role                   | What it adds                                                                                                                                |
| --------------- | ---------------------- | ------------------------------------------------------------------------------------------------------------------------------------------- |
| LazyBridge Core | Composition runtime    | `Agent`, `Engine`, `Tool`, `Envelope`, `Plan`, `Step`, `Store`, `Session`, `HumanEngine`, `SupervisorEngine`, guards, verification, tracing |
| LazyTools       | Capability layer       | Gmail, Telegram, MCP, external gateways, document readers, skills, allowlists, confirmation gates                                           |
| LazyPulse       | Governed service layer | tick loop, inbound adapters, trust policy, task lifecycle, human review and audit trail                                                     |

The split is intentional:

- Core does not need to import connector packages.
- Tools does not need to know about the scheduler.
- Pulse runs Agents and applies policy before external work reaches the model.

A compact way to understand the ecosystem:

```text
Core = how work composes
Tools = what the system can touch
Pulse = when work runs and under which policy
```

## Original design ideas in LazyBridge

### 1. Agents are not LLMs

In LazyBridge, an Agent is not synonymous with a model call.

The Agent is a runtime shell. The engine decides what kind of decision-making happens inside it.

That engine can be dynamic, deterministic, human-driven or supervised.

```python
from lazybridge import Agent, LLMEngine, Plan, Step

dynamic_agent = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
)

deterministic_pipeline = Agent(
    engine=Plan(Step("research"), Step("write")),
    tools=[research_agent, writer_agent],
    name="pipeline",
)
```

The outside surface is still an Agent.

### 2. Pipelines are agents

LazyBridge does not need a separate pipeline type.

A pipeline is an Agent whose engine is a `Plan`.

Because Agents are Tools, a pipeline can be nested inside a larger agent or a larger plan without glue code. See [layered composition](https://core.lazybridge.com/concepts/layered-composition/index.md).

```text
Plan-backed Agent → Tool → Step in a larger Plan
```

This makes composition structural rather than syntactic.

### 3. Tools are recursive

LazyBridge uses one recursive primitive: `Tool`.

A Tool can be:

- a Python function;
- a callable;
- another Agent;
- the same Agent under an alias;
- a Plan-backed Agent;
- a provider-native capability;
- an MCP server;
- a pre-built JSON schema;
- an external tool catalogue.

The consuming agent sees the same Tool contract in every case.

This is the practical meaning of [“everything is a tool”](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md).

### 4. Envelope makes runs observable

Every agent call returns an [`Envelope`](https://core.lazybridge.com/guides/basic/envelope/index.md).

An Envelope carries:

- task;
- context;
- multimodal input;
- typed payload;
- metadata;
- error state;
- token counts;
- cost;
- latency;
- model;
- provider;
- run id;
- nested token and cost rollups.

The important part is transitive observability: when an agent calls another agent as a tool, that nested run does not disappear. Its cost, tokens, errors and latency roll up into the parent Envelope.

### 5. Plan separates orchestration from reasoning

A [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md) is a deterministic engine.

It declares steps, data flow, routes, typed handoffs, parallel bands, store writes and checkpoint/resume behavior.

Use an LLM when the model should decide. Use a Plan when the system should decide.

This separation keeps high-level control flow out of the prompt when auditability, repeatability or cost predictability matter.

### 6. Verification is feedback, not only blocking

LazyBridge has hard guards and soft [verification](https://core.lazybridge.com/guides/mid/verify/index.md).

A guard blocks when a policy is violated.

`verify=` is different: it runs a judge-and-retry loop. If the judge rejects an output, the rejection reason becomes feedback for the next attempt, bounded by `max_verify`.

That makes verification useful for recoverable quality problems: drafts, summaries, customer-facing replies, regulated content and critical handoffs.

### 7. MCP is a tool boundary, not a new engine

LazyTools treats an MCP server as a tool catalogue.

The MCP connector expands a server into one Tool per advertised tool, namespaces names to avoid collisions, and respects allow/deny lists.

There is no separate `MCPEngine`.

Once an MCP server is in `tools=[...]`, the agent treats its tools like local Python functions: structured arguments, parallel calls, cost tracking and session events.

### 8. Governance happens before reasoning

LazyPulse assumes the LLM worker is not a security boundary.

A prompt-injected email can convince a model of almost anything. LazyPulse therefore authorizes inbound work before the worker runs, in code, using sender identity and action class rather than message text.

A `PulseAgent` is still a LazyBridge Agent, but with:

```text
tick loop + trust policy + inbound adapters
```

### 9. Dangerous actions need scoped one-shot grants

LazyTools includes `ConfirmationGate`.

Confirmations are not sticky booleans. A grant authorizes one action and is consumed on use. Grants can be target-bound and scope-bound, so an approval for one task cannot silently authorize another task under concurrency.

That design is especially important for tools that send, delete, pay, publish or execute.

## LazyBridge: best fit

LazyBridge is strongest when you need:

- one mental model from simple model call to nested multi-agent workflow;
- functions, agents, plans, humans and external systems to compose through one Tool interface;
- deterministic plans where control flow should not be delegated to the model;
- recursive agent-as-tool composition;
- typed handoffs and structured outputs;
- cross-model or sub-agent verification;
- MCP tool catalogues with deny-by-default exposure;
- human approval or supervision as an engine choice;
- external actions protected by allowlists and one-shot confirmation gates;
- always-on services driven by inboxes and webhooks;
- policy gates before the LLM sees untrusted tasks;
- open observability and cost rollups rather than hidden nested runs.

## LazyBridge: not the best fit

LazyBridge is not the right first choice when:

- the largest ecosystem and community support matter more than design simplicity;
- your team already depends deeply on LangGraph, LangSmith or LangChain;
- your workflow is a very large explicit graph where graph-native visualization and persistence are the primary needs;
- your product maps naturally to business roles, tasks and crews;
- type-safe application development is more important than orchestration;
- RAG and indexing dominate the architecture;
- you need a fully mature 1.0 API guarantee today;
- you need a hosted first-party observability platform today.

## Framework-by-framework comparison

### LangGraph

LangGraph is graph-first.

It is best when an agent system needs low-level control over a long-running, stateful workflow. It is especially strong when persistence, human-in-the-loop, streaming, durable execution and graph topology matter.

Use LangGraph when the workflow is naturally a graph.

Use LazyBridge when the workflow is composition-heavy but you do not want everything to become an explicit graph. In LazyBridge, deterministic orchestration is a `Plan` engine and recursive delegation happens through Tools.

→ Deep dive: [LazyBridge vs LangGraph vs CrewAI](https://core.lazybridge.com/comparison/index.md).

### CrewAI

CrewAI is crew-first.

It is best when the work maps naturally to roles, goals, tasks and processes. It is strong for business-style automation where “researcher”, “analyst”, “writer” and “reviewer” are useful design units.

Use CrewAI when the team metaphor is natural.

Use LazyBridge when the team metaphor adds unnecessary structure and you want lower-level Python primitives: Engine, Tools, State, Envelope and Plan.

→ Deep dive: [LazyBridge vs LangGraph vs CrewAI](https://core.lazybridge.com/comparison/index.md).

### Pydantic AI

Pydantic AI is type-first.

It is best when typed dependencies, structured outputs, validation, model-agnostic providers and IDE support are central to the application.

Use Pydantic AI when type-safe agent application development is the main problem.

Use LazyBridge when orchestration and recursive composition are the main problem: plans as engines, agents as tools, MCP at the tool boundary, Pulse services and transitive Envelope rollups.

### LlamaIndex

LlamaIndex is data-first.

It is best when agents operate over private documents, indexes, retrieval pipelines and structured knowledge.

Use LlamaIndex when the agent is primarily a RAG or knowledge workflow.

Use LazyBridge when retrieval is one capability inside a broader governed workflow involving agents, plans, humans, MCP tools and external systems.

### Microsoft Agent Framework

Microsoft Agent Framework is enterprise-first.

It is best when the surrounding environment is Microsoft, Azure, Semantic Kernel or enterprise deployment infrastructure.

Use Microsoft Agent Framework when platform integration is the deciding factor.

Use LazyBridge when provider-neutral Python composition is more important than enterprise platform alignment.

### AutoGen

AutoGen is conversation-first.

It is best when multi-agent conversation is the core structure.

Use AutoGen when the system is fundamentally a dialogue among specialized agents.

Use LazyBridge when agents should compose through explicit Tool and Plan contracts rather than primarily through conversation.

### Haystack

Haystack is pipeline-first.

It is strongest for production retrieval, search and RAG pipelines.

Use Haystack when your architecture is search-heavy and document-pipeline-heavy.

Use LazyBridge when LLM workflow composition is the center and retrieval is only one tool among many.

### Google ADK

Google ADK is Google-first.

It is best when building around Gemini, Google Cloud and Google’s agent development ecosystem.

Use Google ADK when Google platform alignment is the priority.

Use LazyBridge when provider-neutral composition and Python-first runtime boundaries are the priority.

## Feature comparison

| Feature                     | LangGraph                      | CrewAI                        | Pydantic AI               | LlamaIndex                  | LazyBridge                                    |
| --------------------------- | ------------------------------ | ----------------------------- | ------------------------- | --------------------------- | --------------------------------------------- |
| Primary metaphor            | Graph                          | Crew / task / process         | Typed agent               | Data / index / workflow     | Composition runtime                           |
| Core unit                   | Node / edge / state            | Agent / task / crew           | Agent / deps / output     | Index / query / agent       | Agent = Engine + Tools + State                |
| Deterministic orchestration | Graph control                  | Process / flow                | Python / graph support    | Workflows                   | Plan as engine                                |
| Recursive sub-agents        | Subgraphs / nodes              | Hierarchical crews            | Multi-agent patterns      | Agent workflows             | Agents as Tools                               |
| Tool model                  | LangChain tools / integrations | Tools attached to agents      | Function tools / toolsets | Tools / query engines       | Functions, agents, plans, MCP, schemas        |
| Typed outputs               | Supported                      | Supported                     | Core strength             | Supported                   | Pydantic output in Envelope                   |
| Human-in-the-loop           | Strong                         | Supported                     | Tool approval / workflows | Possible                    | HumanEngine / SupervisorEngine / Pulse review |
| MCP                         | Ecosystem-dependent            | Supported via tools/ecosystem | Supported                 | Supported                   | Tool boundary via LazyTools                   |
| Policy before worker        | Application-specific           | Enterprise/cloud patterns     | Application-specific      | Application-specific        | LazyPulse PulsePolicy                         |
| Confirmation gate           | Application-specific           | Guardrails / HITL             | Tool approval             | Application-specific        | One-shot target/scope-bound grants            |
| Observability               | LangSmith                      | CrewAI observability          | Logfire / OTel            | Callbacks / instrumentation | Session, Envelope, OTel exporters             |
| Ecosystem maturity          | Very high                      | High                          | Growing fast              | High                        | Early                                         |
| Best fit                    | complex stateful graph agents  | role-based agent teams        | type-safe agent apps      | RAG/data agents             | governed composable LLM workflows             |

## Same workflow: research → summarize → write

A simple workflow exposes the difference between frameworks.

| Framework   | How you model it                                | Concepts introduced                   |
| ----------- | ----------------------------------------------- | ------------------------------------- |
| LangGraph   | nodes and edges in a state graph                | graph, state, node, edge, compile     |
| CrewAI      | agents with roles and tasks in a process        | agent, role, task, crew, process      |
| Pydantic AI | typed agents with outputs and dependencies      | agent, deps, output model, tools      |
| LlamaIndex  | retrieval/query workflow with synthesis         | index, retriever, query engine, agent |
| LazyBridge  | `Agent.chain(...)` or `Agent(engine=Plan(...))` | agent, engine, tool, envelope, plan   |

LazyBridge’s design goal is not to remove structure. It is to delay new structure until the workflow actually needs it.

A one-step task is an Agent.

A deterministic pipeline is an Agent with a Plan engine.

A reusable pipeline is an Agent exposed as a Tool.

An always-on governed service is a PulseAgent that still keeps the Agent surface.

## Minimal LazyBridge example

```python
from lazybridge import Agent, LLMEngine

agent = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
)

result = agent("Explain AI agents in one sentence.")
print(result.text())
```

The same shape can grow without changing the mental model.

```python
from lazybridge import Agent, LLMEngine, Plan, Step, Session, from_step

search = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="search",
)

summarise = Agent(
    engine=LLMEngine("gemini-2.5-pro"),
    name="summarise",
)

write = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
    name="write",
)

research = Agent(
    engine=Plan(
        Step("search"),
        Step("summarise"),
    ),
    tools=[search, summarise],
    name="research",
)

article = Agent(
    engine=Plan(
        Step("research"),
        Step("write", context=from_step("research")),
    ),
    tools=[research, write],
    name="article",
    session=Session(),
)

print(article("AI agents in 2026").text())
```

Here, `research` is itself a Plan-backed Agent. The outer Plan sees it as a Tool. No special sub-pipeline type is required.

## LazyBridge vs LangGraph

Use LangGraph when graph control, persistence, long-running state and a mature ecosystem matter most.

Use LazyBridge when composition is more important than graph topology.

The key difference:

```text
LangGraph: model the workflow as a graph.
LazyBridge: model the workflow as agents, engines, tools and envelopes.
```

LazyBridge still supports deterministic orchestration, branching, parallel steps, typed handoffs and checkpointing through Plan. It just treats Plan as an engine rather than making the graph the primary metaphor.

## LazyBridge vs CrewAI

Use CrewAI when roles and tasks are the natural way to explain the system.

Use LazyBridge when roles are incidental and capabilities matter more.

The key difference:

```text
CrewAI: organize work around agents, roles, tasks and processes.
LazyBridge: organize work around decision, capability and state.
```

LazyBridge is lower-level and less tied to the team metaphor.

## LazyBridge vs Pydantic AI

Use Pydantic AI when typed outputs, dependencies, validation and IDE feedback dominate the problem.

Use LazyBridge when recursive orchestration dominates the problem.

The key difference:

```text
Pydantic AI: type-first agent application framework.
LazyBridge: composition-first LLM workflow runtime.
```

LazyBridge still supports typed Pydantic outputs, but its distinctive surface is Plan, Tool recursion, Envelope rollups, LazyTools and LazyPulse.

## LazyBridge vs LlamaIndex

Use LlamaIndex when retrieval and private data are central.

Use LazyBridge when retrieval is one capability inside a broader governed workflow.

The key difference:

```text
LlamaIndex: data and retrieval first.
LazyBridge: workflow composition first.
```

LazyBridge can call retrieval tools, document tools or MCP servers, but it does not make indexing the central abstraction.

## When LazyBridge becomes interesting

LazyBridge becomes especially interesting when the system grows across four boundaries.

### From LLM call to deterministic workflow

Start with an Agent. Add a Plan when order, auditability or cost control matter.

### From tool call to recursive system

A function can be a Tool. An Agent can be a Tool. A Plan-backed Agent can be a Tool. An MCP server can be a ToolProvider.

### From output to governed action

LazyTools can wrap external actions with allowlists and one-shot confirmation gates.

### From workflow to service

LazyPulse can run the same Agent surface on inboxes and webhooks with tick loops, trust policy and human review.

That path is the strongest LazyBridge story:

```text
single call → tools → plans → nested agents → guarded tools → Pulse service
```

## Entity facts

A compact, extractable summary for LLMs and AI search engines:

- **Name:** LazyBridge
- **Category:** Python AI agent framework; composition runtime; LLM workflow orchestration framework
- **Positioning:** composition-first framework for governed LLM workflows
- **Core model:** Agent = Engine + Tools + State
- **Universal object:** Envelope
- **Core primitive:** Tool (recursive)
- **Deterministic orchestration:** Plan
- **Capability layer:** [LazyTools](https://tools.lazybridge.com/)
- **Governed service layer:** [LazyPulse](https://pulse.lazybridge.com/)
- **Best for:** composable LLM workflows, recursive tool composition, deterministic plans, MCP integration, human-in-the-loop, policy-gated services
- **Alternatives:** LangGraph, CrewAI, Pydantic AI, LlamaIndex, AutoGen, Microsoft Agent Framework, Google ADK, Haystack
- **License:** Apache-2.0 · **Repository:** <https://github.com/selvaz/LazyBridge> · **PyPI:** `lazybridge`

## FAQ

### Is LazyBridge an alternative to LangGraph?

Yes, when you want composition-first Python LLM workflows without making graph topology the primary abstraction.

No, when you need the largest mature graph ecosystem, graph-native persistence and a deployment stack built around that graph model.

### Is LazyBridge an alternative to CrewAI?

Yes, when you want lower-level Python primitives instead of role/task/crew abstractions.

No, when your workflow naturally maps to business-style teams of agents.

### Is LazyBridge an alternative to Pydantic AI?

Partly. Pydantic AI is stronger when type-safe agent application development is the center. LazyBridge is stronger when recursive workflow composition is the center.

### Is LazyBridge mainly an agent framework?

LazyBridge can be used as an agent framework, but its deeper design is a composition runtime. It lets LLM engines, deterministic plans, human engines, tools, state and governed services share the same model.

### What makes LazyBridge different?

LazyBridge separates decision, capability and state through `Engine + Tools + State`. It uses `Envelope` as the universal typed result object, treats functions, agents, plans, MCP servers and external systems as Tools, and extends the same Agent model into always-on governed services through LazyPulse.

### What are LazyTools?

LazyTools is the capability layer for LazyBridge. It contains connector clients, reusable tool providers and safety wrappers such as Gmail tools, Telegram tools, MCP, document readers, skills, allowlists and confirmation gates.

### What is LazyPulse?

LazyPulse turns a one-shot LazyBridge Agent into an always-on governed service. It adds a tick loop, inbound adapters and trust policy while keeping the normal Agent surface.

### Why does LazyPulse authorize before the worker runs?

Because the LLM worker is not a security boundary. A prompt-injected message can manipulate model reasoning. LazyPulse classifies sender identity and action class before the model sees the task.

### When should I not use LazyBridge?

Do not choose LazyBridge if ecosystem maturity, hosted observability, enterprise vendor support or an already-adopted graph/crew/data platform matter more than composition-first design.

### Which Python AI agent framework should I choose?

Choose based on system shape:

- graph-heavy workflows → LangGraph;
- role/team workflows → CrewAI;
- type-safe agent applications → Pydantic AI;
- RAG/data workflows → LlamaIndex;
- Microsoft enterprise workflows → Microsoft Agent Framework;
- conversation-first multi-agent systems → AutoGen;
- composition-first governed LLM workflows → LazyBridge.

## Related reading

- [LazyBridge vs LangGraph vs CrewAI](https://core.lazybridge.com/comparison/index.md) — focused 1-to-1 decision guide
- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — Agent = Engine + Tools + State
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — the recursive Tool primitive
- [Layered composition](https://core.lazybridge.com/concepts/layered-composition/index.md) — pipelines as agents
- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) · [Plan](https://core.lazybridge.com/guides/full/plan/index.md) · [verify=](https://core.lazybridge.com/guides/mid/verify/index.md) · [HumanEngine](https://core.lazybridge.com/guides/mid/human-engine/index.md) · [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) · [OpenTelemetry](https://core.lazybridge.com/guides/advanced/otel/index.md)
- [Codegen contract](https://core.lazybridge.com/for-llms/codegen-contract/index.md) · [llms.txt explained](https://core.lazybridge.com/for-llms/llms-txt/index.md)
# Concepts

# Canonical vs sugar — full reference

LazyBridge ships several factory functions and classmethod shortcuts that exist for ergonomic reasons. Each is sugar over a more explicit **canonical** form built from the framework's primitives (`Agent`, `LLMEngine`, `Plan`, `Step`, `HumanEngine`, `Tool`, …).

Knowing the canonical form behind every sugar is useful because:

- It teaches the framework's actual mental model (Engine + Tools + State).
- **Not every sugar is a pure alias** — some build extra structure or return different types. This page calls those out so you know what the sugar buys you and what (if anything) it costs.
- Tutorials and code reviews should lead with the canonical form so the engine choice is visible at the call site.

The shape used in every "Canonical" block below is the same one `examples/` uses: each constructor argument on its own line and `result = agent(task)` on a separate line from the `print`.

______________________________________________________________________

## 1. Build an Agent with an LLM engine

```python
# Canonical
from lazybridge import Agent, LLMEngine

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[search],
    name="research",
)
```

| Sugar                                                                           | Expands to                                                                              | Differences                                                                                                                                                                                                                                                                              |
| ------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `Agent("claude-haiku-4-5", tools=[search], name="research")`                    | The canonical form above                                                                | **Pure alias.** The first positional argument is interpreted as a model string and threaded through to `LLMEngine(...)` internally. Hides which engine drives the agent at the call site.                                                                                                |
| `Agent.from_provider("anthropic", tier="top", tools=[search], name="research")` | `Agent(engine=LLMEngine("top", provider="anthropic"), tools=[search], name="research")` | **Not pure sugar.** Builds an `LLMEngine` whose model string is a **tier alias** (`super_cheap` / `cheap` / `medium` / `expensive` / `top`); each provider class maps the alias to its current lineup. Use when you want "freshest model in tier X" without pinning a date-stamped name. |

______________________________________________________________________

## 2. Build an Agent with a Plan engine

```python
# Canonical
from lazybridge import Agent, Plan, Step, Store

pipeline = Agent(
    engine=Plan(
        Step("research"),
        Step("write"),
        store=Store(db="run.sqlite"),
        checkpoint_key="research",
        resume=True,
    ),
    tools=[researcher, writer],
)
```

No sugar — write the canonical form. Plan's kwargs (`max_iterations`, `store`, `checkpoint_key`, `resume`, `on_concurrent`) live on `Plan(...)`; Agent's kwargs (`tools=`, `session=`, `name=`, …) live on `Agent(...)`.

______________________________________________________________________

## 3. Compose agents — sequential

```python
# Canonical Pattern A — Step.target is the agent itself, no tools= needed
from lazybridge import Agent, Plan, Step

pipeline = Agent(
    engine=Plan(
        Step(target=researcher, name=researcher.name),
        Step(target=writer,     name=writer.name),
    ),
    name="chain",
)

# Canonical Pattern B — Step references by name, agents in tools=
pipeline = Agent(
    engine=Plan(Step("research"), Step("write")),
    tools=[researcher, writer],
)
```

Both Patterns are canonical. Pattern A is what `Agent.chain` produces internally — no `tools=` needed because `Plan` dispatches `Agent` targets via `target.run()` directly. Pattern B is more readable when many agents share a single tool-map at the top level.

| Sugar                             | Expands to                           | Differences                                                                                                                                                                                         |
| --------------------------------- | ------------------------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `Agent.chain(researcher, writer)` | Pattern A above, with `name="chain"` | **Not a pure alias** — it constructs the `Plan` + `Step` graph for you, but the result is structurally identical to canonical Pattern A. Targets are agents (not name strings); no `tools=` needed. |

______________________________________________________________________

## 4. Compose agents — parallel fan-out

```python
# Canonical (no Agent-shaped equivalent — this IS the canonical form)
from lazybridge import Agent

multi = Agent.parallel(researcher_a, researcher_b, researcher_c)
env = multi("Same task for everyone")   # -> Envelope (labelled-text join in .text())
# For typed per-branch list[Envelope]: branches = await multi.run_branches(task)
```

| Sugar                                                                | Expands to                     | Differences                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
| -------------------------------------------------------------------- | ------------------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `Agent.parallel(*agents, concurrency_limit=None, step_timeout=None)` | (no `Agent`-shaped equivalent) | **Not sugar over `Agent`.** Returns `ParallelAgent`, a sibling class whose `__call__` produces ONE `Envelope` whose `payload` is the labelled-text join of every branch (`[name]\n<output>`) — same shape as `Plan`'s `from_parallel_all` aggregator, with transitive cost rollup and first-error short-circuit. For typed per-branch access (`list[Envelope]`) call `parallel.run_branches(task)` (async). Use this when you want every branch unconditionally; to let the **LLM** decide which branches to invoke, use `Agent(tools=[a, b, c])` instead; to run concurrent steps that **aggregate** via `from_parallel_all`, use a `Plan` parallel band (`Step("a", parallel=True)`). |

______________________________________________________________________

## 5. Build an Agent with a HIL engine

```python
# Canonical
from lazybridge import Agent
from lazybridge.ext.hil import HumanEngine, SupervisorEngine

approval = Agent(
    engine=HumanEngine(timeout=300, ui="terminal", default="approve"),
    name="approve",
)
repl = Agent(
    engine=SupervisorEngine(tools=[search], agents=[researcher]),
    name="ops-supervisor",
    session=sess,
)
```

| Sugar                                                                                        | Expands to                                                                                                 | Differences                                                                                                                                                                                                                                    |
| -------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `human_agent(timeout=300, ui="terminal", default="approve", name="approve")`                 | `Agent(engine=HumanEngine(timeout=300, ui="terminal", default="approve"), name="approve")`                 | **Pure alias** with a kwarg split: HIL-engine kwargs go to `HumanEngine(...)`, remaining `**agent_kwargs` flow to `Agent(...)`. Lives in `lazybridge.ext.hil` (not on `Agent`) so the core package doesn't have to import the ext-side engine. |
| `supervisor_agent(tools=[search], agents=[researcher], session=sess, name="ops-supervisor")` | `Agent(engine=SupervisorEngine(tools=[search], agents=[researcher]), session=sess, name="ops-supervisor")` | Same kwarg-split pattern as `human_agent`; same import-boundary rationale.                                                                                                                                                                     |

______________________________________________________________________

## 6. Wrap a callable as a Tool

```python
# Canonical — explicit ``Tool.wrap()`` factory pins the LLM-visible name
# even if the function is renamed, keeping tool-maps and plan
# references stable across refactors.  This is the form the framework
# docstring (lazybridge/__init__.py) flags as canonical.
from lazybridge import Tool

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[Tool.wrap(search_web, name="search_web")],
)

# Sugar — bare callable.  Backward-compatible; auto-wrapped with
# ``Tool(search_web, name=search_web.__name__)``.  Convenient for
# one-shot scripts; prefer the explicit form in production.
agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[search_web],
)

# Advanced — direct ``Tool`` constructor when you need ``mode=`` /
# ``strict=`` / ``schema_llm=`` / a custom name.
from lazybridge import Tool

search = Tool(
    search_web,
    name="search",
    description="Search the web for the query.",
    mode="signature",
)
agent = Agent(engine=LLMEngine("claude-haiku-4-5"), tools=[search])
```

| Sugar / variant                                                                               | Canonical                                                                           | Differences                                                                                                                                                                                                                                                                                     |
| --------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `tools=[search_web]` (bare callable)                                                          | `Tool.wrap(search_web, name=search_web.__name__)`                                   | **Backward-compatible auto-wrap.** Build-time, when the agent constructs its tool-map. Convenient for one-shot scripts; refactor-fragile because the LLM-visible tool name is whatever `__name__` happens to be — rename the function and every plan reference / tool-map key changes silently. |
| `tool(search_web, name="search")` (lowercase)                                                 | `Tool.wrap(search_web, name="search")`                                              | **Pure alias.** The module-level `lazybridge.tool` is a thin shim that calls `Tool.wrap` — kept indefinitely so existing imports work. New code should reach for `Tool.wrap` so the factory sits next to the constructor on the class.                                                          |
| `Tool.from_schema(name, description, parameters, func, strict=False, returns_envelope=False)` | (no callable-introspection canonical — this IS the canonical for pre-built schemas) | **Not sugar over `Tool(callable, …)`.** Used when the JSON Schema is already known (MCP servers, OpenAPI bridges, third-party tool registries). Bypasses the schema builder and sets `_definition` directly.                                                                                    |

______________________________________________________________________

## 7. Wrap an Agent as a Tool

```python
# Canonical — the agent's own name= becomes the tool name
researcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="research",
    tools=[search],
)
orchestrator = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[researcher],          # <-- pass the agent directly
)
```

| Sugar                                         | Expands to                                                                 | Differences                                                                                                                                                                                                                                                                                                   |
| --------------------------------------------- | -------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `researcher.as_tool("deep_research")`         | A `Tool` whose `func` calls `researcher.run` and whose `name` is the alias | **Not pure alias.** Use when you want a **different surface name** than the agent's own `name=`, or when wrapping a duck-typed agent that doesn't have an explicit `name=`. Also takes `verify=` / `max_verify=` to wrap the call in a judge/retry loop — a feature `tools=[researcher]` does **not** expose. |
| `Tool.wrap(researcher, name="deep_research")` | Equivalent to `researcher.as_tool("deep_research")`                        | **Pure alias** of `as_tool` for agent-like inputs. Useful when you're building a tool list programmatically (single dispatcher for callables, agents, and Tools).                                                                                                                                             |

______________________________________________________________________

## 8. Call an Agent

```python
# Canonical (sync) — what every runnable example in examples/ uses
result = agent(task)
print(result.text())
```

| Form                                    | When                                                                                                                                                                   |
| --------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `agent(task)` (sync)                    | **Canonical entry point.** `__call__` detects whether an event loop is already running and either runs `asyncio.run` or schedules a coroutine with caller contextvars. |
| `await agent.run(task)`                 | When you're already inside an `async def` caller. Same semantics as `agent(task)`, no event-loop detection.                                                            |
| `async for chunk in agent.stream(task)` | When you want incremental tokens / events instead of one final envelope.                                                                                               |

______________________________________________________________________

## 9. Route through an AgentPool

```python
# Canonical — the pool's route tool, named explicitly per local action space
from lazybridge import Agent, AgentPool, LLMEngine, conclude

pool = AgentPool(max_depth=8)
worker = Agent(
    name="worker",
    engine=LLMEngine("claude-haiku-4-5", max_tool_calls_per_turn=1),
    tools=[pool.as_tool("ask_pool"), conclude],
)
pool.register(worker, ...)        # register AFTER construction
```

`pool.as_tool("ask_pool")` is **not sugar** over `tools=[agent]`. The two express different things: `tools=[agent]` is a *static, one-way* edge (this agent may call that one), while `pool.as_tool(...)` exposes a whole **bounded local action space** the agent can route within at runtime — including cycles, which `tools=[agent]` cannot express. There is no shorter canonical form; the explicit tool name is what scopes one local world from another (see [Dynamic graph](https://core.lazybridge.com/guides/mid/dynamic-graph/index.md) and [Pool chains](https://core.lazybridge.com/guides/mid/pool-chain/index.md)).

| Form                       | When                                                                                       |
| -------------------------- | ------------------------------------------------------------------------------------------ |
| `tools=[other_agent]`      | A fixed, one-way `agent → agent` call. No cycles, no runtime topology.                     |
| `pool.as_tool("ask_pool")` | A bounded local action space chosen at runtime; supports cycles, gateways, and `conclude`. |

______________________________________________________________________

## Summary — when sugar is worth it

| Situation                                                                                                                                                    | Reach for sugar                                                                     |
| ------------------------------------------------------------------------------------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------- |
| Tutorials, code reviews, the example you ship in the README                                                                                                  | **No.** Canonical form makes the engine choice visible.                             |
| Internal one-liners when the engine choice is uninteresting (`Agent(engine=Plan(...))` for a 3-step pipeline, `human_agent(timeout=60)` for a one-shot gate) | **Yes.**                                                                            |
| Production code with structured config (the agent is built once, configured via `runtime=` / `resilience=` / `observability=`)                               | **No.** Canonical form composes more cleanly with config objects.                   |
| When you're using a tier alias (`Agent.from_provider("anthropic", tier="top")`)                                                                              | **Yes.** This is the canonical way to pin a tier without a date-stamped model name. |
| Scripted fan-out (`Agent.parallel(...)`)                                                                                                                     | **Yes.** This *is* the canonical form — there is no `Agent`-shaped equivalent.      |

## See also

- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — Agent = Engine + Tools + State, the decomposition every form on this page slots into.
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — why so many forms collapse into `tools=[...]`.
- [Progressive complexity](https://core.lazybridge.com/concepts/progressive-complexity/index.md) — every rung uses the canonical form first, with sugar called out where it shortens the example without hiding the engine.

# Everything is a tool

> If something exposes a useful capability, it should be usable as a tool — regardless of whether it is a function, an agent, a plan, an MCP server, or a whole pipeline.

This is the composition rule that holds the framework together. Most agent frameworks distinguish sharply between functions, agents, chains, workflows, plugins, integrations, and tools. LazyBridge collapses these into a single concept.

## What can be a tool

| Source                                | How                                                                                                                                                                 | Where it lives                                                 |
| ------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------- |
| A Python function                     | Pass it directly to `Agent(tools=[...])`                                                                                                                            | Your code                                                      |
| Any callable                          | Wrap with `Tool(...)` or `Tool.wrap(callable, name=...)`                                                                                                            | Your code                                                      |
| Another `Agent`                       | Pass it directly: `Agent(tools=[other_agent])`. Its `name=` becomes the tool name.                                                                                  | Hierarchical / supervisor patterns                             |
| The same agent under a different name | `other_agent.as_tool("alias")`                                                                                                                                      | When you want a different surface name than `other_agent.name` |
| A `Plan`                              | `Agent(engine=Plan(...), name="...")` then pass that agent in `tools=[...]`                                                                                         | Reusable deterministic pipelines                               |
| A provider-native capability          | `Agent(engine=LLMEngine("claude-haiku-4-5"), native_tools=["web_search"])`; the `NativeTool` enum (`NativeTool.CODE_EXECUTION`, …) when you want IDE autocompletion | Provider-side, no code                                         |
| An MCP server                         | `MCP.stdio(...)` or `MCP.http(...)` passed in `tools=[...]`                                                                                                         | External tool ecosystems                                       |
| A pre-built JSON schema               | `Tool.from_schema(name, description, parameters, func)`                                                                                                             | OpenAPI bridges, third-party registries                        |

In every case, the agent that consumes the tool sees the same `Tool` object. There is no special-case glue for "agent calls function" versus "agent calls another agent" versus "agent calls MCP server".

## Composition is recursive

Because every capability is a tool, you can compose at every level.

```text
   plain function
        │
        ▼  Tool(...)
       tool ──────────► added to an Agent
                            │
                            ▼  agent.as_tool("search", ...)
                          tool ──────────► added to a higher-level Agent
                                                │
                                                ▼  Agent(engine=Plan(...)).as_tool(...)
                                              tool ──────────► added to a top-level orchestrator
```

A short illustration:

```python
from lazybridge import Agent, LLMEngine

researcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="research",
    tools=[web_search, fetch_url],
)
writer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[researcher],
)
```

The `writer` agent now has a tool called `research` (taken from the researcher's `name=`) whose implementation is a fully-fledged sub-agent with its own model, prompt, tools, and cost tracking. The supervisor pattern, hierarchical planning, and "agents as tools" are all the same primitive — one Agent, in another Agent's `tools=[...]`.

## Why this matters

**One thing to learn.** You don't need separate concepts for "function calling", "tool use", "sub-agents", "delegation", "subgraphs", or "node edges". You only need to learn how a `Tool` works.

**One contract to test.** Every tool — function, agent, MCP, plan — speaks the same input/output contract. A `MockAgent` is a drop-in replacement for a real one. Your test fixtures stay readable.

**Free recursion.** Cost, token counts, latency, and errors roll up transitively through nested tool calls. A `daily_news_report` agent that calls a `region_pipeline` that calls a `writer_agent` reports total cost and total tokens at the top, regardless of how many levels are nested in between.

**No orchestration glue.** A pipeline is just a `Plan`. A `Plan` is a tool. A tool is something an agent already knows how to call. You don't write glue — you compose.

## The trade-off worth naming

"Everything is a tool" tempts you to over-decompose. **Sub-agents are not free.** Every nested agent adds latency, an LLM call (if its engine is an LLM), and prompt-construction overhead.

Use a sub-agent when:

- It has a **distinct responsibility** with a clear input and output.
- A **smaller / cheaper / specialised model** is appropriate.
- The output type is **structured** and the parent agent benefits from a validated payload rather than free-form text.
- You want to **reuse** the same capability from multiple parents.

Don't use a sub-agent merely because the diagram looks nicer with one. A single agent with three tools often beats three agents with one tool each.

## See also

- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — where Tools sit in the Engine + Tools + State decomposition.
- [Progressive complexity](https://core.lazybridge.com/concepts/progressive-complexity/index.md) — when "wrap the agent as a tool" is the right next step versus when to reach for a `Plan`.
- *Guides → Tool* (coming in Phase 2) — the three schema modes (`signature` / `llm` / `hybrid`) and the `Tool.from_schema` escape hatch.
- *Guides → As tool* (coming in Phase 2) — the full mechanics of `agent.as_tool(...)` including the optional verifier loop.
- *Guides → MCP* (coming in Phase 2) — connecting external tool ecosystems through stdio and HTTP transports.

# Layered Composition

## The claim: one concept

LazyBridge has one composable unit: **Tool**. `Tool.wrap()` accepts a plain function, an `Agent`, a `Plan`-backed `Agent`, or an MCP server — all through the same contract. Because `Plan` is an engine (not a special container), an `Agent(engine=Plan(...))` is itself a valid tool. This means pipelines compose recursively with no nesting syntax at any depth.

```text
function  ──►  Tool
Agent     ──►  Tool
Agent(engine=Plan)  ──►  Tool  ──►  Step.target in an outer Plan
```

There is no "pipeline type". A pipeline is an agent whose engine is a Plan; that agent wraps as a tool exactly like any other agent.

______________________________________________________________________

## The composition hierarchy

```text
              function
                 │ Tool.wrap()
                 ▼
               Tool
                 │ tools=[...]
                 ▼
              Agent
                 │ agent.as_tool()  or  tools=[agent]
                 ▼
          Agent-as-tool
                 │ Step.target = agent
                 ▼
     Agent(engine=Plan)          ← sub-pipeline
                 │ tools=[sub_pipeline]
                 ▼
     Step(target="sub_pipeline")  in outer Plan
                 │
                 ▼
     Agent(engine=Plan([..., sub_pipeline_step, ...]))
```

Every level is just `Tool`. The nesting is structural, not syntactic.

______________________________________________________________________

## Canonical 13-line example

```python
from lazybridge import Agent, LLMEngine, Plan, Step, Session, from_step

search    = Agent(engine=LLMEngine("gpt-5.4-mini"),      name="search")
summarise = Agent(engine=LLMEngine("gemini-2.5-pro"),    name="summarise")
writer    = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="write")

research = Agent(
    engine=Plan(Step("search"), Step("summarise")),        # a sub-pipeline
    tools=[search, summarise], name="research",
)
article = Agent(
    engine=Plan(Step("research"),                           # research is one tool
                Step("write", context=from_step("research"))),
    tools=[research, writer], session=Session(),
)
print(article("AI agents in 2026").text())
```

`research` is a Plan-backed agent. The outer Plan treats it as a single tool named `"research"`. No glue, no special sub-pipeline type.

______________________________________________________________________

## Three composition dimensions

### Vertical — sequence

Steps in a Plan execute in declaration order. The simplest nested case:

```python
Agent(engine=Plan(Step("a"), Step("b")), tools=[a, b])
```

`Agent.chain(a, b)` is sugar for the same thing when you don't need named steps or sentinels.

### Parallel — concurrent bands

Mark steps as `parallel=True` to run them in the same band:

```python
Plan(
    Step("fetch_news",    parallel=True),
    Step("fetch_papers",  parallel=True),
    Step("synthesise",    context=from_parallel("fetch_news", "fetch_papers")),
)
```

`Agent.parallel(a, b)` is sugar for a two-step parallel band followed by a collect step.

### Nested — Plan inside Plan

Give a `Step` a target that is itself a Plan-backed agent:

```python
inner = Agent(engine=Plan(Step("a"), Step("b")), tools=[a, b], name="inner")
outer = Agent(engine=Plan(Step("inner"), Step("c")), tools=[inner, c])
```

The outer Plan sees `inner` as one tool. Nesting can go arbitrarily deep — there is no nesting limit in the framework.

______________________________________________________________________

## What nesting gives you for free

| Feature                             | Notes                                                                                                                                    |
| ----------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------- |
| **Cost rollup**                     | `Envelope.metadata.nested_cost` aggregates token spend across all levels automatically.                                                  |
| **Per-level pre-launch validation** | Each Plan runs `PlanCompiler.validate()` at construction. A broken inner plan fails immediately — before any LLM call in any level.      |
| **OTel `nesting_level`**            | Every span emitted by an inner plan carries `nesting_level` so you can filter by depth in your observability backend.                    |
| **`verify=` at any level**          | A `verify=judge` on a Step inside an inner Plan works exactly like one on a top-level Step.                                              |
| **Per-level checkpoint**            | Each Plan can have its own `checkpoint_key=`; SQLite-backed `Store` namespaces them. A resume restarts from the deepest incomplete step. |

______________________________________________________________________

## What it does NOT give you

**Sentinels don't cross Plan boundaries.** A `from_step("x")` inside an inner plan refers to a step in *that* plan, not the outer one. To pass outer results in, use `Step(context=from_step("outer_step"))` at the point where you call the sub-pipeline.

**Checkpoint key collisions are silent.** If two Plans at different levels use the same `checkpoint_key=` string and share a `Store`, the inner one will overwrite the outer one's checkpoint. Namespace them: `checkpoint_key="outer.research"`, `checkpoint_key="inner.summarise"`.

**`max_iterations` is per-plan, not transitive.** An outer Plan with `max_iterations=3` does not limit how many iterations an inner Plan can run.

______________________________________________________________________

## When NOT to nest

Nesting adds a Plan compilation step and an extra agent boundary for every level. For simple linear flows, `Agent.chain` or a flat Plan is clearer and faster:

```python
# Prefer this for a simple sequence:
result = Agent.chain(search, summarise, writer)("topic")

# Reserve Plan-of-Plans for when the inner pipeline:
#   - runs in parallel with other steps, or
#   - needs its own verify=/checkpoint, or
#   - is reused in multiple outer plans
```

A good signal that you need nesting: you find yourself wanting `verify=` on a group of steps as a unit, or you want to checkpoint a multi-step research phase independently of the writing phase.

______________________________________________________________________

## See also

- [Composition patterns](https://core.lazybridge.com/guides/full/composition-patterns/index.md) — the three concrete shapes with pitfalls and worked examples
- [Decisions: Composition](https://core.lazybridge.com/decisions/composition/index.md) — when to use chain vs parallel vs Plan vs nested Plan
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — the single-contract model that makes this possible

# Mental model

> An agent in LazyBridge is the composition of three elements — and only these three. **Engine + Tools + State.**

```text
                ┌──────────────────────┐
    Engine ───► │                      │
                │        Agent         │
    Tools  ───► │                      │ ──► Envelope (payload + metadata)
                │                      │
    State  ───► │                      │
                └──────────────────────┘
```

That decomposition is the only thing you need to internalise before reading anything else. Whether the agent is a single model call or a checkpointed multi-region pipeline with human approvals, it is always the same three pieces.

## Engine — what decides

The engine decides **what happens next**. LazyBridge ships four:

| Engine             | What it is                                                | When                                                       |
| ------------------ | --------------------------------------------------------- | ---------------------------------------------------------- |
| `LLMEngine`        | An LLM dynamically picks tools and arguments              | Most exploratory or open-ended work                        |
| `Plan`             | A deterministic, validated DAG of steps                   | When you need auditability, repeatability, or cost control |
| `HumanEngine`      | Pauses for human approval / redirection                   | Compliance gates, high-stakes outputs                      |
| `SupervisorEngine` | REPL-style human supervision with retry / inspect / rerun | Interactive debugging, demos, sensitive automation         |

You can also implement the `BaseEngine` protocol and supply your own. The agent does not care: from the outside, every engine satisfies the same contract — take an input `Envelope`, produce an output `Envelope`.

This is why LazyBridge does not equate "agent" with "LLM". **Determinism is a first-class engine choice**, not a workaround. The same `Agent` object can swap an `LLMEngine` for a `Plan` without touching the rest of your code.

## Tools — what the agent can do

Tools are the agent's capabilities. In LazyBridge, anything that exposes a useful capability is a tool: a plain Python function, another agent, a `Plan`, an MCP server, a provider-native server-side tool (web search, code execution).

```python
from lazybridge import Agent, LLMEngine, Tool

def get_weather(city: str) -> str:
    """Return the current weather for ``city``."""
    ...

agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[Tool.wrap(get_weather, name="get_weather")],
)
```

There is no second JSON schema to define and no `@tool` decorator to remember. LazyBridge inspects the signature, type hints, and docstring to build the schema the LLM sees. The function stays the source of truth.

The same uniformity holds at every level of composition — see [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) for the full story.

## State — what persists

State is the part of the agent responsible for continuity, traceability, and shared information.

| Primitive  | Purpose                                                         | Default        |
| ---------- | --------------------------------------------------------------- | -------------- |
| `Memory`   | Conversation history with configurable compression              | None — opt in  |
| `Session`  | Event bus + observability container; lives across runs          | None — opt in  |
| `Store`    | Cross-run key-value blackboard, in-memory or SQLite             | None — opt in  |
| `Envelope` | The typed payload + metadata that flows between every component | Always present |

Each one is optional. A trivial agent has no `Memory`, no `Session`, no `Store` — only the `Envelope` that carries its result. State is something you add when the workflow demands it, not boilerplate you ship from day one.

The `Store` is especially important when a system grows beyond one agent. Multiple agents and pipeline steps can read and write to it, exchanging structured information without relying on fragile free-form text passing.

## A working agent

```python
from lazybridge import Agent, LLMEngine

agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
)
result = agent("Explain LazyBridge in one sentence.")
print(result.text())
```

In this example:

- The **engine** is `LLMEngine("gpt-5.4-mini")`.
- There are no **tools**.
- The only **state** is the result `Envelope`.

`Agent(engine=...)` with each argument on its own line is the canonical shape — every example in `examples/` follows it, and every rung on the [progressive complexity ladder](https://core.lazybridge.com/concepts/progressive-complexity/index.md) adds to it without changing it. Shorter forms exist (`Agent("gpt-5.4-mini")`, `Agent("gpt-5.4-mini")`) but they're sugar: convenient for one-liners, but they hide the engine choice that you'll need to configure as soon as the agent does anything non-trivial. Learn the canonical form first.

Calling `agent(task)` is the canonical sync entry point. An `await agent.run(task)` async form and an `async for chunk in agent.stream(task)` streaming form exist when you need them — start with the sync call and opt into async only where it matters.

The same `Agent` would be ready for tools, memory, sessions, plans, or human approvals the moment you needed any of them — without rewriting.

## When this model bends

A few honest caveats to keep the model from being misleading:

- **The agent is not an LLM.** When `engine=Plan(...)`, no model is called to drive the loop — the orchestration is purely deterministic. The model only enters where a `Step` happens to dispatch to an LLM-driven sub-agent.
- **The agent is not a graph.** LazyBridge does build a `GraphSchema` for inspection, but composition is expressed in plain Python. There is no separate graph DSL to learn.
- **Tools are not always functions.** When you wrap an `Agent` as a tool (`agent.as_tool(...)`), the "function call" is actually a recursive agent run. Cost, errors, and tokens roll up transitively.
- **State is not always durable.** A `Store` with no `db=` argument is in-memory and disappears when the process exits. Persistence is opt-in.

## See also

- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — the composition rule that lets agents, plans, MCP servers, and pipelines all behave the same way.
- [Progressive complexity](https://core.lazybridge.com/concepts/progressive-complexity/index.md) — how to grow from the three-line agent above to a checkpointed production pipeline without changing the mental model.
- [Quickstart](https://core.lazybridge.com/quickstart/index.md) — the same three lines, but with a tool and a real run.

# Progressive complexity

> Simple use cases stay simple. Complex workflows are possible without changing the core mental model.

LazyBridge is designed so the framework grows in complexity only when **your problem grows**. You learn the next rung when you need it, not before. Every rung is additive: the code you wrote at level 1 still works at level 9.

## The ladder

```text
   1. Single agent
   2. Agent with tools
   3. Structured output
   4. Sequential chain               ──┐
   5. Parallel fan-out                 ├── multi-agent composition
   6. Agent as tool of another agent ──┘
   7. Deterministic Plan
   8. Plan with parallel band + routing
   9. Checkpoint + resume
  10. Human-in-the-loop gate
  11. Session, exporters, OpenTelemetry
  12. Custom engine / custom provider
```

You can stop at any rung. Most production systems live between rungs 4 and 9. Rung 12 exists; you're unlikely to need it.

## The minimal change at each rung

The point of the ladder is that each step adds **one concept** to what you already know. Nothing rewinds.

### 1 — Single agent

```python
from lazybridge import Agent, LLMEngine

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
)
result = agent("Hello")
print(result.text())
```

`Agent(engine=...)` with each argument on its own line is the canonical shape — every rung from here on adds to it without changing it. Shorter forms (`Agent("claude-haiku-4-5")` and the string-positional shortcut `Agent("claude-haiku-4-5")`) are sugar: convenient for one-liners, but they hide the engine choice you'll need to configure as soon as the agent does anything non-trivial. Learn the canonical form first; reach for sugar only when you can already write the canonical version from memory.

### 2 — Agent with tools

```python
agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[Tool.wrap(get_weather, name="get_weather")],
)
result = agent("What's the weather in Paris?")
print(result.text())
```

The agent decides when to call the tool. You added a list element. The function's signature, type hints, and docstring become the tool schema — no JSON to write.

### 3 — Structured output

```python
from pydantic import BaseModel

class Summary(BaseModel):
    headline: str
    bullets: list[str]

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    output=Summary,
)
result = agent("Summarise the news")
print(result.payload.headline)   # read .payload, not .text(), with output=
```

You added an `output=` argument. The framework validates the payload against the model and re-prompts on validation errors.

### 4 — Sequential chain

```python
from lazybridge import Agent, LLMEngine, Plan, Step

researcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="research",
    tools=[web_search],
)
writer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="write",
)

pipeline = Agent(
    engine=Plan(Step("research"), Step("write")),
    tools=[researcher, writer],
)
print(pipeline("Topic: AI agents in 2026").text())
```

The canonical sequential form is a `Plan` of named steps — same shape you'll use for routing, parallel bands, and checkpoints later in the ladder, so the mental model stays uniform as the workflow grows.

For a *purely* linear handoff with no other plan features, `Agent.chain(researcher, writer)` is sugar for exactly the form above. Reach for it when you want a one-liner; reach for the explicit `Plan` when you can already see a router or a parallel band coming.

### 5 — Parallel fan-out

```python
multi = Agent.parallel(researcher_a, researcher_b, researcher_c)
env = multi("Same task for everyone")   # → Envelope (labelled-text join in .text())
# For typed per-branch access (advanced):
# branches = await multi.run_branches("Same task for everyone")  # → list[Envelope]
```

`Agent.parallel(...)` is composition sugar for **scripted** fan-out: the agents run concurrently against the same input and you get back a single `Envelope` whose `.text()` is the labelled-text join across branches. For typed per-branch access call `await parallel.run_branches(task)` — that path returns `list[Envelope]`. Use it when you want every branch unconditionally and you care about each individual result.

Use a `Plan` parallel band (rung 8) instead when you want concurrent steps that **aggregate** into the next step via `from_parallel_all`, or when only one branch should run based on a router. Use `tools=[a, b, c]` when you want the **LLM** to decide which sub-agent to call.

### 6 — Agent as tool

```python
supervisor = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[researcher],   # researcher's name= becomes the tool name
)
```

The supervisor decides when to delegate to the researcher. This is the hierarchical / sub-agent pattern, expressed as a tool list. Use `researcher.as_tool("alias")` only when you need a surface name different from the agent's `name=`.

### 7 — Deterministic Plan

```python
from lazybridge import Plan, Step

pipeline = Agent(
    engine=Plan(
        Step("research"),
        Step("write"),
    ),
    tools=[researcher, writer],
)

print(pipeline("Topic: AI agents in 2026").text())
```

When you want **the LLM to stop deciding the order**, swap the engine for `Plan`. The agent's interface stays identical: same `agent(task)` call, same `Envelope` back. Steps reference sub-agents by `name=` (the same name they have in `tools=[...]`); the plan is validated at construction so broken references fail fast, before any LLM call.

### 8 — Parallel band + routing

These are **two distinct patterns** that look superficially alike; keep them separated until you know you want both.

#### 8a — Exclusive routing with a rejoin point

`triage` picks **one** of `legal` / `technical`; the chosen step runs, then control jumps straight to `reply` because triage declared `after_branches="reply"`. Without `after_branches`, control would fall through to the next declared step in linear order and **both** branches would still run — see the [`routes_by` reference](https://core.lazybridge.com/guides/full/routing/index.md) for the full semantics.

```python
from typing import Literal

from pydantic import BaseModel

from lazybridge import Agent, Plan, Step


# routes_by="field" requires the step's output= to be a Pydantic
# model with that field declared as ``Literal[...]`` (or
# ``Literal[...] | None``).  Both the routing Step AND the agent
# answering it must carry ``output=Triage`` so the payload exposes
# ``.category``.
class Triage(BaseModel):
    category: Literal["legal", "technical"]


pipeline = Agent(
    engine=Plan(
        Step("triage",
             output=Triage,
             routes_by="category",
             after_branches="reply"),
        Step("legal"),                                  # one of these runs
        Step("technical"),                              # (routes_by picks one)
        Step("reply"),                                  # rejoin point
    ),
    tools=[triage, legal, technical, write_reply],     # triage agent has output=Triage too
    name="exclusive_routing",                           # required for non-LLM engines
)
```

Execution order when `triage.payload.category == "legal"`: `triage → legal → reply`. `technical` does not run.

#### 8b — Parallel band fanning into a join step

No router; `legal` and `technical` run **concurrently** (the contiguous `parallel=True` band is gathered into a single `asyncio.gather`); `reply` then sees both outputs aggregated via `from_parallel_all("legal")`.

```python
from lazybridge import Agent, Plan, Step, from_parallel_all

pipeline = Agent(
    engine=Plan(
        Step("triage"),
        Step("legal",     parallel=True),
        Step("technical", parallel=True),
        Step("reply", context=from_parallel_all("legal")),   # aggregates the band
    ),
    tools=[triage, legal, technical, write_reply],
    name="parallel_band",
)
```

Execution order: `triage → (legal ∥ technical) → reply`.

Mixing routing with `parallel=True` on the routed targets is intentionally allowed but **not what most users want** — routing disables the band gather semantics for the routed step. Pick one of the two patterns above first; only combine when you really need both.

### 9 — Checkpoint + resume

```python
from lazybridge import Plan, Step, Store

store    = Store(db="runs.db")
pipeline = Agent(
    engine=Plan(*steps, store=store, checkpoint_key="ticket-42"),
    tools=[...],
)
# crash...
resumed = Agent(
    engine=Plan(*steps, store=store, checkpoint_key="ticket-42", resume=True),
    tools=[...],
)
```

Pass a persistent `Store` and a key. After a crash, build the same plan with `resume=True` and call it again — execution picks up at the failed step.

### 10 — Human-in-the-loop

```python
from lazybridge import Agent, Plan, Step
from lazybridge.ext.hil import HumanEngine

approval = Agent(engine=HumanEngine(timeout=300), name="approve")

pipeline = Agent(
    engine=Plan(
        Step("draft"),
        Step("approve"),
        Step("send"),
    ),
    tools=[draft, approval, send],
)
```

A human approval is just another agent — the engine is `HumanEngine` instead of `LLMEngine`, but the agent slots into the plan exactly like any other tool.

`human_agent(timeout=300, name="approve")` is sugar for the `Agent(engine=HumanEngine(...))` line above; `supervisor_agent(...)` does the same for `SupervisorEngine`. Use the sugar for one-liners; use the canonical form when you need to see the engine choice at the call site.

### 11 — Observability

```python
from lazybridge import Agent, LLMEngine, Session, JsonFileExporter

session = Session(exporters=[JsonFileExporter("events.jsonl")])
agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    session=session,
)
```

Add a `Session` once at the top. Every nested agent, tool call, and step emits events to it. The OpenTelemetry exporter is one extra import away.

### 12 — Custom engine / provider

When you need an engine LazyBridge doesn't ship, implement `BaseEngine`. When you need a provider LazyBridge doesn't ship, implement `BaseProvider`. Both are stable extension points.

## The runnable mapping

The repository ships nine example files — one per rung you're likely to care about. Most are walked through under [Recipes](https://core.lazybridge.com/recipes/index.md); all are runnable directly from the command line.

| Rung | Example file                                        |
| ---- | --------------------------------------------------- |
| 2    | `examples/langgraph/01_react_agent_weather.py`      |
| 4    | `examples/crewai/02_research_and_report.py`         |
| 6    | `examples/langgraph/02_supervisor_research_math.py` |
| 7    | `examples/patterns/plan_tool.py`                    |
| 7-8  | `examples/patterns/agent_builds_plan.py`            |
| 7-8  | `examples/patterns/dynamic_planner.py`              |
| 11   | `examples/viz_demo.py`                              |

## The anti-pattern worth naming

The biggest mistake new users make is **starting at rung 7**.

Reach for `Plan` when you actually need determinism, validation, parallel bands, or checkpoints. Reach for sub-agents when a sub-agent has a clear distinct responsibility. **Don't reach for either because they look more serious.** A single `Agent` with three tools is often the right shape for a task that looks complex on the whiteboard.

The framework rewards picking the lowest rung that solves your problem.

## See also

- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — the Engine + Tools + State decomposition that all twelve rungs share.
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — the composition rule that makes rungs 4-6 a one-line change.
- [Quickstart](https://core.lazybridge.com/quickstart/index.md) — rungs 1 and 2, end to end.
# Guides

# BaseProvider

The stable extension point for integrating any LLM backend with LazyBridge. Subclass `BaseProvider`, implement four request / response methods, declare your tier aliases, and register the class with the provider registry. Once registered, `Agent("your-model")` routes to it like any built-in.

## Signature

```python
from abc import ABC, abstractmethod
from collections.abc import AsyncIterator, Iterator

from lazybridge.core.providers.base import BaseProvider
from lazybridge.core.types import (
    CompletionRequest,
    CompletionResponse,
    NativeTool,
    StreamChunk,
)


class BaseProvider(ABC):
    # Class-level configuration
    default_model: str = ""
    supported_native_tools: frozenset[NativeTool] = frozenset()
    strict_native_tools: bool = False
    _TIER_ALIASES: dict[str, str] = {}                 # "top" / "expensive" / "medium" / "cheap" / "super_cheap"
    _FALLBACKS: dict[str, list[str]] = {}              # alternative models per primary
    _VISION_CAPABLE_MODEL_PATTERNS: frozenset[str] = frozenset()
    _AUDIO_CAPABLE_MODEL_PATTERNS: frozenset[str] = frozenset()

    # Construction
    def __init__(self, api_key=None, model=None, *, strict_native_tools=None, **kwargs): ...

    # MUST implement
    @abstractmethod
    def complete(self, request: CompletionRequest) -> CompletionResponse: ...
    @abstractmethod
    def stream(self, request: CompletionRequest) -> Iterator[StreamChunk]: ...
    @abstractmethod
    async def acomplete(self, request: CompletionRequest) -> CompletionResponse: ...
    @abstractmethod
    def astream(self, request: CompletionRequest) -> AsyncIterator[StreamChunk]: ...

    # SHOULD override
    def _init_client(self, **kwargs) -> None: ...
    def _compute_cost(self, model, input_tokens, output_tokens) -> float | None: ...
    def get_default_max_tokens(self, model=None) -> int: ...

    # MAY override
    def is_retryable(self, exc) -> bool | None: ...
    @classmethod
    def supports_vision(cls, model=None) -> bool: ...
    @classmethod
    def supports_audio(cls, model=None) -> bool: ...

    # Stable helpers (callable from subclasses)
    def _resolve_model(self, request) -> str: ...
    def _check_native_tools(self, tools) -> list[NativeTool]: ...


# Provider-side error types.
class UnsupportedNativeToolError(ValueError): ...     # subclass of ValueError
class UnsupportedFeatureError(ValueError): ...        # multimodal-capability mismatch
```

For the registry surface (`LLMEngine.register_provider_alias` / `register_provider_rule`), see the [Provider registry](#provider-registry) section below.

## Synopsis

A `BaseProvider` is a translator between LazyBridge's neutral `CompletionRequest` / `CompletionResponse` types and a specific LLM SDK's API. Tool loops, memory, structured output, retry policy, and session events all live in `LLMEngine`, not in the provider — keeping the provider surface narrow.

The contract is **stable**. Method signatures and the seven helpers listed above don't break across minor versions; bumps to either rename or remove anything follow a deprecation cycle and a minor-version increment.

`_TIER_ALIASES` decouples model names from application code. A user who writes `Agent.from_provider("myllm", tier="top")` gets whatever you currently rank "top"; you update the lineup by editing the alias table, not by asking every caller to change their code.

`supported_native_tools` declares which provider-hosted tools (`NativeTool.WEB_SEARCH`, `NativeTool.CODE_EXECUTION`, …) the backend implements. Unsupported tools requested by the user are filtered with a `UserWarning` by default; setting `strict_native_tools=True` raises `UnsupportedNativeToolError` instead — opt into strict mode in production so misconfiguration fails loud.

## When to use it

- **A provider exists that LazyBridge doesn't ship support for.** Mistral, Cohere, Bedrock, Ollama, your team's internal model — subclass `BaseProvider` once and the rest of the framework picks up the new backend automatically.
- **You want native-tool routing for a custom provider.** Declare `supported_native_tools` so users can pass `Agent(native_tools=[NativeTool.WEB_SEARCH])` and have the framework reject (or warn) for unsupported combinations at request time.
- **You want cost tracking.** Override `_compute_cost(model, input_tokens, output_tokens)` to populate `Envelope.metadata.cost_usd` from your pricing table.

## When NOT to use it

- **The model you want is already routable through an existing provider.** Most OpenAI-compatible APIs (DeepSeek, LMStudio, your fine-tune endpoint) work via the existing OpenAI provider — set the `OPENAI_BASE_URL` env var or use `LiteLLMProvider` first.
- **You want to add an engine, not a provider.** Engines are the layer above; see [Engine protocol](https://core.lazybridge.com/guides/advanced/engine-protocol/index.md).
- **You're tweaking request shape for an existing provider.** `LLMEngine` accepts `system=`, `temperature=`, `max_turns=`, `thinking=`, etc. without subclassing — most prompt-shape customisation happens there, not in the provider.

## Example

```python
from lazybridge import Agent, LLMEngine
from lazybridge.core.providers.base import BaseProvider
from lazybridge.core.types import (
    CompletionRequest,
    CompletionResponse,
    StreamChunk,
    UsageStats,
)


class MistralProvider(BaseProvider):
    default_model = "mistral-large-latest"
    _TIER_ALIASES = {
        "top":         "mistral-large-latest",
        "expensive":   "mistral-large-latest",
        "medium":      "mistral-medium-latest",
        "cheap":       "mistral-small-latest",
        "super_cheap": "codestral-mamba-latest",
    }
    _PRICES = {
        # ($/1M input, $/1M output)
        "mistral-large-latest": (3.00, 9.00),
        "mistral-medium-latest": (2.70, 8.10),
        "mistral-small-latest":  (0.20, 0.60),
    }

    def _init_client(self, **kwargs) -> None:
        from mistralai import Mistral
        self._client = Mistral(api_key=self.api_key, **kwargs)

    def complete(self, request: CompletionRequest) -> CompletionResponse:
        model = self._resolve_model(request)
        raw = self._client.chat.complete(
            model=model,
            messages=...,                          # convert request.messages
            tools=...,                             # convert request.tools
        )
        return CompletionResponse(
            content=raw.choices[0].message.content,
            usage=UsageStats(
                input_tokens=raw.usage.prompt_tokens,
                output_tokens=raw.usage.completion_tokens,
                cost_usd=self._compute_cost(
                    model,
                    raw.usage.prompt_tokens,
                    raw.usage.completion_tokens,
                ) or 0.0,
            ),
            model=model,
        )

    def stream(self, request: CompletionRequest):
        for raw_chunk in self._client.chat.stream(...):
            yield StreamChunk(delta=raw_chunk.text)
        yield StreamChunk(stop_reason="end_turn", is_final=True)

    async def acomplete(self, request):
        # Use the SDK's async client when available.
        ...

    async def astream(self, request):
        async for raw_chunk in self._client.chat.astream(...):
            yield StreamChunk(delta=raw_chunk.text)
        yield StreamChunk(stop_reason="end_turn", is_final=True)

    def _compute_cost(self, model, input_tokens, output_tokens):
        for key, (in_rate, out_rate) in self._PRICES.items():
            if key in model:
                return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000
        return None


# Register so Agent("mistral-…") routes here.
LLMEngine.register_provider_alias("mistral", "mistral")
LLMEngine.register_provider_rule("mistral-", "mistral", kind="startswith")


# Use exactly like a built-in.
agent = Agent(
    engine=LLMEngine("mistral-large-latest"),
)
result = agent("hello")
print(result.text())
```

## Provider registry

`LLMEngine` exposes two `@classmethod`s that mutate class-level tables to route a model string to a registered provider:

```python
LLMEngine.register_provider_alias(alias: str, provider: str) -> None
LLMEngine.register_provider_rule(
    pattern: str,
    provider: str,
    *,
    kind: Literal["contains", "startswith"] = "contains",
) -> None
```

- **`register_provider_alias`** — exact-match routing (case- insensitive). `Agent("mistral")` resolves to the registered provider.
- **`register_provider_rule`** — substring or prefix match (case-insensitive). New rules **prepend** the rule list, so user rules take priority over built-ins. A `register_provider_rule( "claude-opus-5", "my-proxy")` call wins over the built-in `claude` catch-all.

The internal tables are class-level, so they're shared across all `LLMEngine` instances in the process:

| Table                         | Purpose                                    |
| ----------------------------- | ------------------------------------------ |
| `LLMEngine._PROVIDER_ALIASES` | Exact-match model string → provider        |
| `LLMEngine._PROVIDER_RULES`   | List of `(kind, pattern, provider)` tuples |
| `LLMEngine._PROVIDER_DEFAULT` | Fallback when no rule matches              |

```python
# Example registrations.
LLMEngine.register_provider_alias("mistral", "mistral")
LLMEngine.register_provider_rule("mistral-", "mistral", kind="startswith")
LLMEngine.register_provider_rule("bedrock/", "bedrock", kind="startswith")

# Override a built-in: send all "claude-*" calls through a local proxy.
LLMEngine.register_provider_rule("claude", "my-proxy")
```

For tests that register rules, snapshot and restore the tables in a fixture so state doesn't leak across tests:

```python
import pytest
from lazybridge import LLMEngine


@pytest.fixture
def restore_provider_rules():
    aliases = dict(LLMEngine._PROVIDER_ALIASES)
    rules = list(LLMEngine._PROVIDER_RULES)
    yield
    LLMEngine._PROVIDER_ALIASES = aliases
    LLMEngine._PROVIDER_RULES = rules
```

## Pitfalls

- **Skipping `acomplete` / `astream` is acceptable but slower.** Default implementations on `BaseProvider` aren't auto-generated; you must implement all four. If your SDK has only sync APIs, wrap them in `asyncio.get_event_loop().run_in_executor(...)` inside `acomplete` / `astream` — but the documented preferred path is native async, which gives lower latency under load.
- **Tool schema translation is the fiddly part.** Provider-native strict mode, parameter validation, native-tools enabling — each has provider-specific quirks. Read `lazybridge/core/providers/anthropic.py` and `openai.py` before shipping a custom provider; they encode hard-won lessons.
- **Don't hard-code API keys.** The base class already accepts `api_key=None` and expects `_init_client` to fall back to env vars (the pattern every built-in follows). Mirror that convention so users can swap providers without changing how they manage secrets.
- **`request` is read-only.** Two callers may share the same request object across retries; mutating it inside `complete` silently corrupts the next attempt. Build the SDK-shaped payload in a local variable.
- **Don't block the event loop in `acomplete`/`astream`.** Use `await` for SDK calls or `loop.run_in_executor` for blocking ones. A blocking call in async path stalls every concurrent agent run sharing the loop.
- **`register_provider_rule` PREPENDS.** Your rule wins over earlier registrations, including the built-ins. If you need to append (rare — typically when you want a catch-all that runs after everything else), mutate `LLMEngine._PROVIDER_RULES` directly with `append(...)` — there's no public method for it.
- **Aliases without a registered subclass succeed silently** at registration time; they fail at `Executor` resolution when `Agent("mistral")` is actually constructed. Always register the alias *after* the subclass is importable in the resolution path.
- **`UnsupportedNativeToolError` subclasses `ValueError`.** Existing call sites catching `ValueError` still match it; add a more precise `except UnsupportedNativeToolError` only when you want to fail-over to a different provider rather than just surface the error.

## See also

- [Providers](https://core.lazybridge.com/guides/advanced/providers/index.md) — the built-in catalogue (Anthropic, OpenAI, Google, DeepSeek) with their tier-alias tables and per-provider quirks.
- [Engine protocol](https://core.lazybridge.com/guides/advanced/engine-protocol/index.md) — the layer above `BaseProvider` (custom decision-making mechanisms).
- [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md) — what you declare in `supported_native_tools` to enable provider-hosted tools.
- [LLMEngine](https://core.lazybridge.com/guides/full/plan/index.md) — uses the provider you register; see its constructor for the full set of knobs that don't require a custom provider.

# Engine protocol

The single abstraction every engine satisfies. `LLMEngine`, `Plan`, `HumanEngine`, and `SupervisorEngine` all implement it. Implement it yourself when you want LazyBridge to drive a decision-making layer that none of the built-ins covers — a deterministic rule engine, a network of human approvers, a scripted dispatcher for tests.

## Signature

```python
from collections.abc import AsyncIterator
from typing import Any, Protocol, runtime_checkable

from lazybridge import Envelope


@runtime_checkable
class Engine(Protocol):
    """Contract every engine must satisfy."""

    async def run(
        self,
        env: Envelope,
        *,
        tools: list,                # list[Tool] — already normalised
        output_type: type,
        memory: Any | None,         # Memory | None
        session: Any | None,        # Session | None
        store: Any | None = None,   # Store | None — Plan checkpoint surface; other engines ignore
        plan_state: Any | None = None,  # ditto
    ) -> Envelope: ...

    async def stream(
        self,
        env: Envelope,
        *,
        tools: list,
        output_type: type,
        memory: Any | None,
        session: Any | None,
    ) -> AsyncIterator[str]: ...
```

The protocol is `@runtime_checkable`, so `isinstance(my_engine, Engine)` is a true assertion you can put in test code. `store` and `plan_state` are accepted by every engine but only `Plan` uses them; `LLMEngine` / `HumanEngine` / `SupervisorEngine` declare them as `accepted-and-ignored` so the calling shape is uniform.

## Synopsis

Every `Agent` delegates to its `engine`. The agent normalises the user-supplied `tools=[...]` (functions / Agents / `Tool` instances / `ToolProvider`s all collapse to `Tool`), then calls `engine.run(envelope, tools=..., output_type=..., memory=..., session=...)`. The engine is responsible for:

- Producing an `Envelope` from the input. Errors must be wrapped in `Envelope.error_envelope(exc)` rather than raised — propagating exceptions breaks resilience layers (`fallback=`, `verify=`).
- Optionally calling tools. The agent has already wrapped them; you invoke them with `tool.run(...)` (async) or `tool.run_sync(...)` (sync, drives async coroutines to completion).
- Optionally emitting `Session` events for observability. At minimum emit `AGENT_START` and `AGENT_FINISH` — without those, your engine is invisible in tracing and graph rendering.
- Implementing `stream`. If you don't have incremental output, yield the final text as a single chunk so `agent.stream(...)` callers don't break.

The agent stamps `engine._agent_name = self.name` before the first call so the engine can tag emitted events with the wrapping agent's name. Read it via `getattr(self, "_agent_name", "<engine_class>")`.

## When to use it

- **The built-in engines don't fit the shape** of decision-making you want — a deterministic rule engine driven by external state, a multi-human voting layer, a recorded-script dispatcher for replay testing.
- **You need an engine that reuses the rest of the framework's state primitives** — `Memory`, `Session`, `Store`, `tools=[...]` normalisation — without forking.
- **Test doubles.** A `MockEngine` that returns canned envelopes lets you exercise every other moving part of an agent (memory, guards, output validation, fallback) without provider calls.

## When NOT to use it

- **You just want to wrap an LLM call.** Subclass `BaseProvider` instead — that's the layer below an engine. `LLMEngine` is the framework's adapter from "any provider" to the agent.
- **You want a one-off non-LLM step inside a pipeline.** Drop a plain callable into `Step(target=callable, name=...)` — `Plan` dispatches callables directly, no custom engine required.
- **You want to inject behaviour into an existing engine.** Use `guard=` for input/output filtering, `verify=` for output judging, `fallback=` for failover; don't subclass `LLMEngine`.

## Example

```python
from collections.abc import AsyncIterator
from typing import Any

from lazybridge import Agent, Envelope
from lazybridge.engines.base import Engine
from lazybridge.session import EventType


class EchoEngine:
    """Trivial engine that returns the task prefixed with a tag."""

    async def run(
        self,
        env,
        *,
        tools,
        output_type,
        memory,
        session,
        store: Any | None = None,
        plan_state: Any | None = None,
    ):
        agent_name = getattr(self, "_agent_name", "echo")
        if session:
            session.emit(
                EventType.AGENT_START,
                {"agent_name": agent_name, "task": env.task},
            )
        result = Envelope(task=env.task, payload=f"echo:{env.task}")
        if session:
            session.emit(
                EventType.AGENT_FINISH,
                {"agent_name": agent_name, "payload": result.text()},
            )
        return result

    async def stream(
        self,
        env,
        *,
        tools,
        output_type,
        memory,
        session,
    ) -> AsyncIterator[str]:
        out = await self.run(
            env,
            tools=tools,
            output_type=output_type,
            memory=memory,
            session=session,
        )
        yield out.text()


# Runtime-check: EchoEngine satisfies the Engine Protocol.
assert isinstance(EchoEngine(), Engine)


# Plug into Agent — same surface as any built-in engine.
# Non-LLM engines require an explicit ``name=`` (T7 since 0.7.9).
agent = Agent(engine=EchoEngine(), name="echo")
result = agent("hello")
print(result.text())   # "echo:hello"
```

For a full reference implementation, read `lazybridge.ext.hil.supervisor` (~280 LOC). It covers event emission, memory integration, async-to-sync bridging via `asyncio.to_thread`, and the optional `ainput_fn` async prompt path.

## Pitfalls

- **Skipping `stream` entirely breaks `agent.stream(...)`.** Implement it to at least yield the final text once (the pattern in the example above). Most callers don't need true streaming; they just need the method to exist.
- **Not emitting session events makes your engine invisible.** No cost rollup, no graph node, no audit trail. At minimum emit `AGENT_START` and `AGENT_FINISH`. Add `TOOL_CALL` / `TOOL_RESULT` / `TOOL_ERROR` when you dispatch tools.
- **Raising instead of wrapping breaks resilience layers.** `Agent(fallback=...)`, `Agent(verify=...)`, and `Plan(checkpoint_key=...)` all expect the engine to return an error envelope, not raise. Wrap your `try/except` body and return `Envelope.error_envelope(exc)`.
- **The engine receives a normalised `list[Tool]`.** Do not assume the agent's internal `_tool_map` shape is available, do not re-wrap functions, do not call `_wrap_tool` yourself. Treat `tools` as a flat list of `Tool` instances ready to invoke.
- **`store` and `plan_state` kwargs.** Even if your engine doesn't use them, declare them in `run`'s signature with default `None` — `Plan.run` and `Agent.run` may pass them positionally-by-keyword through the engine boundary. The protocol declares them, so satisfying the protocol means accepting them.
- **`engine._agent_name` is set by the wrapping agent**, not by you. Don't override it from inside the engine; read it defensively (`getattr(self, "_agent_name", "<class>")`) so the engine still works when invoked outside an `Agent`.

## See also

- [BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md) — the layer below an engine; for custom LLM backends rather than custom decision-making mechanisms.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — example of a non-LLM engine you can read for reference.
- [SupervisorEngine](https://core.lazybridge.com/guides/full/supervisor/index.md) — the most complete reference engine implementation.
- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — where the engine sits in the Agent = Engine + Tools + State decomposition.

# OpenTelemetry

`OTelExporter` emits LazyBridge `Session` events as OpenTelemetry spans conforming to the GenAI Semantic Conventions (`gen_ai.system`, `gen_ai.usage.input_tokens`, `gen_ai.tool.call.id`, …). Dashboards built for the standard (Datadog GenAI, Honeycomb GenAI, Grafana Tempo) render LazyBridge traces without translation.

This page is the deep dive: span hierarchy, attribute names, tracer provider lifecycle, and the in-memory exporter pattern for tests. For exporter basics see [Exporters](https://core.lazybridge.com/guides/full/exporters/index.md).

## Signature

```python
# Install — opt-in extra.
pip install "lazybridge[otel]"


from lazybridge import Session
from lazybridge.ext.otel import OTelExporter


OTelExporter(
    *,
    endpoint=None,                 # OTLP HTTP endpoint string
    exporter=None,                 # custom OTel exporter instance (overrides endpoint)
    batch=True,                    # True → BatchSpanProcessor; False → SimpleSpanProcessor
)


sess = Session(
    db="events.sqlite",
    batched=True,                  # Session-level back-pressure for the hot path
    exporters=[OTelExporter(endpoint="http://otelcol:4318")],
)
```

## Synopsis

The exporter takes raw `Session` events and translates each into the appropriate OTel span. It manages span lifecycles (open on `AGENT_START` / `MODEL_REQUEST` / `TOOL_CALL`, close on the matching finish event), attaches them to the OTel context so nested operations inherit the right parent, and detaches when they close.

### Span hierarchy

```text
invoke_agent  <agent_name>          (root for one Agent.run)
  ├─ chat       <model>             (one per LLM round-trip)
  └─ execute_tool <tool_name>       (one per tool invocation)
        └─ invoke_agent <inner>     (when the tool is itself an Agent)
```

Tool spans run as children of the agent span and close on `TOOL_RESULT` / `TOOL_ERROR` (correlated by `tool_use_id`). Cross-agent parenting works automatically: the inner agent's events are emitted on the same asyncio context as the outer tool span, so OTel's contextvars-based propagation makes the inner `invoke_agent` span a child of the outer `execute_tool` span without any explicit run-id chaining.

### GenAI Semantic Convention attributes

| Attribute                        | Source field                                   | Where it appears     |
| -------------------------------- | ---------------------------------------------- | -------------------- |
| `gen_ai.system`                  | provider name (`"anthropic"`, `"openai"`, …)   | `chat` spans         |
| `gen_ai.operation.name`          | `"chat"` / `"execute_tool"` / `"invoke_agent"` | every span           |
| `gen_ai.request.model`           | configured model                               | `chat` spans         |
| `gen_ai.response.model`          | actual model the provider replied with         | `chat` spans         |
| `gen_ai.usage.input_tokens`      | input tokens for this round-trip               | `chat` spans         |
| `gen_ai.usage.output_tokens`     | output tokens for this round-trip              | `chat` spans         |
| `gen_ai.response.finish_reasons` | normalised stop reason                         | `chat` spans         |
| `gen_ai.tool.name`               | tool name                                      | `execute_tool` spans |
| `gen_ai.tool.call.id`            | `tool_use_id` for correlation                  | `execute_tool` spans |
| `gen_ai.agent.name`              | wrapping agent's name                          | `invoke_agent` spans |

### LazyBridge-specific attributes

These have no GenAI equivalent (yet). They're prefixed `lazybridge.*` so an operator can filter on them deterministically without mistaking them for a future GenAI rename.

| Attribute              | Meaning                                                     |
| ---------------------- | ----------------------------------------------------------- |
| `lazybridge.run_id`    | UUID identifying the agent run (one per `Agent.run`)        |
| `lazybridge.cost_usd`  | Cost in USD reported by the provider for this round-trip    |
| `lazybridge.turn`      | Loop iteration index inside `LLMEngine`'s tool-calling loop |
| `lazybridge.branch_id` | Parallel-branch step name when emitted from a band          |

## When to use it

- **Distributed tracing** — you already run an OTel collector (Datadog, Honeycomb, Tempo, Jaeger) and want LazyBridge traces to land in the same dashboards.
- **Multi-service correlation** — cross-agent calls (Agent A as a tool inside Agent B) span pretty across worker thread / asyncio boundaries, since OTel contextvars propagate the active span automatically.
- **Cost / token attribution** — the `gen_ai.usage.*` and `lazybridge.cost_usd` attributes let you slice spend by model / agent / tool in the same dashboard you use for latency and errors.
- **Compliance / audit trails** — every tool call shows up as its own span with correlation id and structured arguments.

## When NOT to use it

- **You don't have an OTel pipeline.** Use [`JsonFileExporter`](https://core.lazybridge.com/guides/full/exporters/index.md) and load the resulting JSONL into pandas / your preferred tool. Standing up a collector just for one app's traces is overkill.
- **You want to query history programmatically.** `session.events.query(...)` reads the SQLite-backed `EventLog` directly — no collector required. OTel is for *push* streams to external systems.
- **Single-process scripts.** A console exporter (`Session(console=True)` or `Agent(verbose=True)`) is enough; OTel adds infrastructure weight you don't need.

## Example

```python
from lazybridge import Agent, LLMEngine, Session
from lazybridge.ext.otel import OTelExporter


# 1) Production: OTLP HTTP endpoint, batched span processor (default).
sess = Session(
    db="events.sqlite",
    batched=True,                      # session-level back-pressure
    exporters=[OTelExporter(endpoint="http://otelcol:4318")],
)
agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    session=sess,
)
agent("hello")
sess.flush()                           # drain the batched writer before exit
sess.close()                           # also flushes the OTel batch processor


# 2) Tests: in-memory exporter so each span is captured synchronously.
from opentelemetry.sdk.trace.export.in_memory_span_exporter import (
    InMemorySpanExporter,
)


def test_agent_emits_spans():
    memory_exporter = InMemorySpanExporter()
    sess = Session(
        exporters=[
            OTelExporter(exporter=memory_exporter, batch=False),
        ],
    )

    agent = Agent(
        engine=LLMEngine("claude-haiku-4-5"),
        session=sess,
    )
    agent("test prompt")
    sess.close()

    spans = memory_exporter.get_finished_spans()
    assert any(s.name.startswith("invoke_agent") for s in spans)


# 3) Multiple exporters in one session — OTel + JSON file + console alerts.
from lazybridge import (
    CallbackExporter,
    ConsoleExporter,
    EventType,
    FilteredExporter,
    JsonFileExporter,
)


def alert_on_error(event: dict) -> None:
    print(f"ALERT: {event}")


sess = Session(
    db="events.sqlite",
    batched=True,
    exporters=[
        JsonFileExporter(path="run.jsonl"),
        OTelExporter(endpoint="http://otelcol:4318"),
        FilteredExporter(
            inner=CallbackExporter(fn=alert_on_error),
            event_types={EventType.TOOL_ERROR, EventType.AGENT_FINISH},
        ),
    ],
)
```

## Pitfalls

- **`batch=True` (default) buffers spans.** Spans from a fast- finishing run may not be flushed by the time the process exits. Always `sess.close()` (or use `Session` as a context manager) before exit so the BatchSpanProcessor drains. For finer control, call `OTelExporter.flush(timeout_millis=30_000)` directly — useful between runs in a long-lived process where you want intermediate spans to reach the collector without closing the exporter.
- **`batch=False` ↔ `SimpleSpanProcessor`** is the right choice for tests against an `InMemorySpanExporter` — every span is flushed synchronously on close. Don't use it in production: every span emit blocks the engine on the network round-trip.
- **Per-instance `TracerProvider`.** Each `OTelExporter` creates its own `TracerProvider`. The provider is also installed globally as a best-effort default (so unrelated OTel-aware code in the same process picks it up), but two `OTelExporter` instances don't share state — each manages its own in-flight span registry. For tests with multiple exporters, pass distinct `exporter=` arguments.
- **Stale spans on cancellation.** If a run is cancelled before `AGENT_FINISH` fires, the corresponding span stays open in the registry. Call `sess.close()` (which calls `OTelExporter.close()`) to force-flush any spans still open — useful when the cancel is graceful but the finally-block can't reach the finish emit.
- **Custom resource attributes** (service.name, deployment.env) aren't set by the exporter. Configure them on your own `TracerProvider` and pass it through `exporter=` if you want them on every span — `endpoint=` on the exporter constructor builds a default provider with no resource attributes.
- **Native-tool calls don't appear as `execute_tool` spans.** Those happen server-side at the provider; LazyBridge sees them as part of the model's response, so they roll into the parent `chat` span rather than getting their own. If you want fine- grained native-tool tracing, query the provider's own dashboard.
- **`session.events.query(...)` reads stale data when batched.** The SQLite write is batched separately from the OTel emit; call `sess.flush()` before reading the local event log.

## See also

- [Exporters](https://core.lazybridge.com/guides/full/exporters/index.md) — the broader exporter surface including the four core sinks (Console, JsonFile, StructuredLog, Callback) and `FilteredExporter`.
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — the bus that fans events into exporters; covers `batched=`, `on_full=`, `redact=`, and the `EventLog` query surface.
- [GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md) — agent topology view; complements OTel (graph = static structure, OTel spans = live trace).
- [OpenTelemetry GenAI Semantic Conventions](https://opentelemetry.io/docs/specs/semconv/gen-ai/) — the upstream spec the exporter conforms to.

# Plan serialization

Round-trip a `Plan`'s topology through JSON. `Plan.to_dict()` produces a JSON-friendly description of every step, sentinel, route, and parallel flag; `Plan.from_dict(data, registry=...)` rebuilds the plan, rebinding callables and Agents through a name-keyed registry.

This is **descriptor-only** serialisation: targets that aren't string tool names (callables, Agents, predicates) cross the JSON boundary as names, and the loader rebinds them. Use it to ship a pipeline definition between processes, version-control a topology, or diff-render a plan without instantiating it.

## Signature

```python
plan.to_dict() -> dict
Plan.from_dict(data: dict, *, registry: dict[str, Any] | None = None) -> Plan


# Persisted shape (version 1)
{
    "version": 1,
    "max_iterations": int,
    "steps": [
        {
            "name": str,
            "target": {"kind": "tool" | "agent" | "callable" | "unknown", "name": str},
            "task":     {"kind": "from_prev" | "from_start" | "from_step" | "from_parallel"
                                | "from_parallel_all" | "literal", ...} | None,
            "context":  <task ref> | [<task ref>, ...] | None,    # single OR list, preserves shape
            "parallel": bool,
            "writes":   str,              # omitted when None
            "routes":   [str, ...],       # target step names; predicates rebound via registry
            "routes_by": str,             # omitted when None
            "after_branches": str,        # omitted when None
        },
        ...
    ],
}
```

# Public utility for catching dangling references after load.

from lazybridge.engines.plan.\_serialisation import validate_plan_refs

validate_plan_refs(steps: list[dict]) -> list[str]

## Synopsis

`to_dict()` walks the live plan and produces a JSON-compatible dict that captures **topology only**:

- **Step targets** serialise by `kind` + `name`. A string target (`Step("research")`) round-trips as `{"kind": "tool", "name": "research"}` and gets resolved by the outer agent's `tools=[...]` map at run time — no registry entry required. A callable serialises as `{"kind": "callable", "name": fn.__name__}`. An `Agent` target serialises as `{"kind": "agent", "name": agent.name}`.
- **Sentinels** serialise by `kind` + `name`. `from_prev` and `from_start` carry no name; `from_step("…")` / `from_parallel("…")` / `from_parallel_all("…")` carry the referenced step name. A string `task=` serialises as `{"kind": "literal", "value": "…"}`.
- **`context=` shape is preserved.** A single sentinel/string serialises to one ref dict; a list of sentinels/strings serialises to a list of ref dicts. The runtime treats these identically; the round-trip preserves the on-disk shape so a diff is meaningful.
- **Routes** serialise as a sorted list of target step names. The predicates themselves cannot be JSON-encoded — the loader rebinds them via the registry under the key `f"routes:{step_name}:{target_name}"`.
- **`routes_by` / `after_branches` / `writes` / `parallel`** are preserved verbatim (omitted from the dict when their default applies, so the JSON stays small).
- **`max_iterations`** and the schema `version` are at the top level.

Note that the serialiser does **not** capture `store=` / `checkpoint_key=` / `resume=` / `on_concurrent=` from the live `Plan` constructor — those are runtime concerns set when the plan is instantiated, not topology.

## When to use it

- **Cross-process pipeline transport.** Serialise on a build server, deploy the JSON, deserialise on the runtime. Both sides must know the same names; the loader binds them to live objects.
- **Version control for topology.** Commit `plan.json` alongside the code that builds it. Diffs show step additions / removals / re-orderings clearly.
- **Render to other formats.** Pass `plan.to_dict()["steps"]` through your own renderer to produce Mermaid diagrams, GraphViz output, or in-house pipeline visualisations without instantiating a Plan.
- **External plan editors.** A web UI that lets users build pipelines visually serialises to the same shape — the runtime loads the JSON and rebinds tools / callables / predicates.

## When NOT to use it

- **Persisting plan execution state across runs.** That's [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — `Plan(store=..., checkpoint_key=..., resume=True)` writes runtime state to a `Store` after every step, separate from the topology.
- **Sharing a runnable Agent.** A serialised plan is just the DAG; it doesn't know about provider keys, sessions, or wrappers. The runtime side has to construct the live `Agent(engine=Plan, ...)`.
- **Cross-version migration.** The `version: 1` field signals the shape; future versions will bump it and `from_dict` will refuse older shapes. Migrate explicitly when bumping rather than assuming round-trip compatibility.

## Example

```python
import json

from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step, from_step, when


class Hits(BaseModel):
    items: list[str]


def fetch(task: str) -> str:
    """Look up hits for ``task``."""
    return "..."


def rank(task: str) -> str:
    """Rank the supplied hits."""
    return "..."


def has_no_results(env) -> bool:
    return not env.payload.items


# 1) Build the plan in Python.
plan = Plan(
    Step(fetch, name="fetch", writes="hits", output=Hits,
         routes={"apology": when.field("items").empty()}),
    Step(rank,  name="rank",  task=from_step("fetch"), writes="ranked"),
    Step("write", name="write", task=from_step("rank")),
    Step("apology", name="apology"),                  # terminal early-out
)


# 2) Serialise the topology to JSON (lossless for shape).
saved = plan.to_dict()
with open("plan.json", "w") as f:
    json.dump(saved, f, indent=2)


# 3) Load on the other side. Rebind callables, predicates, and
#    Agent targets through the registry.  Tool-name targets ("write",
#    "apology") survive without a registry entry — they're resolved
#    by the outer agent's tools=[...] at run time.
with open("plan.json") as f:
    loaded = json.load(f)


writer = Agent(engine=LLMEngine("gemini-3-flash-preview"), name="write")
apologiser = Agent(engine=LLMEngine("gemini-3-flash-preview"), name="apology")


plan_reloaded = Plan.from_dict(
    loaded,
    registry={
        "fetch":  fetch,
        "rank":   rank,
        # Predicate rebinds — key shape: f"routes:{step}:{target}"
        "routes:fetch:apology": when.field("items").empty(),
    },
)


# Tool-name targets attach via the wrapping Agent.
agent = Agent(
    engine=plan_reloaded,
    tools=[writer, apologiser],
)
agent("AI trends April 2026")


# 4) Validate dangling sentinel references after loading
#    (useful when the JSON came from an external source).
from lazybridge.engines.plan._serialisation import validate_plan_refs

errors = validate_plan_refs(loaded["steps"])
assert errors == [], errors
```

## Pitfalls

- **The registry is a positional contract.** Every non-tool target (callable, Agent) must be present in the registry, and predicates must be present under `f"routes:{step_name}:{target_name}"`. Missing entries raise `KeyError` with the offending name — by design, the load fails loud rather than producing a silently broken plan.
- **Tool-name targets survive without a registry entry.** They're resolved at run time from the outer Agent's `tools=[...]`. If the loader tries to populate a registry entry for a `"tool"` target, the entry is ignored.
- **Step-name security.** `_validate_step_name(name)` rejects any name that doesn't match `^[\w][\w\-]*$` (alphanumerics, `_`, `-`). This guards against tampered checkpoint payloads with path-separator characters or shell metacharacters; it also means literal step names with dots, slashes, or spaces fail to load.
- **Predicates serialise as target names only.** The actual callable lives in Python and must be rebound. If you forget the registry entry the load raises `KeyError` with the missing `routes:<step>:<target>` key.
- **`parallel=True` is preserved**; `Step.input` / `Step.output` type annotations are **not**. The on-disk shape captures topology, not type metadata. The runtime re-derives types from the `Step.input` / `Step.output` defaults — pass them through the registry if you need typed structured output on a rebuilt plan.
- **Schema versioning.** `from_dict` accepts only `version: 1` today. Breaking changes will bump the version and old shapes will be rejected; migrate by re-serialising from the live Plan at upgrade time rather than assuming the format is stable across versions.
- **Live state isn't captured.** `store=` / `checkpoint_key=` / `resume=` / `on_concurrent=` are constructor-time arguments; they don't appear in `to_dict()`. Pass them again when you call `Plan.from_dict(...)` (or, more typically, when you wrap the plan in `Agent(engine=..., store=..., ...)`).

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine whose topology is serialised.
- [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — separate concept: runtime *state* persistence (writes-bucket, completed steps, status) across crashes, not topology.
- [GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md) — the topology view auto- populated by `Session`; complements `Plan.to_dict` (they capture different facets — Plan = runnable topology, GraphSchema = live agent registration with provider / model metadata).

# Providers

The catalogue of LLM providers shipped with LazyBridge, the tier aliases each one resolves, and the per-provider quirks (thinking modes, native tools, deprecation timelines). For writing a brand-new provider see [BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md).

> **Pricing and model lineup snapshot from late 2025.** LLM provider economics shift fast — treat the tables below as a structural reference (which alias resolves to which model, which features work on which model) rather than as live pricing.

## Signature

```python
from lazybridge import Agent, LLMEngine

# Direct model selection — provider inferred from the model string.
Agent(engine=LLMEngine("claude-opus-4-8"))
Agent(engine=LLMEngine("gpt-5.4-mini"))

# Tier-based selection — model never appears in app code.
Agent.from_provider("anthropic", tier="top")     # → claude-opus-4-8
Agent.from_provider("openai",    tier="medium")  # → gpt-5.4-mini
Agent.from_provider("google",    tier="cheap")   # → gemini-3.1-flash-lite-preview
```

`Agent.from_provider` is sugar for `Agent(engine=LLMEngine(<resolved-model>, provider=<name>))`. See [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) for the breakdown.

### Tier names

| Tier          | Intent                                                                                |
| ------------- | ------------------------------------------------------------------------------------- |
| `super_cheap` | Smallest / cheapest model in the lineup; for parsing, classification, throwaway calls |
| `cheap`       | Default budget tier                                                                   |
| `medium`      | The default for `Agent.from_provider(...)`                                            |
| `expensive`   | Premium reasoning / long-context tier                                                 |
| `top`         | The flagship model                                                                    |

Each provider's `_TIER_ALIASES` table maps these strings to a concrete model name. A string not in the table is treated as a literal model name (passthrough).

## Built-in providers

### Anthropic

| tier          | model               | ctx   | max_out | $/M in | $/M out |
| ------------- | ------------------- | ----- | ------- | ------ | ------- |
| `top`         | `claude-opus-4-8`   | 1 M   | 128 K   | $5.00  | $25.00  |
| `expensive`   | `claude-opus-4-7`   | 1 M   | 128 K   | $5.00  | $25.00  |
| `medium`      | `claude-sonnet-4-6` | 1 M   | 64 K    | $3.00  | $15.00  |
| `cheap`       | `claude-haiku-4-5`  | 200 K | 64 K    | $1.00  | $5.00   |
| `super_cheap` | `claude-3-haiku`    | 200 K | 4 K     | $0.25  | $1.25   |

- **Thinking.** `opus-4-8` / `opus-4-7` / `opus-4-6` / `sonnet-4-6` use adaptive thinking (no `budget_tokens` argument). `haiku-4-5` and earlier 3.x models require `ThinkingConfig(budget_tokens=N)`. `opus-4-8` and `opus-4-7` do **not** accept `temperature`.
- **Native tools.** `WEB_SEARCH`, `CODE_EXECUTION`, `COMPUTER_USE`.

### OpenAI

| tier          | model          | ctx   | max_out | $/M in | $/M cached | $/M out |
| ------------- | -------------- | ----- | ------- | ------ | ---------- | ------- |
| `top`         | `gpt-5.5-pro`  | 1 M   | 128 K   | $30.00 | —          | $180.00 |
| `expensive`   | `gpt-5.5`      | 1 M   | 128 K   | $5.00  | $0.50      | $30.00  |
| `medium`      | `gpt-5.4-mini` | 400 K | 128 K   | $0.75  | $0.075     | $4.50   |
| `cheap`       | `gpt-5.4-nano` | 400 K | 128 K   | $0.20  | $0.02      | $1.25   |
| `super_cheap` | `gpt-4o-mini`  | 128 K | 16 K    | $0.15  | —          | $0.60   |

Other supported models (passed verbatim, no tier alias): `gpt-5.4-pro` ($30 / $180), `gpt-5.4` ($2.50 / $0.25 cache / $15), `gpt-5` ($1.25 / $10), `gpt-4o` ($2.50 / $10), `gpt-4.1` ($2 / $8), `gpt-4.1-mini` ($0.40 / $1.60), `o3` ($2 / $8), `o4-mini` ($1.10 / $4.40).

- **Thinking.** `gpt-5.5` / `gpt-5.5-pro` accept `reasoning_effort ∈ {none, low, medium, high, xhigh}` (default `medium`). The `o`-series and `gpt-5.4-pro` accept `reasoning_effort ∈ {low, medium, high}`. Standard GPT models don't support thinking.
- **Native tools.** `WEB_SEARCH`, `CODE_EXECUTION`, `FILE_SEARCH`, `COMPUTER_USE`, `IMAGE_GENERATION`.
- **Cache.** Automatic via `prompt_tokens_details.cached_tokens`; `cached_input` rate applied when published (`gpt-5.5`, `gpt-5.4`, `gpt-5.4-mini`, `gpt-5.4-nano`).
- **Long-context surcharge** (>272K input on `gpt-5.x`) is **not** modeled in cost rollup — the reported cost may under-count for large prompts.

### Google

| tier          | model                           | ctx | max_out | $/M in | $/M out |
| ------------- | ------------------------------- | --- | ------- | ------ | ------- |
| `top`         | `gemini-3.1-pro-preview`        | 1 M | 64 K    | $2.00  | $12.00  |
| `expensive`   | `gemini-2.5-pro`                | 1 M | 64 K    | $1.25  | $10.00  |
| `medium`      | `gemini-3-flash-preview`        | 1 M | 64 K    | $0.50  | $3.00   |
| `cheap`       | `gemini-3.1-flash-lite-preview` | 1 M | 64 K    | $0.25  | $1.50   |
| `super_cheap` | `gemini-2.5-flash-lite`         | 1 M | 64 K    | $0.10  | $0.40   |

- **Thinking.** `gemini-3.x` accepts `ThinkingConfig(thinking_level=...)` with `low` / `medium` / `high`. `gemini-2.x` accepts `ThinkingConfig(thinking_budget=N)`; `-1` selects auto-budget.
- **Native tools.** `GOOGLE_SEARCH`, `WEB_SEARCH`, `GOOGLE_MAPS`.
- **Warning.** Google Search + structured output produces a provider 400 — they're mutually exclusive.
- **Deprecation.** `gemini-2.0-flash` retires June 1 2026; do not use in new code.

### DeepSeek

| tier                               | model               | ctx | max_out | $/M in | $/M cached | $/M out |
| ---------------------------------- | ------------------- | --- | ------- | ------ | ---------- | ------- |
| `top` / `expensive`                | `deepseek-v4-pro`   | 1 M | 384 K   | $0.435 | $0.003625  | $0.87   |
| `medium` / `cheap` / `super_cheap` | `deepseek-v4-flash` | 1 M | 384 K   | $0.14  | $0.0028    | $0.28   |

- **Thinking.** Both V4 models accept `ThinkingConfig` → `reasoning_content` field on the response. In thinking mode the provider strips `temperature` / `top_p` / `presence_penalty` / `frequency_penalty`. `ThinkingConfig` on non-V4 models raises `ValueError`.
- **Cache.** Automatic on repeated prefixes ≥1024 tokens; no opt-in required.
- **Native tools.** None (function calling is supported).
- **Deprecation (retire 2026-07-24).** `deepseek-reasoner` and `deepseek-chat` both alias to `deepseek-v4-flash`.

### LMStudio

A local OpenAI-compatible runtime. `LMStudioProvider` extends `OpenAIProvider`; point `OPENAI_BASE_URL` at your LM Studio instance and use any model name your local install serves.

### LiteLLM

The unified bridge for the long tail (Mistral, Cohere, Groq, Bedrock, Vertex, Ollama, etc.). Use the `litellm/` model-string prefix to route through `LiteLLMProvider`. Native providers (Anthropic, OpenAI, Google, DeepSeek) still handle their own models directly — LiteLLM is the catch-all for the rest.

```python
Agent(engine=LLMEngine("litellm/groq/llama-3.3-70b"))
```

## `tool_choice` values

LLMEngine accepts a `tool_choice=` kwarg that drives provider tool selection:

| Value           | Meaning                                                                                                              |
| --------------- | -------------------------------------------------------------------------------------------------------------------- |
| `"auto"`        | Model decides (default)                                                                                              |
| `"none"`        | No tool calls allowed                                                                                                |
| `"required"`    | Must call at least one tool                                                                                          |
| `"any"`         | Alias for `"required"`; mapped to provider equivalent (`"required"` for OpenAI, `{"type":"required"}` for Anthropic) |
| `"<tool_name>"` | Must call the named tool                                                                                             |

After the first tool-call turn, `tool_choice` resets to `"auto"` automatically — so a forced first invocation doesn't lock the rest of the loop.

DeepSeek does **not** support `tool_choice` in thinking mode.

## Google `finish_reason` mapping

The Google provider normalises `finish_reason` strings so callers don't have to switch on Gemini-specific values:

| Gemini value                                                          | Normalised     |
| --------------------------------------------------------------------- | -------------- |
| `MAX_TOKENS`                                                          | `"max_tokens"` |
| `SAFETY` / `RECITATION` / `BLOCKLIST` / `PROHIBITED_CONTENT` / `SPII` | `"stop"`       |
| anything else                                                         | `"end_turn"`   |

## Pitfalls

- **DeepSeek tier collapse.** Three of the five tier aliases (`medium` / `cheap` / `super_cheap`) all map to `deepseek-v4-flash` — there's no smaller model in the lineup.
- **`gpt-5.5-mini` / `gpt-5.5-nano` don't exist** yet; the `medium` and `cheap` tiers stay on `gpt-5.4-mini` / `gpt-5.4-nano` until OpenAI ships them.
- **`gpt-5-mini` doesn't exist either.** The current OpenAI `mini` variant is `gpt-5.4-mini`.
- **`gemini-2.0-flash` deprecation** lands June 1 2026; switch to `gemini-2.5-flash-lite` before then.
- **Adaptive thinking ignores `budget_tokens`.** Anthropic `claude-opus` / `claude-sonnet` 4.6+ pick their own thinking budget; passing `ThinkingConfig(budget_tokens=...)` is no-effect.
- **`tool_choice="any"` is not passed literally.** It maps to `"required"` (or the provider equivalent) at request time.
- **Pricing changes faster than these tables.** Check the provider's current rate card before reasoning about cost in production.

## See also

- [BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md) — write your own provider when none of the built-ins fits.
- [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md) — what each provider exposes server-side; the per-provider table above lists the supported `NativeTool` enum values.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `Agent.from_provider("…", tier="top")` is one of the few factory methods that's not pure sugar (it builds the engine with the tier alias and an explicit `provider=`).

# Visualizer

A local-only web UI that shows what's happening inside a LazyBridge `Session` in real time. Pulses travel along graph edges as agents call tools, the inspector lets you read every event payload, and the store viewer highlights writes as they happen. The same UI replays a finished session from the SQLite event log with speed control and step-by-step navigation.

> **Status: alpha.** Bound to `127.0.0.1` with an ephemeral port — not designed for remote access. Backend is stdlib-only (`http.server` + Server-Sent Events + a token in the URL); the frontend loads D3.js v7 from a CDN, so there is no build step.

## Signature

```python
from lazybridge import Session
from lazybridge.ext.viz import Visualizer


# Live mode — instrument an active Session.
Visualizer(
    session,                       # required: the Session to observe
    *,
    store=None,                    # optional Store to render in the side panel
    host="127.0.0.1",              # bind address (do NOT change unless you understand the risk)
    port=0,                        # 0 = ephemeral; pick a fixed port to share the URL
    auto_open=True,                # webbrowser.open(...) on start()
)


# Replay mode — reconstruct from a finished SQLite event log.
Visualizer.replay(
    db,                            # SQLite file path
    *,
    session_id=None,               # specific session id; None = first one in the file
    speed=1.0,                     # playback multiplier (0.1 .. 100.0)
    host="127.0.0.1",
    port=0,
    auto_open=True,
) -> Visualizer


# Server lifecycle.
viz.start()                        # start HTTP server (and replay controller in replay mode)
viz.stop()                         # stop server, detach exporter, stop replay
viz.url                            # URL the server is bound to
viz.open()                         # block until Ctrl+C — useful for replay scripts


# Context-manager pattern (recommended for live mode).
with Visualizer(session) as viz:
    # ...run pipeline; browser is already open...
    ...
```

## Synopsis

`Visualizer` has two modes built from the same UI:

- **Live mode** (`Visualizer(session)`) installs a `HubExporter` on the session, serves the live `GraphSchema`, optional `Store`, and a Server-Sent Events stream of every emitted event. The browser receives events as they happen and animates pulses along graph edges in real time.
- **Replay mode** (`Visualizer.replay(db=...)`) opens a finished SQLite event log, reconstructs a graph from the event stream (`reconstruct_graph(events)`), and exposes pause / play / step / speed controls so you can walk through a recorded run at your own pace.

In both modes the UI runs in a normal browser tab and talks to a stdlib HTTP server bound to `127.0.0.1` with an ephemeral port. The URL includes a token so a stray port-scanner doesn't get to replay your private events; that's the only authentication.

`Visualizer` is a context manager — `__enter__` calls `start()` and `__exit__` calls `stop()`. Recommended pattern: wrap your pipeline run in a `with Visualizer(session):` block so the server shuts down cleanly when the run finishes (the browser tab stays open; close it yourself).

## When to use it

- **Debugging a multi-agent pipeline.** Live mode shows which agent fired which tool, in which order, with timing. Easier to read than a JSONL event dump.
- **Demos and walkthroughs.** Replay mode plus a recorded session is the cleanest way to talk through what a pipeline does without re-running it (and without spending tokens on every retake).
- **Post-mortem analysis.** A failed production run logged to SQLite replays in the same UI; jump directly to the `TOOL_ERROR` event and inspect the offending payload.
- **Teaching.** New users grasp the agent-as-tool composition model faster from the live graph than from documentation.

## When NOT to use it

- **Production observability.** Use `OTelExporter` and a real observability backend (Datadog, Honeycomb, Tempo). The Visualizer is for local inspection, not aggregated dashboards.
- **Headless / CI environments.** `auto_open=True` calls `webbrowser.open(...)`; pass `auto_open=False` if you're running in a container or behind SSH. (You can still hit `viz.url` from a tunnel, but local-only binding means you'll need an SSH port-forward.)
- **Sensitive sessions on shared hosts.** Even with the URL token, the server binds to `127.0.0.1` and serves whatever the `Session` emits. Don't run it on a multi-tenant host where other users could discover the port.
- **Long-lived processes that don't end cleanly.** The server thread runs until `stop()` is called. If your process daemonises or has a non-trivial shutdown path, wire `stop()` into your signal handler.

## Example

```python
from lazybridge import Agent, LLMEngine, Session, Store
from lazybridge.ext.viz import Visualizer


# 1) Live mode — wrap a pipeline run.
sess = Session(db="demo.db")
researcher = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="research",
    session=sess,
)
writer = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="write",
    session=sess,
)
orchestrator = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    tools=[researcher, writer],
    session=sess,
)

with Visualizer(sess) as viz:
    print(f"viz at {viz.url}")
    orchestrator("AI trends April 2026")


# 2) Replay mode — walk through a finished run at half speed.
Visualizer.replay(
    db="demo.db",
    speed=0.5,
).open()                           # blocks until Ctrl+C


# 3) Headless / CI — capture the URL without opening a browser.
with Visualizer(sess, auto_open=False) as viz:
    print(f"forward this port to view: {viz.url}")
    orchestrator("...")


# 4) Render a Store alongside the graph.
shared_store = Store(db="run.sqlite")
sess = Session(db="demo.db")

with Visualizer(sess, store=shared_store):
    # The right-hand panel updates as keys land in `shared_store`.
    pipeline_with_writes("...")
```

### Replay controls

The replay UI exposes four control actions over a side channel:

| Action  | Effect                                                        |
| ------- | ------------------------------------------------------------- |
| `play`  | Resume playback (after `pause`).                              |
| `pause` | Halt playback at the current event.                           |
| `step`  | Advance one event and pause.                                  |
| `speed` | Set playback speed; numeric value, must be in `[0.1, 100.0]`. |

The browser UI surfaces these as buttons; under the hood they POST to `/control` with a JSON body. The handler returns `{"ok": true, "idx": <event_index>, "total": <total_events>}` so the frontend can update its progress bar.

## Pitfalls

- **`auto_open=True` calls `webbrowser.open(...)` synchronously on `start()`.** On systems without a browser configured (CI, containers, headless servers), this is a silent no-op rather than an error — but you still need to read `viz.url` and open it yourself. Pass `auto_open=False` for headless runs.
- **`port=0` produces an ephemeral port** that changes every run. Pin a port (`port=8765`) only when you need a stable URL — e.g. to share with a teammate over a tunnel. Two visualizers on the same fixed port collide.
- **Live mode uses the live `GraphSchema`** of the session, so an agent without `session=sess` doesn't appear in the topology even if it runs. Pass the session to every agent you want visible.
- **Replay reconstructs a minimal graph** from event payloads — the result is structurally similar to but not identical with the live `GraphSchema`. If you need pixel-perfect topology, serialise the graph yourself (`sess.graph.to_yaml()`) and load it alongside the events.
- **Speed bounds.** `speed` is clamped to `[0.1, 100.0]` server- side; values outside that range produce a JSON error response. Real-time playback is `1.0`; `0.5` is half-speed; `10.0` is ten times faster.
- **The browser stays open after `stop()`.** The HTTP server shuts down, but the user's browser tab keeps the now-stale UI. This is by design — closing the user's tab from the server side would be hostile. Add a banner or close instruction in your demo script if it matters.
- **Custom UIs.** The exporter (`HubExporter`) and event hub (`EventHub`) live in `lazybridge.ext.viz.exporter`; if you want to drive a different frontend, instantiate them yourself and consume events from the hub's queue rather than going through `Visualizer`.

## See also

- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — the source of events the Visualizer renders; covers `db=`, `batched=`, exporters.
- [GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md) — the topology view the Visualizer reads in live mode.
- [Exporters](https://core.lazybridge.com/guides/full/exporters/index.md) — the broader exporter surface; Visualizer's `HubExporter` is one of many possible sinks.
- [OpenTelemetry](https://core.lazybridge.com/guides/advanced/otel/index.md) — the production-observability counterpart; complements (rather than replaces) the local Visualizer.

# Agent

The single class you build, configure, run, and compose. Every other primitive in LazyBridge — engines, tools, plans, sessions, memories, guards — is something you wire **into** an `Agent`.

## Signature

```python
from lazybridge import Agent

agent = Agent(
    engine=...,                    # required: LLMEngine / Plan / HumanEngine / SupervisorEngine / custom
    tools=[...],                   # callables, Tools, Agents, ToolProviders
    output=str,                    # str (default) or a Pydantic model class
    memory=None,                   # Memory instance for conversation continuity
    store=None,                    # Store instance for cross-run / cross-agent state persistence
    session=None,                  # Session for event tracking + observability
    name=None,                     # surface name (used as a tool name when this Agent is composed)
    description=None,              # human-readable description (LLM-facing when used as a tool)
    verbose=False,                 # print turn-by-turn updates to stdout
    model=None,                    # convenience: tier alias or model ID when engine= is a provider string
    sources=(),                    # static documents prepended to every turn
    guard=None,                    # Guard / GuardChain — input/output filtering
    verify=None,                   # Agent or callable — judge-and-retry loop
    max_verify=3,                  # retries when verify=...
    native_tools=None,             # list[NativeTool | str] — provider-hosted tools
    allow_dangerous_native_tools=False,  # security gate: opt-in for CODE_EXECUTION / COMPUTER_USE
    output_validator=None,         # callable validator over the payload
    max_output_retries=2,          # retries on output validation failure
    timeout=None,                  # total deadline for the run (seconds)
    max_retries=3,                 # provider transient-error retries
    retry_delay=1.0,               # base delay between retries (exponential backoff)
    fallback=None,                 # secondary Agent invoked on primary failure
    cache=False,                   # bool or CacheConfig — prompt caching
)

# Calling
result = agent(task)               # sync, returns Envelope (canonical)
result = await agent.run(task)     # async equivalent
async for chunk in agent.stream(task): ...   # streaming form
```

Use `await agent.run(...)` inside async runtimes

The sync form `agent(task)` is safe in scripts, notebooks, and plain Python REPLs. When called from inside a *running* event loop — FastAPI / Starlette / aiohttp request handlers, async workers, Jupyter cells that are already inside an `await` — it detects the loop and dispatches the underlying coroutine onto a worker thread, then blocks the caller until that thread finishes. The bridge keeps notebook ergonomics intact, but inside an async server it stalls the calling task and burns a thread for every request. Prefer `await agent.run(task)` directly in async code.

For factory and composition shortcuts (`Agent.from_provider`, `Agent.chain`, `Agent.parallel`), see [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md).

## Synopsis

An `Agent` is the composition `Engine + Tools + State`:

- The **engine** decides what happens next. `LLMEngine` is the most common — an LLM that picks tools and arguments dynamically. Swap it for `Plan` to get deterministic orchestration, `HumanEngine` to gate at a human approval, or `SupervisorEngine` for a REPL.
- **Tools** are everything the agent can invoke. Plain Python functions, other agents, `Plan`-backed pipelines, MCP servers, provider-native capabilities — they all live in `tools=[...]`.
- **State** is what persists across or alongside the run. `Memory` carries conversation history; `Session` records events; the result `Envelope` carries the typed payload plus token / cost / latency metadata.

Calling `agent(task)` runs the engine to completion and returns an `Envelope`. The same `Agent` shape supports a one-shot helper, a hierarchical multi-agent system, and a checkpointed production pipeline — only the `engine=` argument changes.

## When to use it

- **Any single LLM interaction** — one-shot call, tool use, or structured output. `Agent` is the only class you need for Basic-tier work; everything else is opt-in via keyword args.
- **Building blocks for composition.** Two agents passed in `tools=[...]` of a third agent forms the supervisor pattern. A list of agents passed to `Plan` becomes a deterministic pipeline. The same `Agent` is the unit at every level.
- **Tier upgrades.** Add `output=` for structured output, `memory=` for conversation continuity, `session=` for observability, `verify=` for high-stakes outputs, `cache=True` for prompt caching — without changing the run-loop you've already written.

## When NOT to use it

- **Pure deterministic logic.** Don't wrap arithmetic, file parsing, or HTTP calls in an `Agent` — they go in `tools=[...]` or remain plain functions. The agent's job is to decide *when* to call them.
- **Streaming-only callsites where you can drop to `LLMEngine` directly.** If you genuinely don't need tools, memory, sessions, guards, or any other agent-level feature, `LLMEngine(...).stream(...)` works — but in practice this is rare; almost every use grows into needing at least one of those features.
- **Cases where you want a graph DSL.** LazyBridge expresses composition in plain Python (`Agent`, `Plan`, `Step`). If you need a separate graph definition language, you're on the wrong framework.

## Example

```python
from lazybridge import Agent, LLMEngine, Session
from pydantic import BaseModel


# 1) Minimal agent.
agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
)
result = agent("hello")
print(result.text())


# 2) Tools — auto-schema from type hints + docstring.
def search(query: str) -> str:
    """Search the web for ``query`` and return the top three hits."""
    return "..."

researcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[search],
    name="research",
)
print(researcher("AI news April 2026").text())


# 3) Structured output — read .payload, not .text().
class Summary(BaseModel):
    title: str
    bullets: list[str]

summariser = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    output=Summary,
)
result = summariser("Summarise LazyBridge in three bullets.")
print(result.payload.title)
print(result.payload.bullets)


# 4) Tool-is-Tool composition (Agents wrap Agents).
editor = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[researcher],            # researcher.name="research" becomes the tool name
    name="editor",
)
print(editor("find papers on bees and write a one-paragraph summary").text())


# 5) output_validator — application invariants on top of Pydantic.
class DateRange(BaseModel):
    start_date: str
    end_date: str


def chronological(payload: DateRange) -> DateRange:
    """Re-prompt up to max_output_retries times if start > end."""
    if payload.start_date > payload.end_date:
        raise ValueError(
            f"start_date ({payload.start_date}) must precede end_date ({payload.end_date})"
        )
    return payload


extractor = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    output=DateRange,
    output_validator=chronological,
    max_output_retries=2,
)


# 6) Streaming — same Agent, drop down to .stream() for partial output.
import asyncio
from lazybridge import Agent, LLMEngine

async def stream_brief() -> None:
    agent = Agent(engine=LLMEngine("claude-haiku-4-5"))
    async for chunk in agent.stream("Outline LazyBridge in five bullets."):
        print(chunk, end="", flush=True)

asyncio.run(stream_brief())


# 7) Cache — explicit TTL on Anthropic.
from lazybridge import CacheConfig

cached = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    cache=CacheConfig(ttl="1h"),       # "5m" (default) or "1h" on Anthropic
)


# 8) Production-shape: timeout + cache + provider fallback + tracing.
fallback_agent = Agent(
    engine=LLMEngine("gpt-5"),
    tools=[search],
    name="fallback",
)
prod = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[search],
    timeout=30.0,
    cache=True,
    fallback=fallback_agent,
    session=Session(db="events.sqlite"),
)
prod("draft a one-pager on the LazyBridge audit findings")
```

## Pitfalls

- **`output=SomeModel` + `.text()`** — calling `.text()` on a structured envelope returns the JSON dump of the payload, which is rarely what you want. Read `.payload` instead.

- **`verify=` semantics** — the judge must return a verdict starting with `"approved"` (case-insensitive) to accept. Anything else is treated as rejection plus feedback for the next attempt. Bound the loop with `max_verify=...`.

- **`guard=` blocks the engine.** A blocked input or output produces an error `Envelope` without invoking the engine — `result.ok` is `False`, `result.error.type` is `"GuardError"`. Don't expect the agent to "see" the rejected text and self-correct; guards are hard gates.

- **`timeout=None`** (default) leaves the run unbounded. Tool calls inside a runaway agent can block forever. Pick a deadline that matches your SLO.

- **`fallback=` runs the fallback's full pipeline** — its tools, memory, and guards — on the same envelope, with the primary's error threaded into `context`. Configure compatible `output=` and `tools=` on both agents, or the fallback may fail differently.

- **`output_validator=`** is a callable applied to the payload *after* Pydantic validation passes (or directly when `output=str`). Receives the payload, returns the validated payload (may transform). Raise to reject — the framework re-prompts up to `max_output_retries` times with the validator's error message threaded back into the prompt. Useful for application-level invariants that aren't expressible in the Pydantic schema (e.g. "the `start_date` field must come before `end_date`").

- **`cache=True`** enables prompt caching where the provider supports it (Anthropic explicit, OpenAI / DeepSeek auto). Pass `CacheConfig(ttl="1h")` for the longer Anthropic TTL.

- **Nested agents inherit the caller's `session=`** when they have none of their own. This is what gives you transitive cost rollup and a single graph view of the whole tree — pass an explicit `session=None` on a sub-agent only when you genuinely want it invisible.

- **Fleet config via dict spread** — the 0.7-era `runtime` / `resilience` / `observability` configs were deleted in 0.7.9 (they carried a `flat kwarg > config object > default` precedence game with a private `_UNSET` sentinel *value* — distinct from the Plan `sentinels` module (`from_step` / `from_prev` / …) and an LLM trap). Share kwargs across a fleet via a Python dict::

  PROD_DEFAULTS = dict(timeout=60, max_retries=5, cache=True, session=sess) Agent(\*\*PROD_DEFAULTS, engine=LLMEngine("model"), name="agent-X")

## See also

- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — how plain Python functions become tools the agent can call.
- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — the typed result every agent returns.
- [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md) — provider-hosted alternatives via `native_tools=[...]`.
- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — the Engine + Tools + State decomposition.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — every factory and shortcut, with its canonical equivalent.

# Envelope

The single typed object that flows between every engine and agent. You never construct one manually — every `agent(task)` call returns one, and every step in a `Plan` reads one and produces another.

## Signature

```python
from typing import Generic, TypeVar
from pydantic import BaseModel

T = TypeVar("T")


class Envelope(BaseModel, Generic[T]):
    task: str | None = None        # the input task / prompt
    context: str | None = None     # additional context (e.g. previous step output)
    images: list | None = None     # multimodal: list[ImageContent]
    audio: object | None = None    # multimodal: a single AudioContent clip
    payload: T | None = None       # the typed result (str by default; Pydantic with output=)
    metadata: EnvelopeMetadata     # token / cost / latency / provider info
    error: ErrorInfo | None = None # populated when the run failed

    @property
    def ok(self) -> bool: ...      # True iff error is None

    def text(self) -> str: ...     # payload as a string (str verbatim, BaseModel as JSON)

    @classmethod
    def from_task(cls, task, context=None) -> Envelope: ...

    @classmethod
    def error_envelope(cls, exc, *, retryable=False) -> Envelope: ...


class EnvelopeMetadata(BaseModel):
    input_tokens: int = 0
    output_tokens: int = 0
    cost_usd: float = 0.0
    latency_ms: float = 0.0
    model: str | None = None
    provider: str | None = None
    run_id: str | None = None
    # Aggregated from sub-agent calls (agent-as-tool / Plan steps).
    nested_input_tokens: int = 0
    nested_output_tokens: int = 0
    nested_cost_usd: float = 0.0


class ErrorInfo(BaseModel):
    type: str                      # exception class name
    message: str                   # human-readable message
    retryable: bool = False        # whether the resilience layer may retry
```

## Synopsis

`Envelope` is the universal request / response object. It carries:

- **The result** — a string by default, a typed Pydantic instance when the agent was constructed with `output=SomeModel`.
- **Metadata** — token counts, cost in USD, latency in milliseconds, the model and provider that produced it, and a `run_id`. Also `nested_*` aggregation buckets that fill up when the agent called nested agents as tools, so the top-level envelope reflects total pipeline cost without any extra plumbing.
- **An error, if anything went wrong** — `error.type` is the exception class name, `error.message` is the message, and `error.retryable` tells the resilience layer whether a retry might succeed.

Generic typing (`Envelope[Article]`) narrows the payload type for mypy / pyright without changing runtime behaviour. Untyped `Envelope` is `Envelope[Any]` and stays the zero-friction default.

## When you'll see one

- **Every `agent(task)` call** returns one. That's the canonical point of contact.
- **Every step in a `Plan`** receives one (the previous step's envelope) and produces one. Sentinels like `from_prev`, `from_step("name")`, `from_parallel_all("name")` resolve to fields of those envelopes at run time.
- **Every `Agent` wrapped as a tool** also returns one — its metadata is folded into the parent's `nested_*` buckets so cost rollup is transitive.

## When NOT to construct one

- **Almost never directly.** The framework builds envelopes for you on every entry and exit. Manual construction is reserved for two cases:
  - Test fixtures (`Envelope.from_task("test prompt")` creates a ready-to-feed input).
  - Custom engines that need to surface an error path (`Envelope.error_envelope(exc)` is the canonical builder).
- **Don't mutate one in flight.** Envelopes are Pydantic models; if you need to derive one with a changed field, use `env.model_copy(update={"context": new_context})`.

## Example

```python
from lazybridge import Agent, LLMEngine
from pydantic import BaseModel


class Article(BaseModel):
    title: str
    body: str


# Constructing an Agent with structured output narrows Envelope.payload.
writer = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    output=Article,
)

result = writer("write a one-paragraph article on bees")

# 1) Always check .ok before reading .payload in production code.
if result.ok:
    print(result.payload.title)
    print(result.payload.body)
else:
    print(f"failed ({result.error.type}): {result.error.message}")
    if result.error.retryable:
        print("(this error is retryable — the resilience layer may try again)")


# 2) Observability without a Session — metadata is always populated.
m = result.metadata
print(f"cost=${m.cost_usd:.4f}  in={m.input_tokens}  out={m.output_tokens}")
print(f"model={m.model}  provider={m.provider}  latency={m.latency_ms:.0f} ms")


# 3) text() — string regardless of payload shape (str verbatim, BaseModel as JSON).
print(result.text())   # JSON dump of the Article


# 4) Static typing — the checker knows env.payload is an Article.
def first_word(env: "Envelope[Article]") -> str:
    return env.payload.title.split()[0]
```

## Pitfalls

- **`output=SomeModel` + `.text()`** returns the JSON dump of the payload, not the human-readable text. With structured output, read `.payload` directly.
- **`Envelope.from_task(task)` sets `payload=task` for convenience** so the very first agent in a chain sees the input as both `task` and `payload`. Downstream steps see the *preceding step's* `payload`, not the original task — use `from_start` if you need the original input later.
- **`nested_*` metadata is plumbed but not always populated.** For authoritative cross-agent cost numbers in a multi-agent pipeline, query `session.usage_summary()` rather than `envelope.metadata.nested_cost_usd`. The envelope's nested buckets reflect what flowed through *this* envelope's lineage, not the entire run.
- **`error.retryable=False` does not mean "give up forever"** — it means "the resilience layer should not auto-retry this one". A caller's `fallback=` agent is still tried, and you can always re-run the agent yourself.
- **Multimodal attachments only ride on step 0** of a `Plan`. Downstream steps receive upstream output (text), not the original `images=` / `audio=` payload. Pass attachments to the first step.
- **`__str__` falls through to `text()`** — when an `Agent` is used as a tool, the LLM's tool-result block stringifies the envelope via `str(...)`; without this, every nested call would produce `"task=… context=…"` garbage instead of the real answer. Don't override `__str__` on subclasses.

## See also

- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — the producer of envelopes.
- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — `returns_envelope=True` is the hint that lets engines roll up nested cost / token metadata correctly.
- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — `Envelope` is the only piece of "state" that's always present, regardless of whether you opt into `Memory`, `Session`, or `Store`.

# Native tools

Provider-hosted capabilities you turn on by passing an enum, not by writing code. Web search, code execution, file search, and a few others run server-side at the LLM provider — the model decides when to invoke them just like any other tool, but your application doesn't host or implement them.

## Signature

```python
from lazybridge import Agent, LLMEngine, NativeTool, Tool

agent = Agent(
    engine=LLMEngine("claude-opus-4-7"),
    native_tools=[NativeTool.WEB_SEARCH, NativeTool.CODE_EXECUTION],
    tools=[Tool.wrap(my_function, name="my_function")],   # native + custom coexist
)
```

`NativeTool` is a `StrEnum` — string aliases also work: `native_tools=["web_search", "code_execution"]`.

### Available values

| Value                         | Provider(s)               | What it does                                 |
| ----------------------------- | ------------------------- | -------------------------------------------- |
| `NativeTool.WEB_SEARCH`       | Anthropic, OpenAI, Google | General web search                           |
| `NativeTool.CODE_EXECUTION`   | Anthropic, OpenAI         | Sandboxed Python / JavaScript execution      |
| `NativeTool.FILE_SEARCH`      | OpenAI                    | Search across uploaded files                 |
| `NativeTool.COMPUTER_USE`     | Anthropic                 | Screen control (beta)                        |
| `NativeTool.IMAGE_GENERATION` | OpenAI Responses API      | Inline image generation (gpt-image-2 family) |
| `NativeTool.GOOGLE_SEARCH`    | Google                    | Gemini grounded search                       |
| `NativeTool.GOOGLE_MAPS`      | Google                    | Gemini Maps grounding                        |

The `native_tools=` argument is a shortcut on `Agent` equivalent to `Agent(engine=LLMEngine(..., native_tools=[...]))` — the engine is where the values are actually consumed; `Agent` forwards them through.

## Synopsis

Native tools complete the "everything is a tool" picture from the provider side. They behave identically to your own `tools=[...]` from the agent's perspective — the model emits a tool call, the framework routes it, and the result returns to the loop. The difference is who runs the implementation: with a regular `Tool`, your code runs; with a `NativeTool`, the provider's infrastructure runs.

You can mix freely. The model may use a native tool in one turn and your custom function in the next, or both in parallel within a single turn (engines emit parallel tool calls automatically).

## When to use native tools

- **The provider already hosts the capability.** Web search, code execution, file search — implementing these yourself is more work than passing an enum.
- **Your data doesn't need to leave the provider's environment.** Native tools execute server-side at the provider; if that's acceptable, opt in.
- **You want grounded answers** with citations the provider returns natively (web search, Google grounding). Provider-side grounding often produces better source attribution than rolling your own.
- **You want minimal supply chain.** No additional dependencies, no auth secrets to manage, no infrastructure to keep up.

## When NOT to use native tools

- **You need full control of the implementation** — custom auth headers, rate-limit handling, response post-processing, mocking for tests. Write a regular `Tool` instead.
- **You want provider portability.** Native tools tie the agent to a provider that supports the capability. A custom `Tool` runs the same against any provider.
- **The provider doesn't support the tool.** `NativeTool.GOOGLE_SEARCH` on Anthropic raises at the provider layer (see Pitfalls).
- **You need offline / air-gapped operation.** Native tools call out to provider infrastructure; if the agent must work without that, don't use them.

## Example

```python
from lazybridge import Agent, LLMEngine, NativeTool


# 1) Web search — one line of opt-in.
search_agent = Agent(
    engine=LLMEngine("claude-opus-4-7"),
    native_tools=[NativeTool.WEB_SEARCH],
)
result = search_agent("what happened in AI news in April 2026?")
print(result.text())


# 2) Native + custom tools coexist freely.
def read_report(path: str) -> str:
    """Read and return the contents of a local markdown file."""
    return open(path).read()

analyst = Agent(
    engine=LLMEngine("claude-opus-4-7"),
    native_tools=[NativeTool.WEB_SEARCH, NativeTool.CODE_EXECUTION],
    tools=[read_report],
    allow_dangerous_native_tools=True,   # required opt-in for CODE_EXECUTION
)
analyst("cross-reference report.md against current web consensus")


# 3) String aliases — same effect as the enum.
gpt_search = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    native_tools=["web_search"],
)
gpt_search("latest stable Python release?")


# 4) Provider-specific tools — match the model.
gemini_grounded = Agent(
    engine=LLMEngine("gemini-2.5-pro"),
    native_tools=[NativeTool.GOOGLE_SEARCH],
)
```

## Security gate: `allow_dangerous_native_tools`

`NativeTool.CODE_EXECUTION` and `NativeTool.COMPUTER_USE` give the provider broad access — sandboxed code execution at the provider's side, screen control on the user's machine. Both require explicit opt-in via `allow_dangerous_native_tools=True` on either the `Agent` or the `LLMEngine`:

```python
agent = Agent(
    engine=LLMEngine("claude-opus-4-7"),
    native_tools=[NativeTool.CODE_EXECUTION],
    allow_dangerous_native_tools=True,   # without this: ValueError at construction
)
```

The gate runs at **each construction site** that introduces native tools — `LLMEngine(native_tools=...)` validates against its own `allow_dangerous_native_tools=`, and `Agent(native_tools=...)` (whose list is merged into the engine) validates against the Agent's flag. Specify your native tools at the construction site that owns the flag; an already-configured `engine.native_tools` on a pre-built engine is **not** re-checked when you wrap it in `Agent(engine=...)`, so don't rely on `Agent(allow_dangerous_native_tools=False)` to "undo" a permissive engine. `WEB_SEARCH`, `FILE_SEARCH`, `IMAGE_GENERATION`, `GOOGLE_SEARCH`, `GOOGLE_MAPS` are NOT gated by this flag — only the two genuinely dangerous tools.

The default (`allow_dangerous_native_tools=False`) raises `ValueError` at construction with a message naming the offending native tool. Catch it explicitly if you want to fall back to non-dangerous alternatives.

## Pitfalls

- **Provider / tool mismatch fails at run time, not construction.** Mixing `NativeTool.GOOGLE_SEARCH` with an Anthropic model raises at the `complete` call — Agent construction is happy. Match the enum to the provider before you ship; cover this with one integration test per provider you support.
- **`COMPUTER_USE` requires extra setup** — the Anthropic API needs the right beta flag and additional permissions, and the model needs the resolution / tool definitions you're targeting. Read the provider's current docs before turning it on.
- **Billing.** Native tool calls are billed by the provider — search queries, code-execution time, image generation. Where the provider reports usage, the cost shows up in `Envelope.metadata.cost_usd` alongside the model's own tokens; where it doesn't, you may need to consult the provider dashboard for full attribution.
- **`UnsupportedNativeToolError`** is raised at provider time when the provider's native-tool support is incomplete or strict mode is on. Catch it explicitly when you want a graceful fallback to a custom `Tool`.
- **Native tools don't appear in your code path.** `tools=[search]` shows up in stack traces and event logs as `search`; native tools show up as `web_search` (or whatever alias the provider uses) and the implementation lives off-process. Don't expect to set a breakpoint on a native tool's execution.
- **Grounded responses expose sources via the raw provider payload.** When you need attribution, read `CompletionResponse.grounding_sources` (when the provider returns them) rather than parsing them out of the model's text.

## See also

- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — write your own when a native tool is not a fit.
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `native_tools=` is the kwarg that activates these; everything else still composes the same way.
- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — provider-reported costs land in `metadata.cost_usd` together with model-call costs.
- *Guides → Full → Providers* (coming in Phase 3) — which native tools each provider supports and the strict-mode behaviour.

# Tool

A `Tool` is anything an `Agent` can call — a Python function, another agent, an MCP server, an external-tools kit. They all flow through the same `tools=[...]` list and **you almost never construct one yourself**: the framework normalises the list inside `Agent.__init__` and registers each entry under a unique name.

## Signature

```python
from lazybridge import Tool

# Canonical constructor (rarely needed — drop the function in directly).
Tool(
    func,                          # required: the callable
    *,
    name=None,                     # defaults to func.__name__
    description=None,              # defaults to the function's docstring
    mode="signature",              # "signature" | "llm" | "hybrid"
    schema_llm=None,               # engine for mode="llm" / "hybrid"
    # NB: the legacy mode="auto" graceful-fallback ladder was removed
    # in 0.7.9.  Pass an explicit mode; mode="auto" raises ValueError.
    strict=False,                  # provider-strict JSON schema validation
    returns_envelope=False,        # set automatically by agent.as_tool()
)

# Pre-built JSON Schema (MCP, OpenAPI, third-party registries).
Tool.from_schema(
    name,                          # required
    description,                   # required
    parameters,                    # JSON Schema dict
    func,                          # the callable to dispatch
    *,
    strict=False,
    returns_envelope=False,
)
```

For the public `Tool.wrap(...)` factory and `agent.as_tool(...)` method — both are sugar with non-trivial differences — see [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md).

## `tools=` is **not** `native_tools=`

Two parameters, two runtimes — don't conflate them:

| Parameter            | Who executes the tool                 | What goes in it                                                  |
| -------------------- | ------------------------------------- | ---------------------------------------------------------------- |
| `tools=[...]`        | **LazyBridge runtime** (this process) | Python callables, sub-`Agent`s, `MCP*` servers, `Tool` instances |
| `native_tools=[...]` | **The LLM provider's servers**        | `NativeTool` enum values (e.g. `NativeTool.WEB_SEARCH`)          |

`native_tools` is for server-side tools the provider implements itself — Anthropic web search, OpenAI image generation, Google grounding, etc. Dangerous server-side tools (`NativeTool.CODE_EXECUTION`, `NativeTool.COMPUTER_USE`) additionally require `allow_dangerous_native_tools=True` on the `Agent` — a deliberately noisy opt-in, since the executor is no longer in your process. See [Reference → Providers](https://core.lazybridge.com/reference/providers/index.md) for the per-provider native-tool support matrix.

The rest of this page documents `tools=` only.

## Synopsis

LazyBridge accepts six things in `tools=[...]` and normalises them all to `Tool` instances at construction time:

```python
from lazybridge import Agent, LLMEngine, Tool
from lazytools.connectors.mcp import MCP  # pip install lazytoolkit (import name: lazytools)
from lazytools.documents import read_docs_tools

agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[
        plain_function,                              # 1. plain Python function
        Tool.wrap(plain_function, name="custom", strict=True),  # 2. function + overrides via factory
        other_agent,                                 # 3. sub-agent (auto-wrapped)
        other_agent.as_tool(verify=judge),           # 4. sub-agent + judge/retry
        MCP.stdio("fs", command="npx",
                  args=["@modelcontextprotocol/server-filesystem", "."],
                  allow=["fs.read_*", "fs.list_*"]),  # 5. MCPServer (allow= required)
        *read_docs_tools(),                          # 6. lazytools docs kit (list[Tool])
    ],
)
```

The common case is **path 1**: drop the function in. Type hints + docstring drive the JSON schema. Reach for `Tool.wrap(fn, name=..., ...)` when you need to override the name / description / strictness / mode; reach for `Tool.from_schema(...)` when you already have a JSON schema (MCP, OpenAPI, third-party registry). The bare `Tool(...)` constructor is still public for advanced use cases (e.g. typing annotations, isinstance checks) but the `Tool.wrap()` factory is the canonical form for new code.

## Schema modes — `signature`, `hybrid`, `llm`

Every `Tool` carries a JSON Schema that the LLM uses to call it. The schema is built once on first use and cached for the lifetime of the process (an `ArtifactStore` interface lets you persist the cache across runs if you need to). The **mode** controls how that schema is generated.

| Mode                    | What's inspected                                            | Extra LLM call?     | Determinism                                              | Use when                                                                                            |
| ----------------------- | ----------------------------------------------------------- | ------------------- | -------------------------------------------------------- | --------------------------------------------------------------------------------------------------- |
| `"signature"` (default) | Type hints + docstring of `func`                            | no                  | full                                                     | The function already has type hints and a useful docstring                                          |
| `"hybrid"`              | Signature for types + an LLM for parameter descriptions     | one, on first build | high — types are fixed, only descriptions vary           | Types exist but docstrings are sparse / outdated / wrong                                            |
| `"llm"`                 | The function's source code (or stub), interpreted by an LLM | one, on first build | medium — both types and descriptions come from the model | Legacy code with no annotations, `**kwargs`-only signatures, third-party callables you can't modify |

Both `hybrid` and `llm` modes require `schema_llm=<engine>` — usually a cheap-tier LLM dedicated to schema work — and emit a one-shot warning if the model returns an empty or under-specified result.

### `signature` — the canonical default

```python
from lazybridge import Tool

def get_weather(city: str, units: str = "c") -> str:
    """Return current weather for ``city``.

    Args:
        city: City name (e.g. "Paris").
        units: "c" for Celsius (default) or "f" for Fahrenheit.
    """
    ...

tool = Tool.wrap(get_weather, name="get_weather")
```

The generated schema (abbreviated):

```json
{
  "type": "object",
  "properties": {
    "city":  {"type": "string", "description": "City name (e.g. \"Paris\")."},
    "units": {"type": "string", "description": "\"c\" for Celsius (default) or \"f\" for Fahrenheit.", "default": "c"}
  },
  "required": ["city"]
}
```

This is deterministic and free. Reach for the other modes only when this one can't produce a useful schema.

### `hybrid` — types from signature, descriptions from an LLM

The signature has the truth about parameter types and which are required. The LLM only fills in the **descriptions** — the parts the LLM sees when it's deciding whether and how to call the tool.

```python
from lazybridge import LLMEngine, Tool

schema_llm = LLMEngine("claude-haiku-4-5")  # cheap-tier; runs once per tool

def get_weather(city: str, units: str = "c") -> str:
    # Docstring is missing or unhelpful.
    ...

tool = Tool.wrap(
    get_weather,
    name="get_weather",
    mode="hybrid",
    schema_llm=schema_llm,
)
```

Resulting schema: same `{"type": "string"}` types as the signature path, but `"description"` fields now come from the LLM analysing the function source. Required-vs-optional is **still** decided by the signature (the `units: str = "c"` default keeps it optional).

When to prefer `hybrid` over `signature`:

- You inherited a function whose docstring lies about the parameters.
- The team is migrating to type-hinted code and isn't ready to re-document every helper.
- You want consistent LLM-facing descriptions across a large tool kit without hand-writing each one.

When **not** to use it:

- You're shipping a security-sensitive tool. An LLM-generated description could understate the risk (e.g. describe a shell-exec tool as "runs a command") and bias the model toward calling it. Keep `mode="signature"` and own the description.

### `llm` — full LLM-inferred schema

The signature has no useful information — `def legacy(**kwargs)`, an undocumented third-party callable, or a function defined inside a `lambda` you can't annotate. Hand the source (or a stub) to an LLM and let it produce the whole tool definition.

```python
from lazybridge import LLMEngine, Tool

schema_llm = LLMEngine("claude-haiku-4-5")

def legacy_lookup(*args, **kwargs):
    """Best-effort lookup against the v1 reporting API.  Accepts
    a record id and an optional output format.  Returns a JSON
    string."""
    ...

tool = Tool.wrap(
    legacy_lookup,
    name="legacy_lookup",
    mode="llm",
    schema_llm=schema_llm,
)
```

The framework keeps two invariants the LLM cannot break:

1. **Required parameters come from the signature first.** If the signature says a parameter has no default, it stays required even if the LLM forgot it. A `UserWarning` is logged if the LLM omits a signature-required parameter.
1. **Strict mode still applies.** Passing `strict=True` adds `"additionalProperties": false` to the generated schema regardless of what the LLM produced.

Calibrate your expectations: this mode is the **slowest** to bootstrap (one extra LLM round-trip on first use), the **least deterministic** across runs, and the only mode that can produce a schema the function can't actually accept (e.g. an extra parameter the LLM invented). Production use cases should treat it as a one-off migration tool — once it generates a schema that works, copy the result into `Tool.from_schema(...)` and pin it.

### Caching

All three modes cache the built schema per `Tool` instance for the lifetime of the process. For cross-process persistence — e.g. so a serverless function doesn't pay the LLM-bootstrap cost on every cold start — pass an `ArtifactStore` to the tool builder; the `InMemoryArtifactStore` is the in-process default, and the protocol is small enough to back with Redis / SQLite / S3 in two methods (`get` / `put`). See `lazybridge/core/tool_schema.py` for the protocol and the in-memory reference implementation.

### Pinning the result

Once an `llm`/`hybrid` mode produces a schema you're happy with, copy the generated JSON into a `Tool.from_schema(...)` call and remove the `schema_llm=` dependency. This is the canonical pattern for moving from "discover the schema" to "ship the schema":

```python
weather_tool = Tool.from_schema(
    name="get_weather",
    description="Return current weather for a city.",
    parameters={
        "type": "object",
        "properties": {
            "city": {"type": "string", "description": "City name."},
            "units": {"type": "string", "enum": ["c", "f"], "default": "c"},
        },
        "required": ["city"],
    },
    func=get_weather,
)
```

Now the tool is deterministic, free to load, and reviewable in a PR.

## When to construct a Tool explicitly

- **You need a different name than `func.__name__`.** Useful for shadowing a third-party function with a clearer LLM-facing name.
- **You need `strict=True`** for provider-strict JSON-schema validation (Anthropic / OpenAI strict mode).
- **You need `mode="llm"` or `"hybrid"`** because the function lacks type hints or annotations (legacy code, third-party callables, `**kwargs`-only signatures).
- **You're shipping a tool kit.** Library authors return `list[Tool]` from a factory so callers can splat it into `tools=[...]` (e.g. `read_docs_tools()`).

## When NOT to construct a Tool explicitly

- **For ordinary Python functions in your own code.** Just pass the function — `Agent(tools=[my_function])` is the canonical form.
- **For an `Agent` you want to use as a sub-agent.** Pass it directly: `Agent(tools=[other_agent])`. The agent's `name=` becomes the tool name. Use `agent.as_tool("alias")` only to rename or to attach `verify=` (see [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md)).
- **For an MCP server.** Pass the `MCPServer` directly — it's a `ToolProvider` and the framework expands it into per-server-tool `Tool` instances at construction.

## Example

```python
from lazybridge import Agent, LLMEngine, Tool
from pydantic import BaseModel


# 1) Plain function — type hints + docstring drive the schema.
def calculate(expression: str) -> float:
    """Evaluate a basic arithmetic expression and return the result.

    Supports +, -, *, /, parentheses.
    """
    return eval(expression)  # noqa: S307  (trusted inputs only)

calc_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[calculate],
)
result = calc_agent("what is 17 * 23?")
print(result.text())


# 2) Function + explicit configuration — override the name and turn on strict.
calc_tool = Tool.wrap(
    calculate,
    name="calc",
    description="Evaluate an arithmetic expression and return the numeric result.",
    strict=True,
)
strict_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[calc_tool],
)


# 3) Pydantic-typed parameters — coerced from the LLM's raw dict to a typed instance.
class SearchInput(BaseModel):
    query: str
    limit: int = 10

def search(input: SearchInput) -> list[str]:
    """Search the web and return the top ``input.limit`` URLs for ``input.query``."""
    return [f"https://example.com/{input.query}/{i}" for i in range(input.limit)]

researcher = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[search],
)


# 4) Pre-built schema — Tool.from_schema for MCP / OpenAPI bridges.
schema = {
    "type": "object",
    "properties": {
        "city": {"type": "string", "description": "City name"},
        "units": {"type": "string", "enum": ["c", "f"], "default": "c"},
    },
    "required": ["city"],
}

def weather_dispatch(**kwargs):
    """Forward the validated args to a downstream HTTP call."""
    return f"weather for {kwargs['city']}"

weather_tool = Tool.from_schema(
    name="weather",
    description="Get current weather for a city.",
    parameters=schema,
    func=weather_dispatch,
)
weather_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[weather_tool],
)
```

## Pitfalls

- **No type hints → empty schema.** A function with bare parameters produces an empty JSON schema and the LLM will not know how to call it. Always annotate every parameter; defaults are fine.
- **Docstring is the contract.** "Returns the weather" is much weaker than "Returns the current temperature in Celsius and a one-word condition (sunny / cloudy / rainy) for `city`." The LLM reads the docstring; treat it as the spec.
- **`strict=True` rejects optional / defaulted args** under some providers. If a call fails with "unknown parameter", try `strict=False`.
- **Name collisions trigger a `UserWarning`.** The second registration replaces the first. Pick stable, distinct names — especially when mixing your tools with MCP-namespaced ones (`fs.read`, `fs.write`, …).
- **`Pydantic BaseModel` parameters are coerced** from the LLM's raw dict to a typed instance before your function is called. You always receive the model, not the dict — don't write defensive `dict(...)` conversions inside the function body.
- **`returns_envelope=True` is set for you** when the framework wraps an `Agent` as a tool via `agent.as_tool(...)`. Don't set it manually on a plain function — engines that respect the hint will try to read `result.metadata.cost_usd` and crash on a non-Envelope return value.

## See also

- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — the surface that consumes tools.
- [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md) — provider-hosted alternatives passed via `native_tools=[...]` instead of `tools=[...]`.
- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — the result type when a tool is an `Agent` (`returns_envelope=True`).
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — the composition rule that makes all six paths uniform.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `Tool.wrap(...)` factory, `agent.as_tool(...)`, and `Tool.from_schema(...)` with their canonical equivalents. Both `Tool.wrap(...)` and `Tool(...)` default to `mode="signature"`; `mode="hybrid"` / `mode="llm"` are the explicit opt-ins for LLM-driven schema generation.

# Checkpoint & resume

Durable state for long-running plans. After every step (success or fail) `Plan` writes its execution state to a `Store`; a re-run with `resume=True` picks up at the failed step rather than re-running the whole pipeline. Concurrent runs sharing a `checkpoint_key` are serialised by compare-and-swap.

## Signature

```python
from lazybridge import Agent, Plan, Step, Store

Plan(
    *steps,
    store,                         # required for checkpointing
    checkpoint_key,                # required — unique key per run identity
    resume=False,                  # True → pick up at the failed step
    on_concurrent="fail",          # "fail" | "fork"
    max_iterations=100,            # safety valve — caps routing loops;
                                   # the resumed run continues the counter
                                   # from the checkpoint, so a resumed
                                   # router can't silently exceed it.
)


# Persisted shape at store[checkpoint_key] (CHECKPOINT_VERSION = 2).
{
    "next_step": "step_name" | None,
    "kv": {"writes_key": payload, ...},
    "completed_steps": [...],
    "status": "claimed" | "running" | "paused" | "failed" | "done",
    "run_uid": "<hex>",            # CAS ownership stamp; identifies the writer
    "checkpoint_version": 2,
    "history": [...],              # serialised StepResult list (v2 only)
}


# Errors
ConcurrentPlanRunError             # CAS collision when on_concurrent="fail"
PlanCompileError                   # on_concurrent="fork" + resume=True (incompatible)
```

## Synopsis

`Plan` writes its state to `store[checkpoint_key]` after every step. The persisted object captures three things:

- `next_step` — the name of the step the plan would run next. On success, this advances; on failure, it stays pointing at the failing step (or, for parallel bands, at the **band's first** step).
- `kv` — every step's `writes="key"` payload. This is what survives across runs; in-memory step history is rebuilt empty on resume.
- `completed_steps` + `status` — bookkeeping for the resume logic to decide what to skip.

Five state transitions:

- **Claimed** — `status="claimed"`, transient. Written via CAS before any step runs so two concurrent fresh runs collide here rather than corrupting each other later. You'll only see this status if you inspect the store mid-run.
- **Running** — `status="running"`, `next_step=<the next step>`. Normal progression after a successful step.
- **Failed** — `status="failed"`, `next_step=<failing step>` (or the parallel band's first step). A subsequent `resume=True` run retries from there.
- **Paused** — `status="paused"`, `next_step=<paused step>` (or the parallel band's first step). Written when a step raises `PlanPaused` to signal a cooperative halt. A subsequent `resume=True` run re-invokes the paused step. Use this when a step has detected that an external precondition isn't met (webhook hasn't arrived, human approval pending) and the pipeline cannot proceed yet — distinct from `failed` which signals an actual error.
- **Done** — `status="done"`, `next_step=None`. A subsequent `resume=True` short-circuits and returns an envelope whose payload is the cached `kv`.

The persisted `history` field (v2 only) is a serialised `StepResult` list — a resumed run rebuilds in-memory step history from it, which is what makes `from_parallel_all` and the nested- cost rollup behave correctly across crash boundaries. v1 checkpoints (no `history` key) degrade gracefully: in-memory history starts empty and the parallel-band aggregator falls back to the start envelope (legacy pre-W1.3 behaviour, no crash).

`on_concurrent` controls what happens when two runs try to use the same `checkpoint_key` at once:

- `"fail"` (default) — single-writer semantics. The second run raises `ConcurrentPlanRunError`. Pair with `resume=True` for crash recovery.
- `"fork"` — each run claims an isolated keyspace `f"{checkpoint_key}:{run_uid}"`. **Incompatible with `resume=True`** (raises `PlanCompileError` at construction). Use for fan-out workflows where many runs share the same plan shape.

## When to use it

- **Long-running pipelines.** Anything where re-running every step on failure costs real time or money. Crash-resume turns a one-step crash into a one-step retry.
- **Pipelines with expensive early steps and cheap late steps.** The classic shape: `extract` (slow ETL) → `transform` (fast) → `load` (fast). A failure in `load` shouldn't re-run `extract`.
- **Fan-out workflows on a shared store.** `on_concurrent="fork"` lets you run many variants of the same plan against the same Store without collisions.
- **Debugging in production.** A failed run leaves the partial state on disk. You can inspect `store.read("…")` keys before the resume run to understand what the pipeline saw at the failure point.

## When NOT to use it

- **Short, cheap pipelines.** A 3-step pipeline of LLM calls under a few seconds doesn't benefit from the persistence overhead.
- **Pipelines with no `writes=`.** Resume reconstructs from `store["...key"]` writes — if no step writes anything, there's nothing to skip on resume. Add `writes=` to the steps whose outputs the rest of the pipeline depends on.
- **Side effects that aren't crash-safe to repeat.** A failing step is retried as-is on `resume=True`; an external HTTP POST with no idempotency key may run twice. Gate with idempotency keys, deduplication, or a marker write before the side effect.

## Example

```python
from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step, Store


class ValidationReport(BaseModel):
    rejected_rows: list[int]
    accepted_rows: int


def extract_data() -> str:
    return "..."


def transform_records(raw: str) -> str:
    return "..."


def load_warehouse(clean: str) -> None:
    pass


extract = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="extract")
transform = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="transform")
validate = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="validate", output=ValidationReport)
load = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="load")


# 1) Crash-resume across runs.
store = Store(db="pipeline.sqlite")


def build_plan() -> Agent:
    return Agent(
        engine=Plan(
            Step("extract",   writes="raw"),
            Step("transform", writes="clean"),
            Step("validate",  writes="verdict"),
            Step("load",      writes="loaded"),
            store=store,
            checkpoint_key="etl-2026-04-30",
            resume=True,
        ),
        tools=[extract, transform, validate, load],
    )


# Run 1 — crashes during validate. status="failed", next_step="validate".
try:
    build_plan()("today's batch")
except KeyboardInterrupt:
    pass

# Run 2 — resumes at validate. extract + transform are NOT re-run.
build_plan()("today's batch")

# Run 3 — plan is "done"; short-circuits and returns cached kv.
result = build_plan()("today's batch")
print(result.payload)   # {"raw": ..., "clean": ..., "verdict": ..., "loaded": ...}


# 2) Concurrent fan-out runs with on_concurrent="fork".
from concurrent.futures import ThreadPoolExecutor


backtest = Agent(
    engine=Plan(
        Step(load_data,    name="load",  writes="prices"),
        Step(run_strategy, name="run",
             task="Execute the strategy; emit a trade log.",
             writes="trades"),
        Step(score_run,    name="score",
             task="Compute Sharpe and max-drawdown."),
        store=store,
        checkpoint_key="backtest",
        on_concurrent="fork",          # each run gets its own keyspace
    ),
    tools=[load_data, run_strategy, score_run],
)
with ThreadPoolExecutor(max_workers=8) as pool:
    list(pool.map(backtest, ["AAPL", "GOOG", "MSFT", "AMZN"]))


# 3) Inspecting partial state after a failure.
state = store.read("etl-2026-04-30")
print(state["status"])           # "failed"
print(state["next_step"])        # "validate"
print(state["completed_steps"])  # ["extract", "transform"]
print(state["kv"]["clean"])      # the partial result
```

## Pitfalls

- **Changing the Plan and resuming from an old checkpoint will fail.** The saved `next_step` may no longer exist. After refactoring steps, delete the checkpoint: `store.delete(checkpoint_key)`.
- **Non-JSON-serialisable `writes=` values are stringified.** The Store JSON-encodes via `json.dumps(default=str)`; a file handle becomes its `repr(...)`. Prefer primitives, dicts, and Pydantic models (`.model_dump()`-friendly).
- **Resume does not re-inject `session=` or exporters.** Pass the same `session=` + `store=` on every run — the Plan only persists what's behind the `Store` interface, not the live observability wiring.
- **A failed parallel band points the checkpoint at the band's *first* step.** The whole band re-runs cleanly so all sibling `writes` are produced consistently — resuming mid-band would leave earlier branches' Store keys stale relative to the re-run ones.
- **`on_concurrent="fork"` + `resume=True` is a configuration error.** Fork mode gives each run its own key, so there's no shared checkpoint to resume from. The framework raises `PlanCompileError` at construction.
- **`ConcurrentPlanRunError` is a runtime error, not a compile error.** Two processes opening the same SQLite file with the same `checkpoint_key` and `on_concurrent="fail"` collide via CAS; the loser raises. Catch it explicitly if you want graceful retry-after-backoff semantics.
- **Cached "done" runs still cost the storage round-trip.** A short-circuited run returns instantly but still hits the Store to read the cached kv. For very high read rates, layer your own in-process cache.
- **Sidecar consumers should reconcile against the checkpoint snapshot, not the raw Store.** Each step writes its checkpoint *before* the durable `store.write(step.writes, value)` call — this eliminates double-writes on resume (the checkpoint already records `next_step` past the completed step). The trade-off is that a crash in the gap between the two writes loses the durable Store value; the value still lives in the checkpoint's serialised `kv` and is read back on resume, so the Plan continues correctly. Anything reading the Store out-of-band (a dashboard, a sidecar process) should compare the keys against `state["kv"]` to detect this gap.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine that writes checkpoints; covers `max_iterations`, `on_concurrent`, and the full DAG validation surface.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — the durable layer behind checkpoints; the SQLite WAL mode is what makes concurrent reads / writes safe.
- [Step](https://core.lazybridge.com/guides/full/step/index.md) — `writes="key"` is what survives across runs; no `writes=` means no resume value.
- [Parallel plan steps](https://core.lazybridge.com/guides/full/parallel-plan-steps/index.md) — the band-level atomicity rule that drives the "next_step points to the band's first step on failure" behaviour.

# Composition patterns

Most introductions to LazyBridge compose agents **vertically** — `Agent.chain(a, b, c)` for a one-shot linear pipeline, or `Plan(Step("a"), Step("b"), Step("c"))` when you need named steps, typed hand-offs, sentinels, or routing. That's enough for many applications.

This page is the **horizontal** counterpart: how to compose pipelines side-by-side, and how to nest one pipeline inside another. The same `Agent = Engine + Tools + State` mental model applies — the trick is that `Plan` itself is an engine, so an agent whose engine is a `Plan` is a perfectly valid `Step.target` for an **outer** plan. Pipelines compose recursively.

## Vertical recap (one paragraph)

Vertical composition produces a single sequence of steps:

```python
from lazybridge import Agent, LLMEngine, Plan, Step

researcher = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="research")
writer     = Agent(engine=LLMEngine("gpt-5.4-mini"),    name="write")
editor     = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="edit")

pipeline = Agent(
    engine=Plan(Step("research"), Step("write"), Step("edit")),
    tools=[researcher, writer, editor],
    name="vertical_pipeline",
)
```

This is the shape covered by [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) and [Plan](https://core.lazybridge.com/guides/full/plan/index.md). Everything below assumes you've read those.

## Horizontal: a Plan whose step is a Plan

The simplest horizontal composition: one `Step` in an outer `Plan` targets an **agent whose own engine is a `Plan`**. The outer plan sees a single tool call; under the hood, the sub-plan runs its own multi-step pipeline.

```python
from lazybridge import Agent, LLMEngine, Plan, Step, from_step

# --- Sub-pipeline: a self-contained "research" capability ---
search    = Agent(engine=LLMEngine("claude-haiku-4-5"), name="search")
summarise = Agent(engine=LLMEngine("claude-haiku-4-5"), name="summarise")

research_pipeline = Agent(
    engine=Plan(
        Step("search"),
        Step("summarise"),
    ),
    tools=[search, summarise],
    name="research",         # outer plan references this name
)

# --- Outer pipeline: research → write → edit ---
writer = Agent(engine=LLMEngine("gpt-5.4-mini"),    name="write")
editor = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="edit")

pipeline = Agent(
    engine=Plan(
        Step("research"),                                           # nested plan
        Step("write", context=from_step("research")),
        Step("edit"),
    ),
    tools=[research_pipeline, writer, editor],
    name="article_pipeline",
)

result = pipeline("AI agent frameworks, April 2026")
print(result.text())
print(f"total cost: ${result.metadata.cost_usd}")
```

What's happening:

- `research_pipeline` is a regular `Agent`. Its `engine` happens to be a `Plan`, but from the outer plan's perspective it's just a named tool — `Step("research")` resolves to it via the `tools=[research_pipeline, ...]` map.
- Cost / token telemetry from the sub-plan rolls up into the outer envelope's `metadata.nested_*` fields. The single `result.metadata.cost_usd` you read at the end is the **whole tree**.
- Sentinels (`from_step("research")`) work transparently — the outer plan sees `research`'s final envelope, not its intermediate steps.

### When to nest a plan vs flatten the steps

| Shape                                                                                                | When                                                                                                                                                                                                                                                               |
| ---------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| Flat `Plan(Step("search"), Step("summarise"), Step("write"))`                                        | The steps are co-evolving — one team owns them, the data flow is straightforward, and you don't want a separate test surface for "research".                                                                                                                       |
| Nested `Plan(Step("research"), Step("write"))` where `research` is its own `Agent(engine=Plan(...))` | You want **isolation**: the research sub-pipeline ships as a reusable unit, has its own tests, its own checkpoint key if needed, and can be swapped for a different implementation (e.g. a `SupervisorEngine` or a custom engine) without touching the outer plan. |

## Horizontal + parallel: parallel bands of sub-pipelines

`Step(..., parallel=True)` runs sibling steps concurrently. When each "step" is itself a sub-pipeline, you get **N independent pipelines running side by side**, with their outputs aggregated via `from_parallel_all(...)`.

```python
from lazybridge import Agent, LLMEngine, Plan, Step, from_parallel_all

# Three independent research pipelines, each a Plan of its own.
def make_research_pipeline(name: str, source_agent: Agent) -> Agent:
    summarise_agent = Agent(engine=LLMEngine("claude-haiku-4-5"), name="summarise")
    return Agent(
        engine=Plan(
            # Step.target is positional — pass the agent object as ``target=``
            # and override the in-plan name to "search" so the sub-pipeline
            # has stable step names regardless of which source agent it wraps.
            Step(target=source_agent, name="search"),
            # String target → tool-map lookup; ``summarise_agent`` is in
            # ``tools=[...]`` below under its own ``name="summarise"``.
            Step("summarise"),
        ),
        tools=[summarise_agent],
        name=name,
    )

web_research      = make_research_pipeline("web",      Agent(engine=LLMEngine("claude-sonnet-4-6"), name="web_search"))
academic_research = make_research_pipeline("academic", Agent(engine=LLMEngine("claude-sonnet-4-6"), name="academic_search"))
internal_research = make_research_pipeline("internal", Agent(engine=LLMEngine("claude-sonnet-4-6"), name="internal_search"))

synthesiser = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="synthesise")

pipeline = Agent(
    engine=Plan(
        Step("web",      parallel=True),   # branch 1: full sub-pipeline
        Step("academic", parallel=True),   # branch 2: full sub-pipeline
        Step("internal", parallel=True),   # branch 3: full sub-pipeline
        Step("synthesise",
             context=from_parallel_all("web")),   # joins all 3 parallel siblings
    ),
    tools=[web_research, academic_research, internal_research, synthesiser],
    name="multi_source_brief",
)
```

What this gives you:

- **Three independent sub-pipelines** run concurrently via `asyncio.gather`. The framework caps concurrency at `Plan(max_parallel_steps=…)` (defaults to unbounded).
- **`from_parallel_all("web")`** resolves to the labelled-text join of every contiguous `parallel=True` sibling starting at `web` — so `synthesise` sees one input containing all three branches' outputs, each labelled with its branch name.
- **First-error short-circuit**: if any branch errors, the outer envelope carries that error and the remaining branches' results are dropped. Use a `verify=` judge or `fallback=` agent on individual branches if you need graceful degradation.

### When to use parallel bands vs `Agent.parallel(...)`

| Shape                                                                  | When                                                                                                                                                                      |
| ---------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `Agent.parallel(a, b, c)`                                              | You want a **single envelope** out (labelled-text join). No further plan structure needed. See [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md).      |
| `Step("a", parallel=True) … Step("d", context=from_parallel_all("a"))` | The parallel work is **one stage of a larger plan** — there's setup before, aggregation after, sentinels across, and you want crash-resume on the whole thing. This page. |

## Horizontal: agent-as-tool with LLM-decided dispatch

The previous two patterns are **deterministic** — the plan decides which sub-pipelines run. The third horizontal shape hands the decision to an LLM: pass sub-pipelines in the **outer agent's `tools=[...]`** and let the model pick.

```python
from lazybridge import Agent, LLMEngine

# Same three sub-pipelines as above (each is an Agent(engine=Plan(...))).
orchestrator = Agent(
    engine=LLMEngine(
        "claude-sonnet-4-6",
        system=(
            "You have three research sub-pipelines available.  Call only "
            "the ones that match the user's question; combine their results."
        ),
    ),
    tools=[web_research, academic_research, internal_research],
    name="adaptive_research",
)

orchestrator("Compare LangGraph and CrewAI for our use case.")
```

What's different:

- **The LLM chooses** which sub-pipelines to call, in what order, and whether to call multiple in parallel (the engine emits parallel tool calls automatically when the model requests them in the same turn).
- **No `Plan` at the outer layer** — the engine is `LLMEngine`, so there's no compile-time DAG validation, no checkpointing on the outer call, no sentinels. You trade auditability for adaptability.
- **Sub-pipeline internals stay deterministic.** Each sub-agent's own `Plan` still validates at construction, still produces predictable token cost, still respects its own `checkpoint_key=`.

### Choosing between the three horizontal shapes

```text
Do all sub-pipelines always need to run?
├── Yes, sequentially  → outer Plan with one Step per sub-pipeline
├── Yes, concurrently  → outer Plan with parallel=True bands + from_parallel_all
└── No — let the model decide → outer LLMEngine with sub-pipelines in tools=[...]
```

## Diagram — vertical vs horizontal

```text
VERTICAL (chain / linear Plan)
─────────────────────────────
  start → [search] → [summarise] → [write] → [edit] → end
  one process, one path, total latency = sum of steps


HORIZONTAL — Plan-of-Plans
──────────────────────────
  start → ╔════════════ research ════════════╗ → [write] → [edit] → end
          ║ [search] → [summarise]           ║
          ╚══════════════════════════════════╝
  sub-pipeline is one tool to the outer plan; sentinels traverse the boundary


HORIZONTAL — parallel bands of sub-pipelines
────────────────────────────────────────────
              ╔════════ web ════════╗
              ║ [search]→[summarise]║
              ╠══════ academic ═════╣
  start  →    ║ [search]→[summarise]║ →  [synthesise(from_parallel_all)]  → end
              ╠══════ internal ═════╣
              ║ [search]→[summarise]║
              ╚═════════════════════╝
  three sub-pipelines run concurrently; aggregator joins their outputs


HORIZONTAL — LLM-decided dispatch
─────────────────────────────────
                     ┌── web ────────┐
                     │ [Plan...]     │
  start → [LLM] ──── │ academic      │ ── (LLM may call 1-N, may parallel)
                     │ [Plan...]     │
                     │ internal      │
                     │ [Plan...]     │
                     └───────────────┘
  no outer Plan; engine emits tool calls based on the user prompt
```

## Cost and observability across nested boundaries

Every horizontal shape preserves the cost roll-up:

- Inner pipelines write their tokens + cost + latency to their own envelope's `metadata.*`.
- When the outer plan invokes the inner agent as a tool, the inner envelope's metadata is folded into the outer envelope's `metadata.nested_*` fields.
- `Session.usage_summary()` walks the whole tree and gives you one number per provider / model.

There is no "I called three sub-pipelines and have no idea what it cost" failure mode — the rollup is automatic.

## Pitfalls

- **Naming collisions are silent.** If two sub-agents share the same `name=`, the outer tool map registers a single entry (with a `UserWarning`) and the outer plan resolves both `Step("search")` references to the same target. Always name sub-agents distinctly, even when they live in separate sub-pipelines.
- **`Plan(max_iterations=N)` is per-plan, not transitive.** A nested `Plan` that loops forever is invisible to the outer plan's iteration counter. Set sensible caps on every level.
- **Checkpoint keys must be unique across the tree.** If both the outer plan and an inner plan write to the same `Store` with the same `checkpoint_key=`, their state collides. Namespace them (e.g. `"article/research"`, `"article/write"`).
- **Sentinels are per-plan.** `from_step("research")` in the outer plan resolves to the inner agent's **final** envelope — you cannot reach into `from_step("research.search")` to read the inner step's output. If the outer plan needs an inner step's value, surface it via the inner plan's `writes=` to a shared `Store` and read it with `from_agent("research.search")`.
- **First-error short-circuit at each level.** A failing branch in an inner parallel band aborts the inner plan, which aborts the outer step, which aborts the outer plan — unless one of the intermediate agents has a `fallback=` or the failing step is wrapped with `verify=`.

## See also

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — the vertical baseline.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — single-level `Agent.parallel(...)` fan-out; the lighter sibling of parallel plan bands.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the outer-pipeline surface this page composes.
- [Parallel plan steps](https://core.lazybridge.com/guides/full/parallel-plan-steps/index.md) — the `parallel=True` mechanism in depth.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — `from_step`, `from_parallel_all`, `from_agent` semantics across plan boundaries.
- [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — applies independently per plan in the tree; namespace your `checkpoint_key=`.
- [Recipes → Supervisor pattern](https://core.lazybridge.com/recipes/supervisor-pattern/index.md) — an LLM-decided dispatch over sub-agents, the runnable form of the third horizontal shape.

# Exporters

The sinks that consume `Session` events. Five built-ins ship from the core package; an OpenTelemetry exporter ships from `lazybridge.ext.otel`. Compose them — most production setups use two or three at once.

## Signature

```python
# Protocol every exporter satisfies.
class EventExporter:
    def export(self, event: dict) -> None: ...
    def close(self) -> None: ...           # optional; called by Session.close()


# Built-ins from core (lazybridge package).
from lazybridge import (
    CallbackExporter,
    ConsoleExporter,
    FilteredExporter,
    JsonFileExporter,
    StructuredLogExporter,
    EventType,
)

# Every core exporter has a keyword-only constructor.
CallbackExporter(*, fn)                                  # fn: Callable[[dict], None]
ConsoleExporter(*, stream=None)                          # defaults to sys.stdout when None
FilteredExporter(*, inner, event_types)                  # combinator: forward only matching events
JsonFileExporter(*, path)                                # JSON-lines append (one event per line)
StructuredLogExporter(*, logger_name="lazybridge")


# Built-in from ext (lazybridge[otel] extras).
from lazybridge.ext.otel import OTelExporter
OTelExporter(*, endpoint=None, exporter=None, batch=True)
```

Wire any list of exporters into a `Session(exporters=[...])`.

## Synopsis

A `Session` fans every event into all registered exporters in registration order. Each event is a `dict` with at minimum `event_type`, `session_id`, `run_id` (possibly `None`); engine- specific fields are merged in by the emitter.

The full `EventType` enum (`lazybridge.session.EventType`):

| Member                                                      | Emitted by                                                     |
| ----------------------------------------------------------- | -------------------------------------------------------------- |
| `AGENT_START` / `AGENT_FINISH`                              | every `Agent` run, including nested                            |
| `LOOP_STEP`                                                 | each iteration of an `LLMEngine` tool-calling loop             |
| `MODEL_REQUEST` / `MODEL_RESPONSE`                          | every provider call                                            |
| `TOOL_CALL` / `TOOL_RESULT` / `TOOL_ERROR` / `TOOL_TIMEOUT` | every tool dispatch (with `TOOL_TIMEOUT` carrying `timeout_s`) |
| `HIL_DECISION`                                              | one per `HumanEngine` / `SupervisorEngine` decision            |

The five core exporters cover the dev / prod / custom surface:

| Exporter                | What it does                                                 | When                                                |
| ----------------------- | ------------------------------------------------------------ | --------------------------------------------------- |
| `ConsoleExporter`       | Pretty-prints events to stdout                               | Dev — same as `Agent(verbose=True)`                 |
| `JsonFileExporter`      | Appends events as JSON Lines                                 | Durable run logs for offline analysis               |
| `StructuredLogExporter` | Emits via Python `logging`                                   | Integrates with existing log pipelines              |
| `CallbackExporter`      | Calls a user-supplied function                               | Custom sinks: Slack alerts, Prometheus, your own DB |
| `FilteredExporter`      | Wraps another exporter, forwarding only matching event types | Layer onto the others to scope the volume           |

The OpenTelemetry exporter is a separate install — it emits spans conforming to the OpenTelemetry GenAI Semantic Conventions (`gen_ai.system`, `gen_ai.usage.input_tokens`, `gen_ai.tool.call.id`, …) so dashboards built for the standard render LazyBridge traces without translation. Span hierarchy mirrors the run:

- `invoke_agent <name>` (root per `agent.run`)
  - `chat <model>` (one per LLM round-trip)
  - `execute_tool <tool>` (one per tool invocation, correlated by `tool_use_id`)

Cross-agent parenting works automatically through OTel contextvars — an inner agent invoked through a tool becomes a descendant of the outer tool span, no run-id chaining required.

For high-throughput emit paths, pair `Session(batched=True, on_full="hybrid")` with the slow exporters (OTel, JSON-file).

`OTelExporter` defaults to `batch=True`, which wraps the underlying OTLP exporter in `BatchSpanProcessor`. Set `batch=False` to use `SimpleSpanProcessor` instead — primarily useful in tests against an in-memory exporter, where you want each span flushed synchronously on close.

## When to use which

- **`ConsoleExporter`** — local development, REPL inspection. The `Agent(verbose=True)` shortcut creates a private session with this exporter wired in; you don't need to construct it manually unless you also want other sinks.
- **`JsonFileExporter`** — durable per-run logs you can grep, pipe into jq, or load into pandas. The cheap default for any long-running production agent.
- **`StructuredLogExporter`** — integrates with existing `logging` pipelines. Lets central log management (CloudWatch / GCP Logging / ELK / Loki) ingest agent events through the same path as your application logs.
- **`CallbackExporter`** — anything else: alerting, custom DB writes, real-time UIs, metrics. Keep the callback fast; slow callbacks block the session unless `batched=True`.
- **`FilteredExporter`** — wrap one of the above when you only want a slice of events at that sink. Common pattern: an alert callback only sees `TOOL_ERROR` and `AGENT_FINISH`.
- **`OTelExporter`** — distributed tracing. The right answer when you already have an OTel collector and want LazyBridge traces in the same dashboards as the rest of your services.

## When NOT to use exporters

- **You don't have a `Session`.** Exporters are sinks for session events; an agent without a `session=` (and without `verbose=True`) emits nothing.
- **You want raw queryable history rather than push streams.** `Session.events.query(...)` reads the SQLite-backed `EventLog` directly — no exporter required. Use exporters when something *external* needs the events.
- **Your hot path can't tolerate the I/O cost.** Default `Session(batched=False)` blocks the engine on every export. Either batch, or pick exporters whose `export(event)` is microsecond-cheap.

## Example

```python
from lazybridge import (
    Agent,
    CallbackExporter,
    ConsoleExporter,
    FilteredExporter,
    JsonFileExporter,
    LLMEngine,
    Session,
)
from lazybridge.session import EventType


def alert_pagerduty(event: dict) -> None:
    """Fire a PagerDuty incident for tool errors and finish events."""
    ...


# 1) Dev — single console exporter, blocking emit.
sess = Session(console=True)
agent = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    session=sess,
)
agent("hello")


# 2) Production — batched writer + multiple exporters + filtered alerts.
def alert_pagerduty(event: dict) -> None:
    ...


sess = Session(
    db="events.sqlite",
    batched=True,
    on_full="hybrid",
    exporters=[
        JsonFileExporter(path="run.jsonl"),
        FilteredExporter(
            CallbackExporter(fn=alert_pagerduty),
            event_types={EventType.TOOL_ERROR, EventType.AGENT_FINISH},
        ),
    ],
)

researcher = Agent(engine=LLMEngine("gpt-5.4-mini"), name="research")
writer = Agent(engine=LLMEngine("gpt-5.4-mini"), name="write")
pipeline = Agent.chain(researcher, writer, session=sess)
pipeline("AI trends")

# Drain the writer before exit so JsonFileExporter / OTel get every
# in-flight event.
sess.flush()


# 3) OpenTelemetry — install lazybridge[otel].
from lazybridge.ext.otel import OTelExporter

sess.add_exporter(OTelExporter(endpoint="http://otelcol:4318"))
pipeline("more AI trends")
sess.close()


# 4) Custom — a CallbackExporter feeding a Slack channel.
def slack_post(event: dict) -> None:
    if event["event_type"] == EventType.AGENT_FINISH:
        post_to_slack(f"agent {event.get('agent_name')} finished")


sess = Session(
    exporters=[
        FilteredExporter(
            CallbackExporter(fn=slack_post),
            event_types={EventType.AGENT_FINISH},
        ),
    ],
)
```

## Pitfalls

- **Slow exporters block the engine when `Session(batched=False)` (the default).** Set `batched=True` for any exporter doing network I/O — JSON-file disk writes are usually fine unbatched; OTel network calls and external HTTP callbacks are not.
- **Exporter exceptions warn once per instance.** Subsequent failures from the same exporter are silently suppressed. While debugging a noisy exporter, wrap it in `CallbackExporter(fn=lambda e: print(e))` so you see every emission attempt.
- **`OTelExporter` keeps a per-instance tracer rooted in its own `TracerProvider`** so multiple exporters in one process don't fight. The provider is also installed globally as a best-effort default; you can supply your own and pass an in-memory exporter for tests.
- **Stale reads on batched sessions.** When `Session(batched=True)`, `session.events.query(...)` may return rows that don't yet include the most recent events — the writer hasn't drained. Call `session.flush()` (or use `Session.close()`) before querying.
- **Exporters are a per-session list.** A nested agent that inherits the parent's session inherits the parent's exporters. If a sub-agent should be invisible, give it its own `session=Session()`.
- **`FilteredExporter` only filters; it doesn't transform.** Use `CallbackExporter(fn=...)` if you need to rewrite events before forwarding (or wrap multiple sinks in a single callback that does the rewrite + dispatch).

## See also

- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — the bus that fans events into exporters; covers `batched=`, `on_full=`, `redact=`, `usage_summary()`.
- [GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md) — the topology view, separate from the event stream; populated as agents register.
- *Guides → Advanced → OTel* (Phase 3c) — the deeper OpenTelemetry surface (tracer providers, custom resource attributes, in-memory exporters for tests).

# GraphSchema

The agent topology view that a `Session` auto-populates as agents register. Use it to inspect what your pipeline actually looks like after construction, to render in an external diagramming tool, or to persist as part of a run report.

`GraphSchema` is the **descriptor** layer — separate from the event stream that [exporters](https://core.lazybridge.com/guides/full/exporters/index.md) consume.

## Signature

```python
from lazybridge import GraphSchema, Session
from lazybridge.graph import NodeType, EdgeType

# Construction (rare — Session creates one for you).
GraphSchema(session_id="")


# Methods (auto-populated by Session; explicit calls are unusual).
graph.add_agent(agent)
graph.add_router(router)
graph.add_edge(from_id, to_id, *, label="", kind=EdgeType.TOOL)


# Inspection.
graph.nodes()                        # list[_BaseNode]
graph.edges()                        # list[Edge]
graph.edges_from(node_id)            # outgoing edges
graph.edges_to(node_id)              # incoming edges


# Serialisation.
graph.to_dict()
graph.to_json(indent=2)
graph.to_yaml()                      # requires lazybridge[yaml]
graph.save("topology.json")          # extension chooses format
GraphSchema.from_dict(d)
GraphSchema.from_json(s)
GraphSchema.from_file("topology.yaml")


# Enums
class NodeType:
    AGENT
    ROUTER

class EdgeType:
    TOOL                             # outer Agent uses inner via tools=[...]
    CONTEXT                          # data dependency (e.g. shared Memory)
    ROUTER                           # routing edge from a router node
```

## Synopsis

A `GraphSchema` records three kinds of nodes and three kinds of edges:

- **`AgentNode`** — every `Agent` constructed with `session=sess` gets one. Carries `id`, `name`, `provider`, `model`, `system`, `engine_type` (the engine class name — `"LLMEngine"`, `"Plan"`, `"HumanEngine"`, `"SupervisorEngine"`, …), and `tools` (a list of names of plain Python-callable tools registered on the agent; agent-as-tool wrappings are captured as edges, not listed here).
- **`RouterNode`** — explicit router primitive (rare; most routing is encoded directly on `Step(routes=...)` and shows up as an edge instead).
- **`_ToolNode`** — auto-registered for every plain Python-callable tool an agent exposes (i.e. tools where `returns_envelope=False`). Lets the topology show *every* tool stub as a separate node before any execution starts. `_ToolNode.type` is `NodeType.TOOL`; the leading underscore on the class is a hint that the class itself is a small internal stub, not an extension surface, but the enum value is part of the public `NodeType` surface.
- **`Edge(from_id, to_id, label, kind)`** — three flavours:
  - `TOOL` — outer agent's `tools=[...]` includes inner agent or a plain callable. `as_tool` wrappings and the auto-registered `_ToolNode` stubs both produce these. **Edges are idempotent**: the same `(from, to, kind, label)` quadruple registered twice is a no-op, so a tool fired N times in a run shows up once in the graph.
  - `CONTEXT` — data dependency (e.g. a shared `Memory` referenced by another agent's `sources=`).
  - `ROUTER` — routing edges from explicit router nodes.

The graph is **descriptor-only**. Reconstructing a runnable pipeline from a saved graph is the caller's job — `from_dict` / `from_json` give you the topology; you wire the actual `LLMEngine`, `Agent`, `Plan` instances yourself.

## When to use it

- **Debugging.** `print(session.graph.to_json())` after a run confirms the pipeline you built matches what executed. Especially useful when you suspect a typo or missing `as_tool` registration.
- **External rendering.** Save the graph and load it in a diagramming tool, your own UI renderer, or a GraphViz pipeline. YAML / JSON output keeps the format human-friendly.
- **Run reports.** Persist the graph alongside event logs to document what the run looked like — useful for audits and post-mortems.
- **Pipeline diffs.** Hash or normalise the graph to detect unintentional changes to topology between deployments.

## When NOT to use it

- **Live event streams.** Use [exporters](https://core.lazybridge.com/guides/full/exporters/index.md) instead; the graph captures *structure*, not *behaviour*.
- **Reconstructing a live pipeline from JSON.** `from_*` methods return descriptors only — they don't instantiate `Agent` / `LLMEngine` / `Plan` objects. For round-trippable Plan serialisation see *Plan.to_dict* (Phase 3c — Advanced).
- **Cost / token / latency aggregation.** That's `session.usage_summary()`, not the graph.

## Example

```python
from lazybridge import Agent, LLMEngine, Session


sess = Session()


researcher = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="researcher",
    session=sess,
)
writer = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="writer",
    session=sess,
)


# 1) The orchestrator's tools=[...] auto-registers as_tool edges.
orchestrator = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="orchestrator",
    tools=[researcher, writer],
    session=sess,
)


# 2) Inspect what was built.
import json
print(json.dumps(sess.graph.to_dict(), indent=2))
# {
#   "session_id": "...",
#   "nodes": [
#     {"id": "researcher",   "type": "agent", ...},
#     {"id": "writer",       "type": "agent", ...},
#     {"id": "orchestrator", "type": "agent", ...},
#   ],
#   "edges": [
#     {"from": "orchestrator", "to": "researcher", "label": "as_tool", "kind": "tool"},
#     {"from": "orchestrator", "to": "writer",     "label": "as_tool", "kind": "tool"},
#   ]
# }


# 3) Persist + reload (descriptors only; not a runnable pipeline).
sess.graph.save("topology.yaml")

from lazybridge import GraphSchema
replay = GraphSchema.from_file("topology.yaml")
assert len(replay.nodes()) == 3
assert any(e.kind == "tool" for e in replay.edges())


# 4) Filter outgoing edges from the orchestrator.
for edge in sess.graph.edges_from("orchestrator"):
    print(f"{edge.from_id} -> {edge.to_id} ({edge.label})")


# 5) Render with your tool of choice — descriptor-only output is
#    tool-friendly. (graphviz is illustrative, not a dependency.)
import graphviz
g = graphviz.Digraph()
for node in sess.graph.nodes():
    g.node(node.id, label=node.id)
for edge in sess.graph.edges():
    g.edge(edge.from_id, edge.to_id, label=edge.label)
g.render("pipeline.gv")
```

## Pitfalls

- **An agent without `session=` is not registered anywhere.** If you pass it as a nested tool to an agent with a session, the outer agent propagates its session down and registers the nested one for you — but a top-level agent with no session produces no graph entry.
- **Duck-typed reads of `agent.engine.provider` and `agent.engine.model`.** A custom engine that doesn't expose those attributes leaves the corresponding `AgentNode` fields empty (`None`). If you need a custom name in the topology, set them explicitly on your engine.
- **`to_yaml` requires PyYAML.** Install via `pip install lazybridge[yaml]`. `to_json` is stdlib-only.
- **`from_dict` / `from_json` reconstruct descriptors only.** The `provider` / `model` strings on `AgentNode` are inert metadata; there's no live `LLMEngine` behind them. Don't try to `agent.run(...)` against a reloaded graph — wire your own agents with the same names if you want to replay.
- **The graph captures registration time, not runtime.** A conditional route that *could* fire shows up as an edge whether or not it actually fired in this run. For per-run "what actually happened", read the event log via `session.events.query(...)` instead.
- **Manual `add_edge` is rare.** Most edges register automatically when an agent is wrapped via `as_tool` (or passed directly into another agent's `tools=[...]`). Use `session.register_tool_edge(outer, inner, label="")` only when wiring outside of `as_tool` (custom routing primitives, etc.).
- **Engine class name is the fallback for `provider` / `model`.** When an `Agent` is built with a non-LLM engine (`Plan`, `HumanEngine`, `SupervisorEngine`), `_derive_provider_model` fills `provider` and `model` with the engine's class name so non-LLM nodes don't render as empty strings in dumps. If you rely on the `provider` field being a real LLM provider name, filter on `engine_type == "LLMEngine"` first.

## See also

- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — owns the graph; populates it as agents register.
- [Exporters](https://core.lazybridge.com/guides/full/exporters/index.md) — the live event stream; complements the graph's static topology view.
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — every `as_tool` wrapping records one `EdgeType.TOOL` edge automatically.
- *Guides → Advanced → Plan.to_dict* (Phase 3c) — round-trippable Plan serialisation, the runnable counterpart to graph descriptors.

# Parallel plan steps

`Step(parallel=True)` marks a step as a member of a concurrent band: the engine bundles consecutive `parallel=True` steps and dispatches them via `asyncio.gather`. After the band finishes, the next non- parallel step acts as the join. Use a `from_parallel(...)` / `from_parallel_all(...)` sentinel on the join to read the branch outputs.

For application-level scripted fan-out → `list[Envelope]` (no Plan, no aggregation), use [`Agent.parallel`](https://core.lazybridge.com/guides/mid/parallel/index.md) instead.

## Signature

```python
from lazybridge import Step, from_parallel, from_parallel_all

# Mark a step as a band member.
Step(target, *, parallel=True, name="...", writes=None, output=None, ...)


# Sentinels for the join step that reads branch outputs.
from_parallel("branch_name")          # one specific branch
from_parallel_all("first_branch")     # aggregate the whole band as labelled text
```

## Synopsis

A "parallel band" is one or more **consecutive** `Step(parallel=True)` declarations. The engine groups them into a single dispatch unit and runs them via `asyncio.gather`; control flow waits until every branch finishes (success, error, or timeout). The first non-parallel step that follows the band is the **join** — it reads branch outputs via the parallel sentinels.

The idiomatic shape is:

```python
Plan(
    Step(a, name="a", parallel=True),
    Step(b, name="b", parallel=True),
    Step(c, name="c", parallel=True),
    Step(join, name="join",
         task="Synthesise the three branches.",
         context=[from_parallel("a"), from_parallel("b"), from_parallel("c")]),
)
```

The list-context form lets you mix branches with literal strings (e.g. style notes); `from_parallel_all("a")` is the one-line equivalent that produces a single labelled-text join.

### Atomicity

If any branch in the band errors, **no `writes=` from the band are applied** — not even those of succeeded siblings. The first-error envelope propagates as the band's outcome, and the checkpoint points to the band's **first** step. A subsequent `resume=True` re-runs the whole band cleanly. This is intentional: resuming mid-band would leave earlier branches' Store keys stale relative to the re-run ones.

## When to use it

- **Independent steps that can run concurrently** and the next step needs all of their results — multi-source research, multi-region fetches, multi-model ensembles.
- **Map-reduce shapes.** N similar branches process N inputs in parallel, then a summariser folds the results.
- **Parallel side-effects with shared visibility.** Three searchers each `writes="findings_<x>"` — downstream steps with `sources=[store]` see all three live.

## When NOT to use it

- **Application-level fan-out where you just want `list[Envelope]`.** Use [`Agent.parallel`](https://core.lazybridge.com/guides/mid/parallel/index.md) — no Plan, no aggregation.
- **Conditional concurrency.** Routing primitives are silently ignored on parallel branches — the whole band runs every time. If only one branch should run conditionally, route to a non-parallel step instead and decide there.
- **Branches with data dependencies on each other.** If branch B needs branch A's output, they belong in sequence. Parallel bands are for genuinely independent work.
- **Side effects that aren't crash-safe to re-run.** Atomicity re-runs the entire band on any branch failure. If a branch writes to an external system that doesn't tolerate duplicate writes, gate it with idempotency keys or run it sequentially.

## Example

```python
from pydantic import BaseModel

from lazybridge import (
    Agent,
    LLMEngine,
    Plan,
    Step,
    Store,
    from_parallel,
    from_parallel_all,
)


def search_anthropic(query: str) -> str:
    """Search Anthropic's bulletins."""
    return "..."


def search_openai(query: str) -> str:
    """Search OpenAI's bulletins."""
    return "..."


def search_google(query: str) -> str:
    """Search Google's bulletins."""
    return "..."


anthropic_search = Agent(engine=LLMEngine("deepseek-v4-flash"), tools=[search_anthropic], name="search_a")
openai_search = Agent(engine=LLMEngine("deepseek-v4-flash"), tools=[search_openai], name="search_o")
google_search = Agent(engine=LLMEngine("deepseek-v4-flash"), tools=[search_google], name="search_g")
synthesiser = Agent(engine=LLMEngine("deepseek-v4-flash"), name="synth")


# 1) Three branches → join with explicit per-branch sentinels.
store = Store(db="monitor.sqlite")
plan = Agent(
    engine=Plan(
        Step("search_a", parallel=True, writes="findings_a"),
        Step("search_o", parallel=True, writes="findings_o"),
        Step("search_g", parallel=True, writes="findings_g"),
        Step("synth",
             task="Compare the three sources; flag agreement and disagreement.",
             context=[
                 from_parallel("search_a"),
                 from_parallel("search_o"),
                 from_parallel("search_g"),
                 "Style: terse, factual, no superlatives.",
             ],
             writes="brief"),
        store=store,
    ),
    tools=[anthropic_search, openai_search, google_search, synthesiser],
)
plan("framework update — April 2026")


# 2) Same shape, one-line aggregation via from_parallel_all.
plan = Agent(
    engine=Plan(
        Step("search_a", parallel=True, writes="findings_a"),
        Step("search_o", parallel=True, writes="findings_o"),
        Step("search_g", parallel=True, writes="findings_g"),
        Step("synth",
             task="Compare the three sources; flag agreement and disagreement.",
             context=from_parallel_all("search_a"),     # the FIRST branch's name
             writes="brief"),
    ),
    tools=[anthropic_search, openai_search, google_search, synthesiser],
)


# 3) Map-reduce — N items processed in parallel, summarised at the end.
class ItemResult(BaseModel):
    item: str
    score: float


class Report(BaseModel):
    summary: str
    items: list[ItemResult]


def make_pipeline(items: list[str]) -> Plan:
    branches = [
        Step(item_processor, name=f"proc_{i}", parallel=True,
             task=f"Run end-of-day analysis on {item}.",
             writes=f"out_{i}", output=ItemResult)
        for i, item in enumerate(items)
    ]
    return Plan(
        *branches,
        Step(summariser, name="summary",
             task="Summarise the per-ticker analyses.",
             context=from_parallel_all(branches[0].name),
             output=Report),
    )


agent = Agent(engine=make_pipeline(["AAPL", "GOOG", "MSFT"]))
agent("end-of-day market scan")
```

## Pitfalls

- **Only *consecutive* `parallel=True` steps are bundled.** A non-parallel step in between starts a new band. Keep parallel siblings contiguous.
- **`from_parallel_all("X")` requires X to be the FIRST step of its band.** Mid-band references fail at construction. PlanCompiler walks forward from the named step; if an earlier parallel sibling exists, the reference is rejected.
- **The join is implicit — the first non-parallel step after the band.** If you forget to read branches via `from_parallel(...)` or `from_parallel_all(...)`, the join sees only `from_prev`, which resolves to the last completed branch (timing-dependent and rarely what you want).
- **Routing is ignored on parallel branches.** A `parallel=True` step's `routes=` / `routes_by=` is silently dropped — parallel bands have their own control flow. Set routing on the join instead.
- **Atomicity re-runs the whole band on any branch failure.** Branches that write to external systems (HTTP POSTs, DB inserts) need idempotency keys to be safe under retry.
- **`max_iterations` counts each branch.** A wide band of N parallel steps consumes N from the iteration budget. Raise `Plan(max_iterations=...)` accordingly for very wide fan-outs.
- **Per-branch errors don't propagate as exceptions.** The first failing branch's error envelope becomes the band's outcome; every other branch's result is discarded along with its `writes=`. If you need partial-success semantics, run the branches as `Agent.parallel(...)` and inspect the `list[Envelope]` yourself.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine that orchestrates parallel bands.
- [Step](https://core.lazybridge.com/guides/full/step/index.md) — `parallel=True` is one of the step's three concurrency-and-naming fields.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — `from_parallel("name")` and `from_parallel_all("name")` semantics.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — application-level scripted fan-out with `list[Envelope]` return; complementary, not redundant.
- [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — band-level atomicity drives the "next_step points to the band's first step" rule on failure.

# Plan

The deterministic-orchestration engine. A `Plan` is a declared sequence of `Step`s with explicit data flow, validated at construction time — broken references, duplicate names, and unknown route targets fail before any LLM call. Pass it to an `Agent` like any other engine.

## Signature

```python
from lazybridge import Agent, Plan, Step, Store, Tool

Plan(
    *steps,                        # one or more Step instances
    max_iterations=100,            # cap on total step executions per run; guards against routing loops
    store=None,                    # Store for writes= / checkpointing
    checkpoint_key=None,           # str — required for resume
    resume=False,                  # pick up at the failed step (Phase 3b: Checkpoint & resume)
    on_concurrent="fail",          # "fail" | "fork"
)


# Concurrent fan-out — N inputs against the same Plan shape.
# Pair with on_concurrent="fork" so each input run claims its own
# isolated keyspace.
plan.run_many(tasks, *, concurrency=None, ...)   # sync — returns list[Envelope]
await plan.arun_many(tasks, *, concurrency=None, ...)  # async equivalent


# Construction errors (all raised at Agent construction).
PlanCompileError                   # invalid DAG: dangling refs, duplicates, malformed routes
ConcurrentPlanRunError             # raised at runtime CAS when two runs share a checkpoint_key

# Persisted state shapes.
PlanState                          # checkpoint: plan_id, current_step, next_step, store, history, status
StepResult                         # one record per executed step: step_name, envelope, ts


# Use as an Agent engine.
pipeline = Agent(
    engine=Plan(Step("a"), Step("b")),
    tools=[a, b],
)
```

## Synopsis

`Plan` is the engine for declared, multi-step pipelines. Every step has a named target, a typed input/output, an explicit data source (via [sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md)), and optionally writes its payload to a [`Store`](https://core.lazybridge.com/guides/mid/store/index.md) bucket the rest of the pipeline can read. All of that is validated at **construction time** — `PlanCompileError` fires before the agent ever runs.

A `Plan` does not call an LLM by itself; it dispatches each step to its `target` (an `Agent`, a callable, or a tool name resolved on the wrapping agent). The orchestration layer is the deterministic part; the per-step targets are where LLMs (or other engines) actually run.

`Agent.chain(*agents)` is the one-line sugar for the simplest `Plan` shape — purely linear text hand-offs. Reach for `Plan` directly when you need any of:

- typed hand-offs (`Step(output=Model)` instead of free-form text);
- conditional routing (`Step(routes=...)` or `Step(routes_by="field")`);
- parallel bands (`Step(parallel=True)`);
- named writes to a `Store`;
- crash resume.

## When to use it

- **Multi-step pipelines that need to be auditable.** The DAG is visible at construction; reviewers can read a `Plan(Step(...), Step(...))` block and know the topology without running anything.
- **Production workflows where the LLM should *not* decide the order.** When determinism, repeatability, or cost predictability matter, lift control flow out of the model and into the `Plan`.
- **Pipelines that span multiple agents and need typed payloads between them.** `Step(output=Model)` preserves the type at the step boundary; `Agent.chain` flattens to text.
- **Workflows with conditional branching, fan-out / fan-in, early-out, or self-correction loops.** `routes`, `routes_by`, `parallel=True`, and `from_parallel_all` cover the canonical shapes.
- **Crash-resumable runs.** `Plan(store=..., checkpoint_key=..., resume=True)` writes plan state after every step; a re-run with `resume=True` picks up at the failed step.

## When NOT to use it

- **One agent, one model call.** That's `Agent(engine=LLMEngine(...))`. No `Plan` needed.
- **Linear text hand-offs with no other features.** Use `Agent.chain(...)` — it's sugar for the simplest `Plan` and reads better at the call site.
- **LLM-directed dispatch** ("the model decides which agent to call"). Use `Agent(tools=[a, b, c])`. `Plan` is for the *opposite* case — explicit, declared flow.
- **Deterministic fan-out → list of envelopes.** Use [`Agent.parallel(...)`](https://core.lazybridge.com/guides/mid/parallel/index.md) — its return shape is `list[Envelope]`, which a `Plan` step can't natively produce.

## Example

```python
from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step, Store, from_prev, from_step, Tool


class Hits(BaseModel):
    items: list[str]


class Ranked(BaseModel):
    top: list[str]


def search_web(query: str) -> str:
    """Return search hits for ``query``."""
    return "..."


searcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[Tool.wrap(search_web, name="search_web")],
    name="search",
)
ranker = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="rank",
)
writer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="write",
)


# 1) Linear typed pipeline — search → rank → write.
pipeline = Agent(
    engine=Plan(
        Step("search",
             task="Search the web for the user's topic.",
             writes="hits",
             output=Hits),
        Step("rank",
             task="Rank these search hits by relevance; return the top 5.",
             context=from_prev,
             output=Ranked),
        Step("write",
             task="Write a 200-word brief from the ranked items below.",
             context=from_step("rank")),
    ),
    tools=[searcher, ranker, writer],
    store=Store(db="research.sqlite"),
)
result = pipeline("AI trends April 2026")
print(result.text())


# 2) The same pipeline as Agent.chain (sugar — only works because the
#    flow is purely linear with text hand-offs).
sugar_pipeline = Agent.chain(searcher, ranker, writer, name="research")
```

For the full surface — typed payloads, routing, parallel bands, crash-resume, fan-out runs — see the dedicated guides: [Step](https://core.lazybridge.com/guides/full/step/index.md), [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md), [Routing](https://core.lazybridge.com/guides/full/routing/index.md), and the Phase 3b guides *Parallel plan steps* and *Checkpoint & resume*.

## Pitfalls

- **`max_iterations` is a safety net for routing loops** (default 100). Hitting the cap returns a `MaxIterationsExceeded` error envelope — not a crash. Lower it during development to fail fast; raise it for legitimate long plans.
- **Cyclic routing is not a compile error.** `routes` cycles (`A → B → A`) may be intentional (self-correction loops) and surface at runtime as `MaxIterationsExceeded`. Pair every loop-routing pattern with a counter or termination predicate.
- **`resume=True` without `store=` is a silent no-op.** Pass both, and pick a `checkpoint_key`.
- **`on_concurrent="fork"` + `resume=True` is a configuration error.** Fork mode gives each run its own keyspace, so there's no shared checkpoint to resume from. The framework raises at construction.
- **`PlanCompileError` catches** duplicate step names, dangling `from_step` / `from_parallel` / `from_parallel_all` references, forward references, mid-band `from_parallel_all` start, unknown `routes=` targets, malformed `routes_by=` Literal types, and predicates that aren't callable. Read the error message — it names the offending step.
- **Plan writes go through the same `Store` as application writes.** Namespace your keys (`"pipeline_research/hits"` rather than `"hits"`) so a step's `writes=` doesn't collide with unrelated state.
- **`Step("name")` resolves the name on the wrapping agent's `tools=[...]` map.** `Plan(Step("research"))` with no `tools=[...]` on the agent is a `PlanCompileError` — the target has nowhere to resolve. `Step(target=researcher)` (the agent itself) is the alternative — it dispatches via `target.run()` directly with no tool-map lookup.

## See also

- [Step](https://core.lazybridge.com/guides/full/step/index.md) — the per-step anatomy: target, task, context, sources, writes, output.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — wiring data between steps (`from_prev`, `from_step`, `from_parallel_all`, `from_memory`, `from_agent`).
- [Routing](https://core.lazybridge.com/guides/full/routing/index.md) — `routes={...}` predicate map and `routes_by="field"` LLM-decided dispatch, plus `when` DSL.
- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — the sugar for the linear case.
- [Nested pipelines](https://core.lazybridge.com/guides/full/composition-patterns/index.md) — Plan-of-Plans, parallel bands of sub-pipelines, and LLM-decided dispatch over sub-pipelines (the horizontal counterpart to this page).
- *Guides → Full → Parallel plan steps* (Phase 3b) — `parallel=True` bands and `from_parallel_all` aggregation.
- *Guides → Full → Checkpoint & resume* (Phase 3b) — `store=`, `checkpoint_key=`, `resume=True`, `on_concurrent=`.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `Agent(engine=Plan(*steps))` vs `Agent(engine=Plan(*steps))`.

# ReplanEngine

The adaptive-orchestration engine — the dynamic counterpart to [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md). Where `Plan` compiles a fixed DAG at construction time, `ReplanEngine` lets a **planner agent** decide the shape of the work at runtime: it is called every round, its output drives which tools run, and a Store-backed checkpoint is written after every round so a restart resumes from the correct round without re-executing completed work.

Pass it to an `Agent` like any other engine.

## Signature

```python
from lazybridge import Agent, LLMEngine, ReplanEngine, Store
from lazybridge.engines.replan import PlanRound, Task

ReplanEngine(
    planner_name="planner",   # name of the planner tool in the parent Agent's tool_map
    store=None,               # Store for checkpoint/resume
    checkpoint_key=None,      # str — required to enable persistence
    resume=False,             # continue from the last checkpoint on the next call
    max_rounds=20,            # safety cap on replan rounds; guards against bad termination
)
```

`ReplanEngine` has no constructor injection of the planner or workers — it follows LazyBridge's **"everything is a tool"** principle. Everything it dispatches is resolved from the parent `Agent`'s `tool_map` at run time:

- The **planner** is a `Tool` in `tools=[]`, built with `output=PlanRound`, and located by `planner_name`.
- The **workers** (agents, plain functions, pool routes) are also in `tools=[]`. Each task is dispatched verbatim via `tool.run(**task.kwargs)` — no special-casing for pools or agents.

## The two output types

The planner emits a `PlanRound` each turn; `ReplanEngine` deserialises it and dispatches its tasks.

```python
class Task(BaseModel):
    tool: str                 # name of a tool in the tool_map
    kwargs: dict[str, Any]    # forwarded verbatim to tool.run(**kwargs)
    parallel: bool = True     # True → run concurrently with adjacent parallel siblings

class PlanRound(BaseModel):
    reasoning: str            # why this set of tasks was chosen
    tasks: list[Task]         # tasks to execute this round
    done: bool = False        # True → stop; final_answer required
    final_answer: str | None  # the user-facing answer (required when done=True)
```

Tasks within the same round flagged `parallel=True` run concurrently via `asyncio.gather`; `parallel=False` tasks run sequentially after the parallel group. **Dependent tasks belong in the next round** — after the planner has seen the outputs from this one.

## When to use it

| Use…                                                                     | when…                                                                                                                                                      |
| ------------------------------------------------------------------------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [`LLMEngine`](https://core.lazybridge.com/reference/engines/#llm-engine) | a single agent calls tools in a loop and you need no persistence — the built-in tool-calling loop already does ReAct.                                      |
| **`ReplanEngine`**                                                       | the *shape* of the work depends on the query and intermediate results — structured replan rounds, explicit parallelism, and checkpoint/resume on the loop. |
| [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md)          | the step topology is **fixed and known up front** (DAG compiled at construction).                                                                          |

`ReplanEngine` is "ReAct on tasks": the planning unit is a *batch* of tasks rather than a single tool call.

## Minimal example — planner + plain functions

You do **not** need a hierarchy of sub-agents. The workers can be plain Python functions; the only required `Agent` is the planner.

```python
from lazybridge import Agent, LLMEngine, ReplanEngine
from lazybridge.engines.replan import PlanRound

def fetch(url: str) -> str:
    """Download a page."""
    return f"[contents of {url}]"

def word_count(text: str) -> int:
    """Count words."""
    return len(text.split())

planner = Agent(
    engine=LLMEngine("claude-opus-4-8", system="You are a task planner. Emit one PlanRound per round."),
    output=PlanRound,
    name="planner",                      # ← ReplanEngine finds it by this name
)

agent = Agent(
    engine=ReplanEngine(max_rounds=5),
    tools=[planner, fetch, word_count],  # workers are just functions
    name="agent",
)

print(agent("Download example.com and tell me how many words it has").text())
```

Why not just `LLMEngine`?

For a single agent that reasons and calls tools in a loop, `LLMEngine` already does ReAct — you don't need `ReplanEngine`. Reach for `ReplanEngine` when you want **structured replan rounds**, **explicit parallelism**, or **checkpoint/resume** on the loop.

## Parallel fan-out across workers

The planner can emit several independent tasks in one round; they run concurrently. Dependent work goes in the next round.

```python
research = Agent(
    engine=LLMEngine("claude-sonnet-4-6", system="You look up facts via web_search. No math."),
    tools=[web_search], name="research", description="Web lookups. Cannot do math.",
)
math = Agent(
    engine=LLMEngine("claude-sonnet-4-6", system="You do arithmetic with add/multiply."),
    tools=[add, multiply], name="math", description="Arithmetic only.",
)
writer = Agent(
    engine=LLMEngine("claude-sonnet-4-6", system="You synthesise prior results into prose."),
    name="writer", description="Final synthesis. Adds no new facts.",
)

guardian = Agent(
    engine=ReplanEngine(max_rounds=10),
    tools=[planner, research, math, writer],
    name="guardian",
)

env = guardian(
    "Combined headcount of Apple and Google in 2024, then write a paragraph "
    "on what those numbers say about their staffing strategies."
)
print(env.text())
```

A round the planner might emit (the `PlanRound` schema):

```python
PlanRound(
    reasoning="The two headcounts are independent → run them in parallel.",
    tasks=[
        Task(tool="research", kwargs={"task": "Apple headcount 2024"},  parallel=True),
        Task(tool="research", kwargs={"task": "Google headcount 2024"}, parallel=True),
    ],
    done=False,
)
# next round: Task(tool="math", ...) to sum them, then Task(tool="writer", ...)
```

The planner's system prompt does **not** hardcode worker names — `ReplanEngine` injects the available tool schemas and the accumulated history into every planner call dynamically.

## Checkpoint & resume

For long or expensive pipelines, pass `store=` **and** `checkpoint_key=` to persist round state after every round. Pass `resume=True` to continue from the last checkpoint on the next call.

```python
from lazybridge import Agent, ReplanEngine, Store

store = Store(db="project.sqlite")

guardian = Agent(
    engine=ReplanEngine(
        store=store,
        checkpoint_key="report-apple-google",   # unique key per run
        resume=True,                             # continue from the last checkpoint
        max_rounds=20,
    ),
    tools=[planner, research, math, writer],
    name="guardian",
)

guardian("…the long query…")     # first session — checkpoints each round
guardian("continue")             # resumes from the last completed round
```

Semantics match [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md):

- The `store` alone does nothing — persistence is keyed on `checkpoint_key`. Without it, every run is in-memory.
- The first call **claims** the key via compare-and-swap.
- With **`resume=False`**, a second run against a key already held by another run raises `ConcurrentPlanRunError` — fail-fast, single-writer. Use a unique `checkpoint_key` for a fresh concurrent run.
- With **`resume=True`**, a second call **adopts** the existing checkpoint instead of raising (it stamps its own `run_uid`). This is what lets *you* resume your own crashed or paused run — but it is not a concurrency guard: do **not** point two `resume=True` workers at the same key, or the adopter will preempt the still-running one, which then loses its next checkpoint CAS. Give each concurrent run its own `checkpoint_key`.
- A completed run (`status="done"`) short-circuits on the next `resume=True` call and returns the cached `final_answer` immediately.

## Termination & safety

- **`max_rounds`** is the safety net for bad termination logic. If the planner keeps emitting `done=False`, the loop bails after this many rounds. Set it defensively.
- **`done=True` requires `final_answer`.** `ReplanEngine` rejects a `done` round with a `None` answer *before* writing a permanent `done` checkpoint — otherwise every future `resume=True` call would short-circuit with an empty payload.
- **Pathological case**: a planner that emits `done=False` with an empty task list spins until `max_rounds`. Mitigate by steering the planner to set `done=True` with a `final_answer` when no tasks remain.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the static alternative when the topology is known up front.
- [Dynamic re-planning recipe](https://core.lazybridge.com/recipes/dynamic-replanning/index.md) — the runnable end-to-end example this guide is drawn from.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — application-layer fan-out used inside a round.
- [Engines reference](https://core.lazybridge.com/reference/engines/index.md) — the auto-generated `ReplanEngine`, `PlanRound`, and `Task` API.

# Routing

Conditional flow inside a `Plan`: when one step's output decides which step runs next. Two forms — `routes={...}` for code-decided branches, `routes_by="field"` for LLM-decided branches — with the same call-site visibility and the same compile-time validation.

## Signature

```python
from lazybridge import Step, when

# Form A — predicate map. Your code decides.
Step(
    target,
    name="...",
    routes={
        "target_step_name": predicate,        # Callable[[Envelope], bool]
        "another_target":   another_predicate,
    },
    after_branches=None,                       # str — optional rejoin point
)

# Form B — Literal field. The LLM decides.
Step(
    target,
    name="...",
    output=SomeModel,                          # must declare a Literal[...] field
    routes_by="field_name",                    # name of that Literal field
    after_branches=None,
)
```

`routes` and `routes_by` are mutually exclusive — use one (or neither) per step. Both place the routing decision **at the call site** so reviewers can see the branch table without reading any Pydantic class.

### The `when` DSL

`routes` predicates are callables `(Envelope) -> bool`. Writing them as raw lambdas works but is dense; the `when` DSL makes the common shapes declarative.

```python
from lazybridge import when

# Field-level checks (the workhorse)
when.field("items").empty()              # field is None or zero-length container
when.field("items").not_empty()
when.field("severity").equals("urgent")  # ==
when.field("severity").not_equals("spam")
when.field("approved").is_(True)         # `is` — for True / False / None
when.field("kind").in_({"a", "b"})       # membership
when.field("kind").not_in_({"a", "b"})
when.field("score").greater_than(0.5)
when.field("score").less_than(0.5)
when.field("text").matches(r"^urgent")   # re.search

# Strict mode — typos in field names raise instead of silently routing wrong
when.field("items", strict=True).empty()

# Escape hatches
when.payload(callable)                   # callable(payload) -> bool
when.envelope(callable)                  # callable(envelope) -> bool
when.errored()                           # True iff envelope carries an error
```

## Synopsis

Routing in `Plan` is **declared on the `Step`**, never hidden inside the output model. Two forms:

**Form A — `routes={...}` (predicate map).** Your code decides the branch. The framework calls each predicate in declared order with the step's output envelope; the **first one returning `True`** makes Plan jump to that target step. If none returns `True`, linear progression continues to the next declared step.

**Form B — `routes_by="field"` (Literal field).** The LLM decides the branch. You declare a `Literal[...]`-typed field on the step's `output=Model`; the framework reads `env.payload.<field>` after the step runs. If the value is a string matching a step name, Plan jumps there; if `None` or unmatched, linear progression continues.

Both forms are validated at construction. Unknown route targets, malformed Literal types, predicates that aren't callable — all caught by `PlanCompileError` before any LLM call.

### Detour vs. exclusive branch

By default, routing is a **detour**: when step A routes to step X, Plan runs X and then **resumes linear progression from X's declared position**. There is no implicit "no fall-through" mode.

Set `after_branches="step_name"` alongside `routes` or `routes_by` to make routing **exclusive**: only the matched branch runs; all declared steps between the routing step and the rejoin point are skipped; execution continues at `step_name` after the chosen branch completes.

| Pattern                                                          | When to use                                                                                                            | Resume behaviour                                                                                                                                 |
| ---------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------ |
| **Exclusive branch with rejoin** *(default for forward routing)* | Predicates cover all outcomes; `after_branches="<rejoin>"` set                                                         | Routed branch runs, then jumps to `<rejoin>`; sibling branches are skipped                                                                       |
| **Skip optional middle step**                                    | Single forward predicate, routed-to step is the *last* declared *and* serves as the rejoin for the non-routed path too | Routed: jumps over optional steps to the last step. Non-routed: linear, runs intermediate steps, then the last step. Both end at the same place. |
| **Loop / self-correction**                                       | Backwards route to an earlier step; no `after_branches=`                                                               | Route fires → run earlier step → linear walks forward from there until the predicate stops firing. `max_iterations` is the safety net.           |

Don't use a single forward predicate without a rejoin

`routes={"X": predicate}` with no `after_branches=` and `X` not at the end of the declared list is **always a bug**. When the predicate doesn't fire, control falls through linearly and runs *every* step between the routing step and the end — including `X` itself. Either cover every outcome with explicit predicates plus `after_branches=`, or use the "Skip optional middle step" pattern where `X` is genuinely the last step.

## When to use which form

- **`routes={...}`** when *your code* decides — programmatic checks on the typed payload (empty list, score below threshold, regex match, multi-field combinator). Cheap, deterministic, no extra LLM call.
- **`routes_by="field"`** when *the LLM* decides — classification, triage, intent detection. The model emits a Literal value as part of its structured output; you don't write the dispatch logic.

## When NOT to use routing

- **Linear pipelines.** Just stack `Step`s in declared order; no routing fields needed.
- **Parallel fan-out where every branch should run.** Use `Step(parallel=True)` on each branch; no routing primitives are involved.
- **Crash recovery.** That's `resume=True` + `checkpoint_key=`, not routing. Routing is *control flow within a single run*; resume is *durability across runs*.
- **Loop counters.** Routing back to an earlier step works for bounded retries via `max_iterations`, but if you need an explicit attempt counter, persist it via `writes=` and read it via a sentinel rather than relying on routing alone.

## Example

```python
from typing import Literal

from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step, Store, from_prev, from_start, from_step, when


# 1) Form A (predicate map) — empty-search early-out via the when DSL.
#
# The routes table covers BOTH outcomes (when.empty() AND when.not_empty())
# so linear fall-through never fires.  after_branches="log_outcome"
# guarantees every chosen branch ends at the same rejoin point.  For
# multi-step branches (e.g. rank → write), wrap them in a nested Plan
# and pass it as a single Step's target — see the routing.md "Detour vs.
# exclusive branch" section.
class Hits(BaseModel):
    items: list[str]


searcher      = Agent(engine=LLMEngine("gpt-5.4-mini"), name="search", output=Hits)
writer        = Agent(engine=LLMEngine("gpt-5.4-mini"), name="write")
apology_agent = Agent(engine=LLMEngine("gpt-5.4-mini"), name="apology")
log_outcome   = Agent(engine=LLMEngine("gpt-5.4-mini"), name="log_outcome")


plan = Agent(
    engine=Plan(
        Step("search",
             output=Hits,
             routes={
                 "apology": when.field("items").empty(),       # empty hits
                 "write":   when.field("items").not_empty(),   # has hits
             },
             after_branches="log_outcome"),         # every branch lands here
        Step("write",       task="Write a 200-word brief from the search hits."),
        Step("apology",     task="Apologise; suggest broader terms."),
        Step("log_outcome", task="Emit one line of metrics for this run."),
    ),
    tools=[searcher, writer, apology_agent, log_outcome],
)
# Execution shapes (verified against _routing()):
#   items=[] (empty):    search → apology → log_outcome
#   items=[...] (some):  search → write → log_outcome


# 2) Form B (routes_by) — LLM-decided triage with exclusive branching.
class Triage(BaseModel):
    summary: str
    severity: Literal["urgent", "normal", "spam"] | None = None


classifier = Agent(engine=LLMEngine("gpt-5.4-mini"), name="classify", output=Triage)
escalator  = Agent(engine=LLMEngine("gpt-5.4-mini"), name="urgent")
triager    = Agent(engine=LLMEngine("gpt-5.4-mini"), name="normal")
closer     = Agent(engine=LLMEngine("gpt-5.4-mini"), name="spam")
archiver   = Agent(engine=LLMEngine("gpt-5.4-mini"), name="archive")


plan = Agent(
    engine=Plan(
        Step("classify",
             output=Triage,
             routes_by="severity",            # reads env.payload.severity
             after_branches="archive"),       # skip siblings; rejoin at "archive"
        Step("urgent",  task="Page on-call; open P0."),
        Step("normal",  task="Add to support backlog."),
        Step("spam",    task="Close as spam."),
        Step("archive", task="Log to the audit archive."),    # always runs
    ),
    tools=[classifier, escalator, triager, closer, archiver],
)


# 3) Self-correction loop — route back when the reviewer rejects.
class Verdict(BaseModel):
    feedback: str
    approved: bool


reviewer  = Agent(engine=LLMEngine("gpt-5.4-mini"), name="review")
publisher = Agent(engine=LLMEngine("gpt-5.4-mini"), name="publish")


store = Store()
plan = Agent(
    engine=Plan(
        Step("write",
             task="Draft a 200-word answer; if a 'verdict' is in the store, "
                  "rewrite addressing the feedback.",
             context=from_start,
             sources=[store],
             writes="draft"),
        Step("review",
             task="Score the draft; approved=True only if accuracy + tone + length all pass.",
             context=from_prev,
             output=Verdict,
             writes="verdict",
             # Loop back to the writer when rejected.
             routes={"write": when.field("approved").is_(False)}),
        Step("publish", task="Final-format and publish.", context=from_step("write")),
        # Without max_iterations, an infinite-rejection bug would loop
        # forever. 8 attempts is a defensible upper bound for most
        # policies; tune to your SLA.
        max_iterations=8,
    ),
    tools=[writer, reviewer, publisher],
    store=store,
)


# 4) Lambda escape hatch for one-off predicates — still covers BOTH branches
#    and uses after_branches.  Lambdas don't change the safety contract.
plan = Agent(
    engine=Plan(
        Step("classify",
             output=Hits,
             routes={
                 "apology": lambda env: not env.payload.items,
                 "rank":    lambda env: bool(env.payload.items),
             },
             after_branches="log_outcome"),
        Step("rank",        task="Rank the search hits."),
        Step("apology",     task="Apologise; suggest broader terms."),
        Step("log_outcome", task="Emit metrics."),
    ),
    tools=[searcher, writer, apology_agent, log_outcome],
)


# 5) Custom predicate function for multi-field combinators — paired with its
#    explicit complement so coverage is exhaustive.  Skip-on-no-match is a
#    bug, not a shorthand.
class Score(BaseModel):
    score: float
    topic: str


def needs_review(env) -> bool:
    """Route to review when the score is low AND the topic is sensitive."""
    return env.payload.score < 0.5 and env.payload.topic in {"medical", "legal"}


def safe_to_auto(env) -> bool:
    """Catch-all complement of needs_review."""
    return not needs_review(env)


score_classifier = Agent(engine=LLMEngine("gpt-5.4-mini"), output=Score, name="classify")
auto_agent       = Agent(engine=LLMEngine("gpt-5.4-mini"), name="auto")
review_agent     = Agent(engine=LLMEngine("gpt-5.4-mini"), name="review")
audit_agent      = Agent(engine=LLMEngine("gpt-5.4-mini"), name="audit")

plan = Agent(
    engine=Plan(
        Step("classify",
             routes={
                 "review": needs_review,
                 "auto":   safe_to_auto,
             },
             after_branches="audit"),
        Step("auto"),
        Step("review"),
        Step("audit"),     # rejoin terminal — always runs
    ),
    tools=[score_classifier, auto_agent, review_agent, audit_agent],
)
```

## Pitfalls

- **The detour trap — single-predicate forward routing.** The single most common bug. `routes={"X": predicate}` only fires *when the predicate is True*. When it's False, the Plan falls through linearly and runs every declared step between the routing step and the end — *including the step you thought was reachable only via routing*. Symptoms: the "branch-only" step runs even on the success path, and the final output comes from the wrong step. Fixes (in order of preference): (1) cover every outcome explicitly with multiple predicates plus `after_branches="<rejoin>"`; (2) use `routes_by="field"` with an exhaustive `Literal[...]` output; (3) use the "skip optional middle step" pattern where the routed-to step is genuinely the last in declared order.
- **`routes_by` requires `output=` to be a Pydantic model with the named field as `Literal[...]` (or `Literal[...] | None`).** Anything else fails at construction. The compiler also verifies every Literal value matches a declared step name.
- **Routing cycles are not a compile error.** `A → B → A` may be intentional (self-correction loops). They surface at runtime as `MaxIterationsExceeded` once `Plan(max_iterations=...)` fires. Always pair a loop with a counter or termination predicate.
- **Predicate evaluation order matters.** `routes={...}` evaluates in declared order; the first `True` wins. If multiple predicates can match, put the more specific one first.
- **`routes_by` and `routes` are mutually exclusive.** Setting both on the same step fails at construction.
- **`routes_by="field"` returning `None`** (or any value not matching a step name) means "don't route" — linear progression continues. This is *not* an error; design your Literal type accordingly.
- **`when.field(name)` is non-strict by default.** A typo in the field name silently returns `None`, which can mask routing bugs (e.g. a payload that always routes to the empty branch because the predicate sees nothing). Use `when.field(name, strict=True)` to make typos raise.
- **`when` chains return predicates, not bools.** `when.field("x").empty()` is a callable; `Step` invokes it later. Don't accidentally call it eagerly (`when.field("x").empty()(env)` works but is rarely what you mean to write).
- **Routing primitives are ignored on parallel branches.** A `parallel=True` step's `routes=` / `routes_by=` is silently dropped — parallel bands have their own control flow. Set routing on the step *after* the band.
- **A predicate that raises is wrapped as `PlanRuntimeError`.** The engine catches the underlying exception and re-raises it as `PlanRuntimeError` (a `RuntimeError` subclass) with the offending step name, target, and underlying error class in the message. Distinct from `PlanCompileError` (build-time DAG validation) so caught-at-runtime predicate bugs don't conflate with caught-at-construction DAG bugs.
- **`after_branches` must come AFTER the routing step in declared order.** A typo or a backward reference fails fast at construction with a `PlanCompileError` message that names both positions. The rejoin point is also validated for existence.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine that interprets routing decisions.
- [Step](https://core.lazybridge.com/guides/full/step/index.md) — the surface that carries `routes=`, `routes_by=`, and `after_branches=`.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — sentinels are about *data flow*; routing is about *control flow*. They don't overlap.
- [verify=](https://core.lazybridge.com/guides/mid/verify/index.md) — judge-and-retry around an output; complementary to routing for "wrong → try again" semantics.
- *Guides → Full → Parallel plan steps* (Phase 3b) — concurrent bands have their own control flow; routing primitives don't apply.

# Sentinels

How a `Step` says where its input comes from. Without sentinels you'd thread arguments manually through every step; with them, the data flow is one declaration per step. All sentinel references are validated at `Plan` construction time — typos become `PlanCompileError` before any LLM call.

## Signature

```python
from lazybridge import (
    from_prev,                     # singleton: previous step's output (default)
    from_start,                    # singleton: original user task
    from_step,                     # callable: from_step("name") — named prior step
    from_parallel,                 # callable: from_parallel("name") — alias for from_step
    from_parallel_all,             # callable: from_parallel_all("name") — aggregate a parallel band
    from_memory,                   # callable: from_memory("name") — agent's live conversation history
    from_agent,                    # callable: from_agent("name") — agent's last output from Store
)


# Valid placements on a Step:
Step(target, task=<sentinel>)
Step(target, context=<sentinel>)
Step(target, context=[<sentinel>, <sentinel>, "literal string"])
```

### The seven sentinels

| Sentinel                 | Reads                                                                         | Resolved                   | Compile-time validation                                                                                                                       |
| ------------------------ | ----------------------------------------------------------------------------- | -------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------- |
| `from_prev`              | The previous step's output                                                    | Plan execution history     | — (always available)                                                                                                                          |
| `from_start`             | The Plan's original input                                                     | Plan execution history     | —                                                                                                                                             |
| `from_step("n")`         | A named prior step's output                                                   | Plan execution history     | `"n"` must name an earlier step                                                                                                               |
| `from_parallel("n")`     | Same as `from_step("n")` — different name signals intent                      | Plan execution history     | `"n"` must name an earlier step (validated identically to `from_step`).                                                                       |
| `from_parallel_all("n")` | Every consecutive parallel sibling starting at `"n"`, joined as labelled text | Plan execution history     | `"n"` must exist, come earlier, be `parallel=True`, AND be the FIRST member of its band (the step immediately before it must be non-parallel) |
| `from_memory("n")`       | The live `Memory` of the agent registered under `"n"`                         | Step execution time (live) | The named tool must be an agent with `memory=` attached                                                                                       |
| `from_agent("n")`        | The last output of agent `"n"` from a shared `Store`                          | Step execution time (live) | The tool must be an agent (`returns_envelope=True`) AND the source agent must have `store=` attached                                          |

## Synopsis

Sentinels split into two categories:

**Plan-only** — resolve against the Plan's execution history at step dispatch time. They cannot reach outside the current `Plan.run`:

- `from_prev` — the workhorse. The default for `task=`. Each step reads the one before it.
- `from_start` — the original user task. Use it when a step needs the input regardless of intermediate processing (verification, re-routing, fresh framing).
- `from_step("name")` — name-keyed access to any earlier step's output. The compiler validates the name; a typo fails at construction.
- `from_parallel("name")` — alias for `from_step` that reads better at the call site when the referenced step ran concurrently with siblings.
- `from_parallel_all("name")` — aggregator. Folds every consecutive `parallel=True` step starting at `name` into one envelope whose payload is a labelled-text join (`"[step_a]\n<text>\n\n[step_b]\n<text>"`). See *Parallel plan steps* (Phase 3b) for the full mechanics.

**Universal** — resolve at step *execution* time and work both inside a `Plan` and standalone:

- `from_memory("name")` — reads the *live* `Memory` of the agent registered under `name`. Always reflects the most recent conversation history; absent or empty memory contributes nothing (silent no-op).
- `from_agent("name")` — reads the *last output* of agent `name` from a shared `Store`. Every successful agent run writes to `__agent_output__:{alias}`; `from_agent` reads it back. Works across runs and outside the current `Plan`.

The store key is always the **alias** passed to `as_tool("alias")`, not the agent's internal `name=`.

## When to use which

- **Inside the same Plan, prefer `from_step("name")`** over `from_agent("name")`. `from_step` reads from in-memory step history (no Store required), is validated more tightly at compile time, and is cheaper.
- **Use `from_agent("name")` only when** you need the agent's output independent of the current Plan's history: across Plan runs, in a standalone LLM orchestrator with no step history, or in a step that needs the output of an agent invoked *outside* this Plan.
- **Use `from_memory("name")` for** conversation continuity — when a downstream agent should see what an upstream agent has *been talking about* (memory), not just its last output (Store).
- **Use `from_start` when** the original user task is the right prompt for a step that's deep in the pipeline (verifier, apology branch, fresh-framing summariser).

## Example

```python
from pydantic import BaseModel

from lazybridge import (
    Agent,
    LLMEngine,
    Memory,
    Plan,
    Step,
    Store,
    from_agent,
    from_memory,
    from_prev,
    from_start,
    from_step,
)


store = Store(db="pipeline.sqlite")
mem = Memory(strategy="summary")


# Researcher with memory + store — feeds both `from_memory` and
# `from_agent` references downstream.
researcher = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    memory=mem,
    store=store,
    name="research",
)
fact_checker = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    store=store,
    name="fact_check",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="write",
)


# 1) Mixed sentinels in a single Plan.
plan = Agent(
    engine=Plan(
        Step("research"),
        # fact_checker sees the researcher's output as task,
        # the original user task as context.
        Step("fact_check",
             task=from_prev,
             context=from_start),
        # writer sees the researcher's live memory PLUS the fact_checker output.
        Step("write",
             context=[from_memory("research"), from_step("fact_check")]),
    ),
    tools=[researcher, fact_checker, writer],
    store=store,
)
plan("AI trends April 2026")


# 2) Multi-source synthesis via context=[...] — no combiner step needed.
class Brief(BaseModel):
    title: str
    body: str


policy_loader = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="policy",
)
synthesiser = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="synth",
    output=Brief,
)


plan2 = Agent(
    engine=Plan(
        Step("research"),
        Step("policy"),
        Step("synth",
             task="Draft a brief citing both sources.",
             context=[
                 from_step("research"),
                 from_step("policy"),
                 "Style: neutral, third person, no superlatives.",
             ]),
    ),
    tools=[researcher, policy_loader, synthesiser],
)


# 3) from_agent across runs — read what the researcher produced
#    in a previous Plan execution.
standalone = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[researcher],
    store=store,
)
standalone("find AI trends")

# Later, in a different Plan / process, with the same store:
later_plan = Agent(
    engine=Plan(
        Step("write",
             task="Write a follow-up brief based on prior research.",
             context=from_agent("research")),
    ),
    tools=[writer],
    store=store,
)
```

## Pitfalls

- **`from_prev` after a parallel band returns the *join step's* output, not one of the branches.** Use `from_parallel("<branch-name>")` for a specific branch or `from_parallel_all("<first-branch-name>")` for the aggregate.
- **Sentinels are module-level imports.** Don't shadow them with local variables of the same name (`from_prev = "literal"` is a bug waiting to happen).
- **A `str` passed as `task=` is a LITERAL, not a sentinel reference.** `task="from_prev"` sets the step's task to the string `"from_prev"`. Use the imported `from_prev` symbol.
- **`from_memory("n")` is a silent no-op when the agent hasn't run.** Empty memory contributes nothing, no error. If you want fail-fast behaviour, validate the memory yourself before the Plan dispatches the step.
- **`from_agent("n")` requires `store=` on the source agent.** PlanCompiler rejects it at construction time when the named agent has no store attached. The error message names the offending agent — pass `store=...` to that agent, not just to the Plan.
- **`from_agent("n")` requires the tool to be an agent** (`returns_envelope=True`), not a plain function. PlanCompiler rejects plain-function targets at construction time.
- **The Store key is the *alias*, not the agent's `name=`.** When the agent is wrapped via `agent.as_tool("alias")`, the auto-write key is `__agent_output__:alias`. `from_agent("alias")` reads it back. Mixing these up is one of the most common sources of confusion — keep aliases stable across runs that share a Store.
- **`from_parallel_all("n")` requires `n` to be the FIRST step of its parallel band.** Mid-band references fail at construction with a clear error.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine that interprets sentinels.
- [Step](https://core.lazybridge.com/guides/full/step/index.md) — the surface that consumes sentinels via `task=` and `context=`.
- [Routing](https://core.lazybridge.com/guides/full/routing/index.md) — sentinels are about *data flow*; routing is about *control flow*. They don't overlap.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — the backing store for `from_agent` and the receiver of `Step(writes=...)`.
- [Memory](https://core.lazybridge.com/guides/mid/memory/index.md) — the conversation-history layer that `from_memory` reads live.
- *Guides → Full → Parallel plan steps* (Phase 3b) — `from_parallel_all` aggregation in full.

# Step

The unit a `Plan` is built from. Each `Step` declares a target (an agent, a callable, or a tool name), the task it runs, where its input comes from, what types it expects, and whether to persist its payload. Routing is also declared on the step — see [Routing](https://core.lazybridge.com/guides/full/routing/index.md) for the full surface.

## Signature

```python
from lazybridge import Step, from_prev

Step(
    target,                        # str (tool name) | Callable | Agent
    task=from_prev,                # Sentinel or literal string — the prompt for this step
    context=None,                  # Sentinel | str | list[Sentinel | str] — side context
    sources=(),                    # iterable of objects with .text() (live-view injection)
    writes=None,                   # str — Store key the payload is persisted under
    input=Any,                     # type annotation for the step's input (informational)
    output=str,                    # type for Envelope.payload (Pydantic class enables validation)
    parallel=False,                # mark as a member of a parallel band
    name=None,                     # unique within the Plan; defaults to target's name
    # Routing — see Routing guide for the full surface.
    routes=None,                   # dict[str, Callable[[Envelope], bool]]
    routes_by=None,                # str — name of a Literal field on `output` model
    after_branches=None,           # str — exclusive-branch rejoin point
)
```

`task=from_prev` is the default and means "feed me whatever the previous step produced". A literal string is used verbatim — useful for hard-coded prompts at intermediate steps where the data flows through `context=` instead.

## Synopsis

A `Step` is the smallest declarative unit `Plan` orchestrates. Its fields fall into three groups:

**Data flow.** `target` is *what runs*; `task=` is *the prompt*; `context=` is *the side data*; `sources=` is *the live-view objects* (e.g. a `Store` or `Memory` whose current state should be appended verbatim). All four flow into the step's `Envelope` before the target runs.

**Typing and persistence.** `input=` and `output=` declare the step's expected input and output types; `output=PydanticModel` enables validation and unlocks `routes_by="field"`. `writes="key"` persists the step's payload to `store["key"]` after a successful run — required for checkpoint resume and for downstream agents reading via `sources=[store]`.

**Concurrency and naming.** `parallel=True` marks the step as a member of a concurrent band (consecutive `parallel=True` steps are dispatched together via `asyncio.gather`). `name=` is the authoritative key the rest of the plan references — duplicates and typos surface at construction time.

**Routing fields** (`routes`, `routes_by`, `after_branches`) get their own dedicated guide: [Routing](https://core.lazybridge.com/guides/full/routing/index.md).

## When to use specific fields

- **`task=` literal string** — for *specialised* steps where the prompt is fixed and the data flows through `context=`. Example: `Step("rank", task="Rank by relevance.", context=from_prev)`.
- **`task=` sentinel** — for *delegating* steps where the previous step's output **is** the prompt. Default `from_prev` is appropriate most of the time.
- **`context=` single sentinel** — pull data from one upstream step.
- **`context=[...]` list** — synthesise from multiple upstream steps without an intermediate combiner. Items resolve independently and join with blank-line separators (same shape as `sources`); literal strings can ride along to inject fixed boilerplate.
- **`sources=`** — for *live-view* state that should reflect the most recent value at step execution time (`Store`, `Memory`, any object with `.text()`). Sentinels resolve once at the start of the step; sources re-materialise on every read.
- **`output=Model`** — when the next step's `context=` will read a typed payload, or when you want compile-time validation of a `routes_by="field"` reference.
- **`writes="key"`** — for *crash recovery* (`resume=True` reconstructs from store writes) and for *cross-agent reads* (a downstream agent with `sources=[store]` sees the live key).
- **`parallel=True`** — when the step has no data dependency on its declared neighbours and concurrent execution is safe.

## When NOT to use specific fields

- **Don't set `output=Model` purely for "type docs".** It activates Pydantic validation, structured-output retry, and `routes_by` semantics. If you want documentation, use `description=` or a comment.
- **Don't use `writes=` for in-Plan-only data.** A downstream step can read upstream output via `from_step("name")` directly from the in-memory history — no `Store` needed unless the value must also survive a crash or be visible to other agents.
- **Don't set `name=` to a string that collides with a tool.** When the target is a string (`Step("research")`), the framework resolves it against the wrapping agent's `tools=[...]` map. The step's `name=` and the resolved tool's name are the same key — there's no separate "step name" to disambiguate.

## Example

```python
from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step, Store, from_prev, from_step


class Hits(BaseModel):
    items: list[str]


class Ranked(BaseModel):
    top: list[str]


def normalise(text: str) -> str:
    """Strip and lowercase — pure Python, no LLM."""
    return text.strip().lower()


searcher = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="search",
)
ranker = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="rank",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="write",
)


# 1) Mixed step targets — agents, plain callables, tool names by string.
plan = Agent(
    engine=Plan(
        Step(searcher, name="search"),                 # Agent target
        Step(normalise, name="clean", task=from_prev), # plain callable target
        Step("score",   name="score", task=from_prev), # tool-name string target
        Step(writer,    name="write",
             task="Write a 150-word brief.",
             context=from_step("clean")),
    ),
    tools=[score_tool],                                # the "score" tool name resolves here
)


# 2) Multi-source synthesis with context=[...].
class Brief(BaseModel):
    title: str
    body: str


synth = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="synth",
    output=Brief,
)
policy_loader = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="policy",
)
competitor = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    name="bench",
)


plan = Agent(
    engine=Plan(
        Step(searcher,      name="search",  writes="hits",    output=Hits),
        Step(policy_loader, name="policy",  task="Load the 2026 acceptable-use policy."),
        Step(competitor,    name="bench",   task="Find three relevant prior posts."),
        Step(synth,         name="synth",
             task="Draft a 300-word brief; cite each source explicitly.",
             context=[
                 from_step("search"),
                 from_step("policy"),
                 from_step("bench"),
                 "Style: neutral, third-person, no superlatives.",
             ],
             output=Brief),
    ),
)


# 3) Live-view sources — a writer reads any prior reviewer verdict
#    on every loop iteration.
class Verdict(BaseModel):
    feedback: str
    approved: bool


store = Store()
plan = Agent(
    engine=Plan(
        Step(writer, name="write",
             task="Draft a 200-word answer. If a 'verdict' is in the store, "
                  "rewrite the previous draft addressing the feedback.",
             sources=[store],                           # live read on every run
             writes="draft"),
        Step(reviewer, name="review",
             task="Score the draft.",
             context=from_prev,
             output=Verdict,
             writes="verdict"),
    ),
    store=store,
)
```

## Pitfalls

- **`task=str` is a literal, not a sentinel reference.** `task="from_prev"` puts the literal string `"from_prev"` into the step's task. Use the imported `from_prev` symbol.
- **`output=Model` is for typing the payload, not for routing.** A `next` field on the model is just a regular field. To declare routing, set `routes={...}` or `routes_by="field"` on the step (see [Routing](https://core.lazybridge.com/guides/full/routing/index.md)).
- **`writes=` does not deduplicate.** Two steps with `writes="result"` overwrite the same key. Pick distinct keys or namespace them.
- **`Step(target=callable, name="...")` doesn't get an LLM.** The callable runs once with the step's task as its argument. Useful for normalisation, validation, or any deterministic transformation between LLM steps.
- **`Step("name")` requires the name to resolve.** If the wrapping agent has no `tools=[...]` matching `"name"`, the framework raises `PlanCompileError` at construction. Either pass the agent in `tools=[...]`, or change the step to `Step(target=agent)` directly.
- **`parallel=True` is bundled with consecutive parallel steps.** The engine groups every adjacent `parallel=True` step into one band; a non-parallel step in between starts a new band. Keep parallel siblings contiguous in the declaration.
- **A failed parallel branch wipes the band's writes.** No `writes=` from the band are applied on error — `resume=True` re-runs the whole band cleanly. Don't write side effects inside parallel steps that aren't crash-safe to repeat.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine that interprets a list of `Step`s.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — `from_prev` / `from_start` / `from_step` / `from_parallel` / `from_parallel_all` / `from_memory` / `from_agent` semantics in full.
- [Routing](https://core.lazybridge.com/guides/full/routing/index.md) — `routes={...}` predicates, `routes_by="field"` Literal dispatch, `after_branches=` rejoin points, and the `when` DSL.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — `writes=` lands here; `sources=[store]` reads it back live.
- *Guides → Full → Parallel plan steps* (Phase 3b) — concurrent bands and `from_parallel_all` aggregation.

# SupervisorEngine

The heavier human-in-the-loop variant. Where [`HumanEngine`](https://core.lazybridge.com/guides/mid/human-engine/index.md) is a single approval prompt, `SupervisorEngine` is a full REPL: the operator can call tools, retry registered agents with feedback, inspect store keys, and hand control back when ready. Drop it into a `Plan` step like any other engine.

## Signature

```python
from lazybridge import Agent, Tool
from lazybridge.ext.hil import SupervisorEngine, supervisor_agent

# Canonical — Agent + SupervisorEngine
SupervisorEngine(
    *,
    tools=None,                    # list[Tool | Callable | Agent]
    agents=None,                   # list[Agent] — agents the human can `retry`
    store=None,                    # Store the human can `store <key>` to inspect
    input_fn=None,                 # Callable[[str], str] — sync prompt
    ainput_fn=None,                # Callable[[str], Awaitable[str]] — async prompt
    timeout=None,                  # seconds; on expiry triggers default= or raises TimeoutError
    default=None,                  # str returned on timeout
)

agent = Agent(
    engine=SupervisorEngine(
        tools=[search],
        agents=[researcher],
        store=store,
    ),
    name="supervisor",
)


# Sugar — same agent, less plumbing
agent = supervisor_agent(
    tools=[search],
    agents=[researcher],
    store=store,
    name="supervisor",
)
```

The sugar `supervisor_agent(...)` lives in `lazybridge.ext.hil` and forwards engine kwargs to `SupervisorEngine` and remaining `**agent_kwargs` to `Agent`. See [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md).

### REPL commands

| Command                     | Effect                                                                                                           |
| --------------------------- | ---------------------------------------------------------------------------------------------------------------- |
| `continue [optional text]`  | Accept; return optional text as the engine's output. Only terminator.                                            |
| `retry <agent>: <feedback>` | Re-run the named registered agent with feedback appended to the task; the output replaces the supervisor buffer. |
| `store <key>`               | Print `store[key]`.                                                                                              |
| `<tool>(<args>)`            | Invoke a registered tool with the given arguments.                                                               |

Unknown commands print help and re-prompt. Only `continue` ends the REPL.

## Synopsis

`SupervisorEngine` implements the same `Engine` protocol as `LLMEngine`, so any agent that accepts an engine can swap one in. The difference is that the engine's "model" is a human at a REPL who can inspect, modify, and dispatch — not a one-shot prompt.

The REPL runs on a worker thread so the caller's event loop is not blocked; `input_fn` is called there. For automated tests, pass a scripted `input_fn` that returns canned responses. For non-terminal contexts, pass an `ainput_fn` that wires into your event system.

`agents=` registers Agent instances that the human can `retry` with feedback. The engine resolves the named agent, appends the feedback to its task, and runs it; the output replaces the supervisor's current buffer so subsequent commands operate on the new value.

## When to use it

- **Interactive debugging of complex pipelines.** Drop in a supervisor step after the misbehaving agent; the operator can retry it with feedback, inspect store keys, and continue once the output is correct.
- **High-stakes operational steps.** A pipeline that posts public content, sends emails, or updates production data benefits from a supervisor gate where the human can verify the draft, retry sub-agents, or call additional tools before approving.
- **Demos and live walkthroughs.** A supervisor mid-pipeline lets you steer the demo without restarting the whole run.
- **Sensitive automation under SLA.** An on-call operator can intervene when the agent's output looks wrong, without giving up the rest of the pipeline's automation.

## When NOT to use it

- **Approval-only flows.** Use [`HumanEngine`](https://core.lazybridge.com/guides/mid/human-engine/index.md) — it's lighter and doesn't pull in REPL machinery the operator doesn't need.
- **Background / unattended pipelines.** A supervisor with `timeout=None` (default) hangs forever waiting for input. Pair with `timeout=...` + `default="continue"` if you need unattended fallback, or use a different engine.
- **Web / HTTP-served agents where the human isn't at a terminal.** Pass an `ainput_fn` adapter that wires into your request queue / websocket; or build a custom UI on top of `HumanEngine`'s simpler protocol.

## Example

```python
from lazybridge import Agent, LLMEngine, Plan, Step, Store, Tool
from lazybridge.ext.hil import SupervisorEngine


def search_web(query: str) -> str:
    """Search the web for ``query``."""
    return "..."


researcher = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[Tool.wrap(search_web, name="search_web")],
    name="researcher",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="writer",
)


# 1) Standalone supervisor — REPL with tool dispatch + agent retry.
store = Store()
store.write("policy", "publish only peer-reviewed sources")

supervisor = Agent(
    engine=SupervisorEngine(
        tools=[Tool.wrap(search_web, name="search_web")],
        agents=[researcher],          # human can `retry researcher: <feedback>`
        store=store,                  # human can `store policy` to inspect
    ),
    name="supervisor",
)
result = supervisor("draft a policy brief on AI alignment")


# 2) Inside a pipeline — researcher → supervisor → writer.
pipeline = Agent(
    engine=Plan(
        Step(target=researcher, name=researcher.name),
        Step(target=supervisor, name=supervisor.name),
        Step(target=writer,     name=writer.name),
    ),
    name="release-pipeline",
)
pipeline("AI policy brief")


# 3) Scripted inputs for tests.
script = iter([
    "search_web('alignment 2026')",
    "retry researcher: focus on January-April 2026",
    "store policy",
    "continue Final brief approved.",
])

def scripted_input(prompt: str) -> str:
    return next(script)


supervisor_for_test = Agent(
    engine=SupervisorEngine(
        tools=[Tool.wrap(search_web, name="search_web")],
        agents=[researcher],
        store=store,
        input_fn=scripted_input,
    ),
)


# 4) Async UI — wire the prompt into a queue / websocket.
async def web_input(prompt: str) -> str:
    await my_queue.publish({"prompt": prompt})
    return await my_queue.await_response()


supervisor_web = Agent(
    engine=SupervisorEngine(
        tools=[Tool.wrap(search_web, name="search_web")],
        agents=[researcher],
        ainput_fn=web_input,
        timeout=600,
        default="continue",
    ),
    name="supervisor-web",
)
```

## Pitfalls

- **`input_fn` is called from a worker thread.** If it accesses thread-unsafe state (like `readline` history), guard it. Async callsites should prefer `ainput_fn`.
- **`agents=` expects v1 `Agent` instances.** Duck-typed objects work if they expose `__call__` / `run` and a `name` attribute, but the supervisor's `retry` command resolves by name — make sure the name matches what the human will type.
- **`timeout=None` (default) hangs unattended pipelines forever.** Always pair with `timeout=...` + `default=...` for any pipeline that might run without a human present.
- **Tool calls in the REPL go via `run_sync`.** Async tool functions are driven to completion automatically; this happens synchronously inside the REPL so the operator sees the result before the next prompt.
- **The REPL terminates only on `continue`.** Other commands loop. If your `input_fn` ever returns something the supervisor doesn't recognise, it prints help and re-prompts — scripted-input tests must end the iterator with `continue`.
- **Session events.** Like any engine, an Agent wrapping `SupervisorEngine` emits `AGENT_START` / `AGENT_FINISH` events via the session. **Plus**, the REPL emits one `HIL_DECISION` event per command — `kind` is one of `continue` / `retry` / `store` / `tool` / `unknown` / `empty`, `command` is the raw REPL input, and `result` (when present) is a brief of the outcome. Auditing a multi-step REPL session is a sequential read of those events.
- **`store=` is read-only from the REPL by default.** The `store <key>` command prints the value. Writes happen through tool calls or registered agents — there's no `store set <key> <value>` command (by design — keep mutations explicit and traceable).

## See also

- [DeduplicateGuard](https://core.lazybridge.com/guides/mid/dedup-guard/index.md) — if worker agents accumulate full conversation history across retry loops, attach this guard to strip repeated blocks before the LLM sees them.
- [HumanEngine](https://core.lazybridge.com/guides/mid/human-engine/index.md) — the lighter approval-only variant.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — typical container for a supervisor mid- pipeline.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — the inspection target for the supervisor's `store <key>` command.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `supervisor_agent(...)` vs `Agent(engine=SupervisorEngine(...))`.

# Agent.as_tool

Wrap an agent as a `Tool` that another agent can call. The default shape is the same as passing the agent directly into `tools=[...]`; the explicit form unlocks two extras: a different surface name than the agent's own `name=`, and a `verify=` judge-and-retry loop around every call.

## Signature

```python
agent.as_tool(
    name=None,                     # surface tool name; defaults to agent.name
    description=None,              # LLM-facing description; defaults to agent.description
    *,
    verify=None,                   # Agent or callable[[str], Any] — judge-and-retry gate
    max_verify=3,                  # max attempts when verify=...
) -> Tool
```

The returned `Tool` has the schema `(task: str) -> Envelope` (the framework sets `returns_envelope=True` automatically so engines roll up cost / token / latency metadata correctly through the call).

The implicit form — `Agent(tools=[other_agent])` — is canonical for the no-rename, no-verify case. See [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) for the exhaustive comparison.

## Synopsis

`as_tool` does three things, in order of how often you'll reach for them:

1. **Wraps an agent so it satisfies the `Tool` contract.** The wrapped tool's `name` becomes the LLM-facing tool name; the wrapped tool's `description` is what the model reads when deciding whether to call it.
1. **Optionally renames.** If you pass `name="alias"`, that's what the LLM sees; the source agent's own `name=` is unchanged. The alias is also what `from_agent("alias")` reads from the `Store` (see [Store](https://core.lazybridge.com/guides/mid/store/index.md)).
1. **Optionally adds a `verify=` judge.** Every invocation runs up to `max_verify` times; after each run the judge sees the output and replies `"approved"` (case-insensitive) or `"rejected: <reason>"`. On rejection the judge's feedback is threaded into the next attempt's task. This is the "Option B" placement — judge sits at the **tool-call boundary**, not around the agent's own engine.

The implicit form (`tools=[other_agent]`) covers (1). When you need (2) or (3), construct explicitly with `as_tool(...)`.

## When to use it

- **You want a different surface name than `agent.name=`.** The agent's own `name=` is the authoritative key for `Store` writes, graph nodes, and cost rollup. If you want the LLM to see a different name (e.g. `research` instead of `senior_researcher_v2`), pass `name="research"` to `as_tool`.
- **You want a different LLM-facing description.** Override `description=` to give the model a more focused or context-specific cue without changing the source agent.
- **You want a judge-gated call.** `verify=judge` runs the judge after every call, retrying up to `max_verify` times with the judge's feedback. Use this for high-stakes outputs where the parent agent should not see "first try, possibly wrong" results.
- **You want to expose `Agent.parallel(...)` as a single tool.** The `ParallelAgent` returned by `Agent.parallel(...)` already returns ONE Envelope from `__call__` / `run` since 0.7.9; its `as_tool()` just delegates so the outer agent reads the same labelled-text join the runner produces directly.

## When NOT to use it

- **For the simple "agent A calls agent B" case.** Just pass the agent: `Agent(tools=[other_agent])` is canonical and saves the noise. The framework wraps it the same way internally.
- **When the wrapped agent has structured output.** The default schema is `(task: str) -> str`. If the parent needs a typed payload from the child, orchestrate with `Plan` and `Step(target=child, output=Model)` instead — `Plan` preserves typed envelopes between steps; `as_tool` flattens them.
- **As a substitute for `verify=` on the parent agent.** `Agent(verify=judge)` wraps the parent's *whole run*; this is the right choice when the policy applies to every output. The per-tool `verify=` is for when only specific sub-tools need gating.

## Example

```python
from lazybridge import Agent, LLMEngine


def search(query: str) -> str:
    """Search the web for ``query`` and return the top three hits."""
    return "..."


researcher = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[search],
    name="senior_researcher_v2",
)
judge = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system='Respond "approved" or "rejected: <reason>".',
    ),
    name="judge",
)


# 1) Implicit — pass the agent directly. Tool name = "senior_researcher_v2".
orchestrator = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[researcher],
)


# 2) Explicit alias — tool name "research", source agent name unchanged.
orchestrator = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[
        researcher.as_tool(
            name="research",
            description="Find three high-quality sources for a topic.",
        ),
    ],
)


# 3) Verified call — judge gates every research invocation, up to two attempts.
orchestrator = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[
        researcher.as_tool(
            name="research",
            verify=judge,
            max_verify=2,
        ),
    ],
)
result = orchestrator("write a paragraph on bee population trends")
print(result.text())


# 4) Parallel fan-out as a single tool — wrapper Envelope already carries the join.
fan_out = Agent.parallel(researcher_us, researcher_eu, researcher_asia)
orchestrator = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    tools=[fan_out.as_tool(name="multi_region_research")],
)
```

## Pitfalls

- **Default schema is `(task: str) -> str`.** The wrapped agent's `output=Model` is **not** preserved in the tool's signature — the LLM sees a string contract regardless. For typed downstream consumption use a `Plan` step with `Step(output=Model)` instead.
- **`max_verify` is the upper bound, not the target.** A judge that's too strict can fail every attempt, costing `max_verify` full agent runs per call. Pick a defensible verdict prompt and keep `max_verify=2` or `3` unless you have a reason.
- **Verify retries cost tokens on both the child and the judge.** Each attempt is a full child-agent run plus a judge run. Use `verify=` only on calls that genuinely warrant the cost.
- **The alias is the Store key.** `as_tool(name="research")` makes `__agent_output__:research` the auto-write key — not `__agent_output__:senior_researcher_v2`. `from_agent("research")` reads it back. Keep aliases stable across runs that share a Store.
- **Long nested chains share one Session.** Pass `session=sess` on the outer agent only — inner agents (whether wrapped via `as_tool` or passed directly) inherit it. `usage_summary()` aggregates cost across the whole tree.
- **Pre-existing `as_tool()` callsites are still supported.** `tools=[agent.as_tool("name")]` and `tools=[agent]` are equivalent when the agent already has `name="name"`. Don't rewrite working code mechanically; prefer the implicit form for new code, and reach for `as_tool` when you actually want one of its three jobs.

## See also

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — when you want a fixed sequential pipeline rather than an LLM-directed dispatch.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — `Agent.parallel(...).as_tool()` folds scripted fan-out into a single tool.
- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — the surface every `as_tool()` call produces.
- *Guides → Full → verify=* (Phase 3) — verify around the parent agent's whole run versus per-tool gating.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — the rules for when to call `as_tool` explicitly versus relying on the implicit `tools=[agent]` form.

# Chain

The simplest multi-agent shape: run agents one after another, each agent's output becoming the next agent's task. A chain is a `Plan` of sequential `Step`s — `Agent.chain(...)` is the one-line sugar over the canonical form.

## Signature

```python
# Canonical — explicit Plan with one Step per agent
from lazybridge import Agent, LLMEngine, Plan, Step

pipeline = Agent(
    engine=Plan(
        Step(target=researcher, name=researcher.name),
        Step(target=editor,     name=editor.name),
        Step(target=writer,     name=writer.name),
    ),
    name="pipeline",
)


# Sugar — Agent.chain wraps the same Plan for you
pipeline = Agent.chain(researcher, editor, writer, name="pipeline")
```

The two forms produce structurally identical agents. Sugar saves the `Plan(Step(...))` boilerplate when there is no router, no parallel band, and no checkpoint. See [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) for the full comparison.

## Synopsis

A chain runs `N` agents sequentially. The first agent receives the input task; every subsequent agent receives the previous agent's `Envelope.text()` as its task (the default `from_prev` sentinel). The result is the last agent's `Envelope`.

Internally, both `Agent.chain` and the canonical form build the same `Plan` of `Step(target=..., name=...)` entries. `Plan` dispatches `Agent` targets via `target.run()` directly — no `tools=[...]` is needed and no extra `Tool` wrapping happens.

State on the outer agent (memory, session, guards, timeout, fallback) applies at the chain boundary. Each inner agent keeps its own.

## When to use it

- **Linear multi-agent pipelines** with text hand-offs: research → edit → write, extract → classify → summarise, plan → draft → revise.
- **Quick CrewAI-style sequential crews** where ordering is fixed and each agent's output is consumed verbatim by the next.
- **Adding session / memory / guards once at the top** of a small pipeline — they apply at the boundary; you don't have to plumb them into every step.

## When NOT to use it

- **Typed hand-offs.** `Agent.chain` carries the previous step's `text()` (a string) into the next step's task — Pydantic models flatten to JSON. If step *N* must consume a typed payload from step *N-1*, use a `Plan` with `Step(output=Model)` and a sentinel like `context=from_step("name").payload`.
- **Conditional routing.** Chains have no branches. If a step's output decides which agent runs next, use `Plan` with `Step(routes=...)` predicates or `Step(routes_by="field")`.
- **Crash resume.** Chains have no `checkpoint_key=`. For resumable pipelines use `Plan` directly with `store=` + `checkpoint_key=`.
- **Fan-out on the same task.** That's `Agent.parallel(...)` — see [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md).
- **LLM-directed dispatch.** When you want the *model* to choose which sub-agent to call, put the agents in `tools=[...]` of an outer `Agent`, not in a chain.

## Example

```python
from lazybridge import Agent, LLMEngine, Memory, Plan, Step


def search(query: str) -> str:
    """Search the web for ``query`` and return the top three hits."""
    return "..."


researcher = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    tools=[search],
    name="researcher",
)
editor = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="editor",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="writer",
)


# 1) Canonical — explicit Plan; what Agent.chain produces internally.
pipeline = Agent(
    engine=Plan(
        Step(target=researcher, name=researcher.name),
        Step(target=editor,     name=editor.name),
        Step(target=writer,     name=writer.name),
    ),
    name="pipeline",
    memory=Memory(strategy="auto"),
)
result = pipeline("AI trends April 2026")
print(result.text())


# 2) Sugar — same agent, two characters less.
pipeline_sugar = Agent.chain(
    researcher,
    editor,
    writer,
    name="pipeline",
    memory=Memory(strategy="auto"),
)
result = pipeline_sugar("AI trends April 2026")
print(result.text())
```

## Pitfalls

- **Typed outputs flatten to text.** A chain step that produces a Pydantic model passes its JSON serialisation to the next step's task — the downstream agent sees a string, not a typed payload. Use `Plan` with explicit sentinels (`context=from_step("name")`) to preserve types.
- **The outer agent's name is `"chain"` by default.** Set `name="…"` if you want it to appear distinctly in `Session.graph` or in cost rollup tables.
- **Memory on the chain wraps the whole pipeline**, not individual steps. If you want each inner agent to keep its own conversation history, give each its own `memory=Memory(...)` and pass nothing on the chain.
- **`Agent.chain(...).chain(...)` does not exist.** `Agent.chain` returns a regular `Agent` whose engine is a `Plan`; you can wrap it again only by passing it as a target to another `Plan`. The same agent shape composes recursively.
- **Errors propagate.** A failed step short-circuits the chain and the chain's envelope carries the error. Use a `fallback=` agent on the chain (or on individual steps via `verify=` semantics) if you want graceful degradation.

## See also

- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — fan-out instead of sequence.
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — wrapping an agent (or a chain) as a tool for an outer agent that decides when to invoke it.
- [Nested pipelines](https://core.lazybridge.com/guides/full/composition-patterns/index.md) — when a chain isn't enough: Plan-of-Plans, parallel bands of sub-pipelines, and LLM-decided dispatch over sub-pipelines (horizontal composition beyond the linear case).
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — the full table mapping `Agent.chain` and friends to their canonical `Plan(...)` equivalents.
- *Guides → Full → Plan* (Phase 3) — typed hand-offs, routing, checkpoints, and the `Step` surface in full.

# DeduplicateGuard

Removes repeated text blocks from the agent's input **before** they reach the LLM. Drop-in solution for the "recursive copy-paste" problem that inflates context windows in multi-agent dialogue chains.

## The problem it solves

In a multi-agent pipeline each agent typically prepends the full conversation history to the next agent's task. After a few hops the same dialogue turn can appear two, three, or ten times in the same context window:

```text
[Turn 1] User: Summarise the report.
[Turn 2] Assistant: The report says…
[Turn 1] User: Summarise the report.   ← duplicate
[Turn 2] Assistant: The report says…   ← duplicate
[Turn 3] Supervisor: Refine the summary.
```

Every duplicated block wastes tokens, degrades coherence, and can push useful content out of the context window. `DeduplicateGuard` detects and removes these blocks automatically — no prompt engineering needed.

## Quick start

```python
from lazybridge import Agent, DeduplicateGuard, LLMEngine

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    guard=DeduplicateGuard(),
)
```

The guard fires only on `check_input`. The LLM's output is never touched.

## Parameters

```python
DeduplicateGuard(
    similarity_chars: int = 60,   # prefix length for near-duplicate detection
    min_block_chars:  int = 40,   # blocks shorter than this are never removed
    verbose:          bool = True, # print a one-line summary when blocks removed
)
```

| Parameter          | Default | What it controls                                                                                                                                             |
| ------------------ | ------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| `similarity_chars` | `60`    | How many leading chars are used as a "fingerprint". Lower = more aggressive.                                                                                 |
| `min_block_chars`  | `40`    | Blocks shorter than this (in characters) are kept as-is and never considered for deduplication. Prevents removing short repeated phrases like "Yes" or "Ok". |
| `verbose`          | `True`  | Prints `[DeduplicateGuard] removed N block(s) — X → Y chars (-Z%)` to stdout. Set `False` in production.                                                     |

## How it works

### 1 — Block splitting

The guard splits the input into *blocks* using the first strategy that produces more than one block:

1. **`[Turn N]` markers** — dialogue turns in multi-agent history
1. **Double newlines** — paragraphs
1. **Single newlines** — individual lines

### 2 — Normalisation

Each block is normalised before comparison: whitespace collapsed, lowercased. This means `" Hello World "` and `"hello world"` are considered the same block.

### 3 — Fingerprint matching

A block is dropped if either condition holds:

- Its **full normalised form** matches a previously seen block (exact duplicate).
- Its **first `similarity_chars` characters** match a previously seen block (near-duplicate — catches blocks that share an intro but diverge slightly).

### 4 — Re-joining

The kept blocks are re-joined with the same separator as the original input (`\n`, `\n\n`, or no separator) so the cleaned text is a drop-in replacement.

## Tuning

### Dialogue chains (default settings work well)

```python
# Default: aggressively deduplicate [Turn N] markers
DeduplicateGuard()
```

### Long structured documents

If your context contains long sections (e.g., code blocks, JSON payloads) that legitimately start with the same prefix, raise `similarity_chars` to avoid false positives:

```python
DeduplicateGuard(similarity_chars=120)
```

### Short repeated phrases are meaningful

If your domain has many short repeated commands that should be preserved, raise `min_block_chars`:

```python
DeduplicateGuard(min_block_chars=80)
```

### Silent operation in production

```python
DeduplicateGuard(verbose=False)
```

## Combining with other guards

`DeduplicateGuard` slots into a `GuardChain` like any other guard. Run it **first** so downstream guards operate on the already-cleaned text:

```python
import re
from lazybridge import Agent, ContentGuard, DeduplicateGuard, GuardAction, GuardChain, LLMEngine

def no_pii(text: str) -> GuardAction:
    if re.search(r"[\w.+-]+@[\w-]+\.[\w.-]+", text):
        return GuardAction(allowed=False, message="Remove email addresses.")
    return GuardAction(allowed=True)

agent = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    guard=GuardChain(
        DeduplicateGuard(verbose=False),   # 1. clean duplicates first
        ContentGuard(input_fn=no_pii),     # 2. then check policy
    ),
)
```

## Example: supervisor chain

```python
from lazybridge import Agent, DeduplicateGuard, LLMEngine

researcher = Agent(engine=LLMEngine("claude-haiku-4-5"), name="researcher")
supervisor = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
    guard=DeduplicateGuard(),   # protect the supervisor from bloated history
    name="supervisor",
)

# Build a history string and pass it to the supervisor.
history = "\n".join([
    "[Turn 1] User: Summarise Q3 sales.",
    "[Turn 2] Researcher: Q3 revenue was $4.2M…",
    "[Turn 1] User: Summarise Q3 sales.",   # duplicated by the pipeline
    "[Turn 2] Researcher: Q3 revenue was $4.2M…",  # duplicated
])

result = supervisor(f"{history}\n[Turn 3] Supervisor: Refine this summary.")
# The guard removed 2 duplicate blocks before the LLM saw the input.
```

## The `deduplicate()` function

The underlying function is also available standalone if you want to clean text without attaching a guard to an agent:

```python
from lazybridge.dedup_guard import deduplicate

cleaned, n_removed = deduplicate(
    "[Turn 1] Hello\n[Turn 1] Hello\n[Turn 2] World",
    similarity_chars=60,
)
print(cleaned)    # "[Turn 1] Hello\n[Turn 2] World"
print(n_removed)  # 1
```

## Pitfalls

- **Not for semantic deduplication.** The guard uses exact and prefix matching, not embedding similarity. Two blocks that say the same thing in different words will both pass through. For semantic dedup, combine with an `LLMGuard` that detects content overlap.
- **Block splitting is heuristic.** If your input mixes `[Turn N]` markers and double-newline paragraphs in the same string, the guard uses `[Turn N]` markers and ignores paragraph breaks within a turn.
- **`min_block_chars` is measured on the original block, not the normalised form.** A block with lots of whitespace may be shorter after normalisation but still pass the length threshold and be eligible for deduplication.
- **The guard modifies the task string, not the message list.** If you are passing a structured `messages=` list directly to the engine rather than a plain string task, the guard does not inspect individual messages. Use it at the outermost agent boundary where the full history is concatenated into a single string.
- **`verbose=True` writes to stdout.** In production or in test suites that assert on stdout, set `verbose=False`.

## See also

- [Guards](https://core.lazybridge.com/guides/mid/guards/index.md) — guard protocol, `ContentGuard`, `LLMGuard`, `GuardChain`, and when to use hard gates vs soft verify.
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — `DeduplicateGuard` emits no session events (it rewrites silently); add your own `ContentGuard` with `metadata=` if you need observability of dedup decisions.
- [Reference → Guards](https://core.lazybridge.com/reference/guards/index.md) — full API surface.

# Dynamic graph (AgentPool + conclude)

Build a multi-agent system whose **topology is decided by the LLM at runtime**: agents delegate to each other by name, and any agent — however deeply nested — can end the whole task. This is the dynamic counterpart to a `Plan` (a fixed DAG you declare up front) and to plain `tools=[other_agent]` composition (a one-way, statically-wired tree).

An `AgentPool` is **not just a registry**. It is a *bounded local action space*: the set of specialists an agent inside it is allowed to reach directly. By giving selected agents access to *another* pool, you create **gateway agents** that connect one local action space to the next. A **pool chain** is therefore a dynamic workflow where the builder defines local worlds and transition points, while the runtime path emerges from agent decisions inside those bounded spaces.

Two primitives, both ordinary tools the engine does not special-case:

```python
from lazybridge import AgentPool, conclude

pool = AgentPool(max_depth=25)     # a local action space → a `route` tool
# conclude(message)                # non-local exit → returns to the top
```

## Plan vs AgentPool — static vs dynamic workflow runtime

The two composition runtimes are duals, not competitors:

- **`Plan` = static workflow runtime.** The topology is known up front. You declare the steps; the order is fixed, validated at construction, typed, and checkpointable.
- **`AgentPool` = dynamic workflow runtime.** The next step is selected at runtime. You declare *local action spaces* and *transition points*; the path emerges from agent decisions and ends when some agent calls `conclude`.

`AgentPool` does **not** replace `Plan`. Use `Plan` when the path must be known; use `AgentPool` when the path should emerge inside bounded local spaces; compose both when an outer lifecycle must stay deterministic while an inner phase explores. The rest of this guide develops the AgentPool half of that duality.

## Pools as local action spaces

Read a pool as the *local world* an agent can act in, not as a flat address book. Consider two pools that share one member:

```text
Pool 1: A, B, C, D
D has access to Pool 2
Pool 2: D, E, F, G
```

Interpretation:

- `A`, `B`, `C`, `D` can **self-organise inside Pool 1** — route to one another in whatever order the task demands.
- The system enters **Pool 2 only if `D` is selected** and chooses to call into it.
- `D` acts as a **gateway** between two local action spaces.
- The builder **did not define all edges**. The builder defined two local action spaces and a single transition point.

"Self-organising" here means only this: the route emerges at runtime inside builder-defined local action spaces. It is not an unbounded swarm — membership, `max_depth`, and `conclude` placement bound it.

## Signature

```python
class AgentPool:
    def __init__(self, *, max_depth: int = 25) -> None: ...
    def register(self, *agents: Agent) -> None: ...        # populate AFTER construction
    def roster(self) -> str: ...                            # one line per agent
    async def route(self, agent_name: str, task: str) -> str: ...
    def as_tool(self, name: str = "route") -> Tool: ...     # for tools=[...]

def conclude(message: str) -> str: ...                      # raises ConcludeSignal
```

`pool.as_tool()` returns a `Tool` named `route` with the schema `(agent_name: str, task: str) -> str`. `conclude` is a plain function; drop it straight into `tools=[conclude]`.

## Synopsis

`AgentPool` solves the **circular-reference problem**. Agents that delegate to each other can't all be passed into each other's `tools=[...]` at construction — the tool map is frozen in `Agent.__init__`. Instead, every agent references the *pool* (which already exists) via `pool.as_tool()`, and the pool references the agents through `register(...)`, called *after* they're built. At call time the LLM invokes `route("bob", "...")` and the pool dispatches to the agent registered under that name. Each `pool.as_tool(name)` an agent carries is one local action space it can reach.

`conclude` provides a **non-local exit**. In a nested call chain (`A → route → B → route → C`), `C` calling `conclude("done")` unwinds the entire chain in one step and returns `Envelope(payload="done")` from the *original* top-level `run()` — no need to thread the answer back up level by level. It is implemented as a `ConcludeSignal` (a `BaseException`) so it slips past the engine's tool-error handling; only `Agent.run` catches it. Nested invocations (`as_tool`, `AgentPool.route`, Plan agent-steps) run via an internal path that lets the signal keep propagating.

`max_depth` bounds routing recursion. Because routes can loop (`A → B → A → …`), the pool tracks call depth with a `contextvars` counter and, past `max_depth`, returns a "call conclude now" message instead of recursing — turning a would-be `RecursionError` into a graceful nudge.

## When to use it

- **The next agent should be chosen by the model, not the wiring.** A triage agent that routes to specialists, a debate between personas, a "society of mind" where workers hand off opportunistically.
- **Agents need to call each other (cycles).** `AgentPool` is the only composition path that expresses `A ⇆ B`; direct `tools=[...]` cannot.
- **Any node may finish the task early.** A worker that discovers the answer should not have to pass it back through every caller — it calls `conclude`.
- **You want layered routing.** Give different pools to agents at different levels (`pool.as_tool("ask_team")` vs `peers.as_tool("ask_peer")`) to scope which neighbours each level can reach — the basis for the gateway pattern below.

## When NOT to use it

- **The control flow is fixed.** If you already know the step order, use [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md) (deterministic, checkpointable, validated at construction) or [`Agent.chain`](https://core.lazybridge.com/guides/mid/chain/index.md). A dynamic graph trades that guarantee for flexibility.
- **One agent simply calls another.** Plain `tools=[other_agent]` (see [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md)) is canonical and needs no pool.
- **You need typed hand-offs between agents.** `route` returns text; for structured payloads between stages use a `Plan` with `Step(output=Model)`.

## Example

```python
from lazybridge import Agent, AgentPool, LLMEngine, conclude

pool = AgentPool()

researcher = Agent(
    name="researcher",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1,
                     system="Gather facts, then route to 'writer'."),
    tools=[pool.as_tool(), conclude],
)
writer = Agent(
    name="writer",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1,
                     system="Write the answer, then call conclude(...)."),
    tools=[pool.as_tool(), conclude],
)

pool.register(researcher, writer)        # register AFTER construction

result = researcher("Summarise 2026 AI-policy trends in 3 bullets.")
print(result.text())                     # whatever 'writer' passed to conclude(...)
```

`researcher` may `route("writer", ...)`; `writer` ends the task with `conclude(...)`, and its message surfaces from the top-level `researcher(...)` call — regardless of how deep the routing went.

Layered routing — one agent, two pools, distinct tool names:

```python
orchestrator = Agent(
    name="orchestrator",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[team.as_tool("ask_team"), peers.as_tool("ask_peer"), conclude],
)
```

## Gateway agents

A gateway agent is **not a special class**. It is an ordinary `Agent` that belongs to one local action space but also carries *another pool's* route tool. That second tool is its only extra authority — this preserves the LazyBridge rule that **everything remains a tool**. The gateway controls the transition between local action spaces by choosing whether, and when, to call into the next pool.

```python
discovery_pool = AgentPool(max_depth=8)
build_pool = AgentPool(max_depth=8)

gateway = Agent(
    name="gateway_to_build",
    description="Gateway from discovery to build.",
    engine=LLMEngine(
        "...",
        system=(
            "You decide whether discovery has enough stable information "
            "to move into build. Use the build pool only when requirements "
            "are clear enough to design, implement, or test."
        ),
        max_tool_calls_per_turn=1,
    ),
    tools=[
        discovery_pool.as_tool("ask_discovery_pool"),
        build_pool.as_tool("ask_build_pool"),
    ],
)

scout = Agent(..., tools=[discovery_pool.as_tool("ask_discovery_pool")])
analyst = Agent(..., tools=[discovery_pool.as_tool("ask_discovery_pool")])
critic = Agent(..., tools=[discovery_pool.as_tool("ask_discovery_pool")])

architect = Agent(..., tools=[build_pool.as_tool("ask_build_pool")])
implementer = Agent(..., tools=[build_pool.as_tool("ask_build_pool")])
tester = Agent(..., tools=[build_pool.as_tool("ask_build_pool"), conclude])

discovery_pool.register(scout, analyst, critic, gateway)
build_pool.register(gateway, architect, implementer, tester)
```

Notes:

- The gateway appears in **both local worlds conceptually** — it is registered in `discovery_pool` and in `build_pool`.
- Its actual authority comes from **the tools it carries**, not from any privileged status. Here it carries both pools' route tools.
- If you need *different* authority in each region — e.g. a forward-only hop into build versus a backward-only hop into discovery — use **separate agents** such as `gateway_to_build` and `gateway_to_discovery`, each carrying only the route tool appropriate to its direction.

## Reversible gateway agents

A gateway does not have to be one-way. Give one agent access to **both** pools and it becomes a *boundary controller* that can move in either direction:

```text
Pool 1: A, B, C, D
Pool 2: D, E, F, G
D has access to both Pool 1 and Pool 2
```

Interpretation — `D` is not just a forward gateway from Pool 1 to Pool 2. `D` is a **reversible boundary controller**. `D` can decide whether to:

- keep working inside Pool 1,
- transition into Pool 2,
- return from Pool 2 back to Pool 1,
- mediate between the two local action spaces,
- `conclude`, **only if `D` is explicitly intended to be a terminal agent**.

The model to hold onto:

```text
Pool                       = local action space
Gateway agent              = transition operator
Reversible gateway agent   = boundary controller between local action spaces
```

This creates a self-organising workflow that can move **forward and backward** between bounded local worlds. The builder still does not enumerate every edge — it defines local worlds, boundary controllers, and terminal conditions.

```python
discovery_pool = AgentPool(max_depth=8)
build_pool = AgentPool(max_depth=8)

boundary = Agent(
    name="boundary_manager",
    description=(
        "Boundary controller between discovery and build. "
        "Use discovery when the problem is unclear. "
        "Use build when requirements are stable enough to design or implement. "
        "Return to discovery when build exposes missing requirements or contradictions."
    ),
    engine=LLMEngine(
        "...",
        system=(
            "You are the reversible boundary manager between Discovery and Build. "
            "Use the discovery pool for clarification, evidence gathering, and critique. "
            "Use the build pool for architecture, implementation, and testing. "
            "Move back to discovery if build work reveals ambiguity. "
            "Do not conclude unless both sides have converged."
        ),
        max_tool_calls_per_turn=1,
    ),
    tools=[
        discovery_pool.as_tool("ask_discovery_pool"),
        build_pool.as_tool("ask_build_pool"),
    ],
)

discovery_pool.register(scout, analyst, critic, boundary)
build_pool.register(boundary, architect, implementer, tester)
```

> **Caution.** A reversible gateway is a **high-authority node**. Do not treat it as a generic specialist. It controls movement between local action spaces and can transport context across phases.

Design rules for reversible gateways:

1. Use a **shared** reversible gateway when the same cognitive role should mediate both directions.
1. Use **two separate** gateway agents when forward and backward transitions have different criteria.
1. Keep reversible-gateway prompts explicit about when to **stay, transition, return, or conclude**.
1. Keep dangerous tools out of generic shared agents.
1. If one role needs different authority in different pools, split it into two named agents.
1. Use `max_depth` to bound recursive routing.
1. Use `max_tool_calls_per_turn=1` for clearer dynamic paths.
1. Give `conclude` only to legitimate terminal roles.

## Pool chains as recombining state processes

It helps to read a pool chain as a **local-state process** — as an explanatory model, not a strict mathematical claim:

- A **pool** is a *macro-state*, or local action space.
- The **agents inside it** are *micro-states*, or local policy nodes — the actions available from that macro-state.
- **Gateway agents** are *transition operators* between macro-states.
- **`conclude`** is an *absorbing terminal state*.

A **one-way gateway** creates a **progressive, non-recombining** chain. A **reversible gateway** creates a **recurrent, recombining** chain, where execution can move forward into another local space or return to an earlier one when new evidence changes the task state.

```text
Progressive / non-recombining:

    Discovery Pool -> gateway_to_build -> Build Pool -> gateway_to_release -> Release Pool -> conclude

Recombining / recurrent:

    Discovery Pool <-> boundary_manager <-> Build Pool
    Build Pool <-> release_boundary <-> Release Pool
    Review can send the workflow back to Build.
    Build can send the workflow back to Discovery when requirements are incomplete.
```

```
stateDiagram-v2
    [*] --> Discovery
    Discovery --> Build: gateway_to_build
    Build --> Release: gateway_to_release
    Release --> [*]: conclude
    Build --> Discovery: boundary returns on ambiguity
    Release --> Build: review finds issue
```

> **Caveat.** This is *not* a strict Markov chain unless the full context, memory, store, and conversation history are treated as part of the state. In practice, the LLM policy selects the next route from the current local action space **conditioned on accumulated context** — so two visits to the same pool are not identical states.

Put plainly: pool chains are useful to think of as local-state processes. A pool is a macro-state, the agents inside it are available local actions, gateway agents are transition operators, and `conclude` is an absorbing terminal state. A one-way gateway creates a progressive chain. A reversible gateway creates a recurrent, recombining chain where execution can move forward, return, or self-correct as new evidence appears.

Practical implications:

- **Non-recombining chains** are easier to debug and closer to pipelines.
- **Recombining chains** are more adaptive and can self-correct.
- Recombining chains require **stronger prompts, lower `max_depth`, clear terminal agents, and route tracing**, because they can oscillate between pools without making progress.

## Why this is not LangGraph

The contrast is precise, not a value judgement:

- **LangGraph:** you define **nodes and edges** explicitly; conditional routes are still part of the graph design.
- **LazyBridge pool chains:** you define **local action spaces and gateway agents**; the runtime path **emerges** from agent decisions inside those bounded spaces.

This is not better in all cases:

- Use **`Plan`** when the path must be known, validated, typed, and checkpointed.
- Use **`AgentPool`** when the path should emerge dynamically inside bounded local spaces.
- **Compose both** when needed.

| `Plan` (static)         | `AgentPool` (dynamic)         |
| ----------------------- | ----------------------------- |
| known path              | runtime path                  |
| compile-time validation | cyclic delegation             |
| typed hand-off          | local action spaces           |
| checkpoint-friendly     | gateway transitions           |
| deterministic lifecycle | `conclude`-driven termination |

## Structural scoping is not formal permission enforcement

A pool limits what an agent can route to **directly**. It does *not* enforce a transitive permission policy:

- A pool limits the *direct* routes available to its members.
- An agent reachable *through* that pool may itself carry access to **another** pool.
- This is **intentional** when that agent is a gateway.
- It must therefore be **designed deliberately** — pool topology is *structural control*, not a complete permission system.

Keep this distinction: pool membership is structural scoping, not formal policy enforcement. There is no runtime check that an agent "should not" have reached a given pool — reachability follows entirely from the route tools each agent carries.

**Good:**

- `D` is explicitly named and documented as the gateway from Discovery to Build, with clear instructions for when to transition and when to stay.
- `boundary_manager` has access to both Discovery and Build **because it is explicitly designed** as a reversible boundary controller.

**Bad:**

- A generic shared `critic` accidentally carries access to an admin pool — an unintended transition path nobody designed.
- A reversible gateway has vague instructions and keeps bouncing between pools until `max_depth` stops it.

## Design rules for pool chains

1. Treat every pool as a **bounded local world**.
1. Treat agents with access to another pool as **gateways**.
1. **Name** gateway agents explicitly.
1. Put gateway intent in the agent **description and system prompt**.
1. Give `conclude` only to **legitimate terminal agents**.
1. Use `max_tool_calls_per_turn=1` for clearer dynamic paths.
1. Use `max_depth` to bound recursive delegation.
1. Keep shared agents minimal.
1. If the same role needs different authority in different pools, **split it into two named agents**.
1. Use reversible gateways only when **returning to a prior local space** is genuinely useful.
1. Add **progress criteria** for reversible gateways to reduce oscillation.
1. Use `Plan` around pool chains when an outer lifecycle must stay deterministic.

## When not to use this pattern

- Do **not** use pool chains when every step must be known upfront.
- Do **not** use pool chains when typed hand-off between every stage is required.
- Do **not** use pool chains for irreversible side effects unless gateway and terminal agents are tightly controlled.
- Do **not** use reversible gateways when oscillation would be worse than failing fast.
- **Use `Plan`** around or inside the pool chain when deterministic lifecycle boundaries are needed.

## Pitfalls

- **Always set `max_tool_calls_per_turn=1` on members.** Without it the model can emit several `route`/`conclude` calls in one turn and the graph *branches* (every call still runs, just concurrently — `max_parallel_tools` bounds concurrency, **not** the number of calls). One call per turn keeps a single, traceable path.
- **`conclude` is not instantaneous if it shares a turn with other tools.** Same-turn siblings run to completion first (they execute via `asyncio.gather`), so a slow sibling delays the exit. `max_tool_calls_per_turn=1` removes the issue entirely.
- **Register after construction.** Agents take `pool.as_tool()`; the pool takes the agents via `register(...)`. Calling `route` for an unregistered name returns an "Unknown agent" message (not an error) so the model can recover.
- **`route` returns text, not a typed envelope.** Nested cost/token metadata is still rolled up, but structured payloads are flattened to `.text()`. Use a `Plan` step when you need a typed hand-off.
- **Context inflation in nested or cyclic routing.** Each hop typically passes the full conversation history as the task string. In a deep or cyclic chain (`A → B → A → …`) the same dialogue turns can appear multiple times inside a single context window, wasting tokens and degrading coherence. Add a [`DeduplicateGuard`](https://core.lazybridge.com/guides/mid/dedup-guard/index.md) to any agent that receives accumulated history:

```python
from lazybridge import DeduplicateGuard

worker = Agent(
    name="worker",
    engine=LLMEngine("claude-haiku-4-5"),
    guard=DeduplicateGuard(),   # strip repeated turns before the LLM sees them
    tools=[pool.as_tool(), conclude],
)
```

- **`max_depth` is per-pool.** Each pool counts its own routing depth via an independent `contextvars` counter; in a layered setup each pool bounds only its own recursion. Cross-pool cycles are still bounded, but more loosely — keep `max_depth` low on recombining chains.
- **`conclude` inside a `Plan` unwinds the whole plan.** A plan step that concludes skips the remaining steps and returns to the top-level `pipeline.run()` — it does not just end that step.

## See also

- [Pool chains](https://core.lazybridge.com/guides/mid/pool-chain/index.md) — a full worked example: three local worlds connected by gateway agents, plus a reversible-boundary variant.
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — static one-way `agent → agent` composition.
- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) / [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — fixed, deterministic control flow when the topology is known up front.
- [DeduplicateGuard](https://core.lazybridge.com/guides/mid/dedup-guard/index.md) — strip repeated history blocks from task strings in nested or cyclic routing chains.
- [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md) — why `route` and `conclude` need no special engine support.
- [Multi-agent graphs](https://core.lazybridge.com/reference/multi-agent/index.md) — API reference for `AgentPool`, `conclude`, `ConcludeSignal`.

# Evals

A thin pytest-shaped harness for testing an agent's *output behaviour*. Define cases, run them through the agent, get a pass/fail report. Pair deterministic checks (`exact_match`, `contains`) with an LLM-judge for grading subjective outputs.

## Signature

```python
from lazybridge.ext.evals import (
    EvalCase,
    EvalSuite,
    EvalReport,
    EvalResult,
    # built-in checks
    exact_match,
    contains,
    not_contains,
    min_length,
    max_length,
    llm_judge,
)


EvalCase(
    input,                         # str — task to feed the agent
    check,                         # callable: (output) or (output, expected) -> bool
    expected=None,                 # optional metadata for the check
    description="",                # human-readable label for the report
)


EvalSuite(*cases)
suite.run(agent)                   # returns EvalReport
await suite.arun(agent)            # async variant


EvalReport(results)
  .total                           # int
  .passed                          # int
  .failed                          # int
  .errors                          # int (checks that raised)
```

### Built-in check builders

| Builder                      | Returns a check that                            |
| ---------------------------- | ----------------------------------------------- |
| `exact_match(expected)`      | `True` iff the output equals `expected`         |
| `contains(substring)`        | `True` iff `substring in output`                |
| `not_contains(substring)`    | `True` iff `substring not in output`            |
| `min_length(n)`              | `True` iff `len(output) >= n`                   |
| `max_length(n)`              | `True` iff `len(output) <= n`                   |
| `llm_judge(agent, criteria)` | `True` iff the judge agent replies `"approved"` |

## Synopsis

`EvalSuite` is a deterministic test harness for "given this input, the agent must produce an output that satisfies this check". Each `EvalCase` carries an input string, a check callable, and an optional `expected` value (carried for reporting, not enforced — the check closes over its expected value).

The suite feeds each input to the agent, captures `Envelope.text()`, and runs `check(output)`. A check that returns `False` counts as a failure; one that raises counts as an error. Both surface in the `EvalReport` so you can see the difference between "agent produced an unexpected answer" and "the check itself crashed".

`llm_judge(judge, criteria)` returns a check that calls the judge agent with the candidate output and the criteria; it passes only when the judge replies with a string starting with `"approved"` (case-insensitive).

## When to use it

- **Behaviour regression.** "After this prompt change, do my five canonical inputs still produce answers that mention the right city / contain the right phrase / pass the policy judge?"
- **CI gates on agent quality.** Run an `EvalSuite` against a staging agent before promotion; gate on `report.passed == report.total`.
- **Subjective grading at scale.** When deterministic substring checks aren't enough, an `llm_judge` lets you encode policies in English ("must be a poem of at least 4 lines mentioning bees") and grade in batch.

## When NOT to use it

- **Unit-testing internal helpers.** That's pytest's job. Use `EvalSuite` when the unit under test is the agent's *response*, not a function.
- **Runtime gating of every call.** `EvalSuite` is for offline / CI batches. For live "judge every output and retry if rejected" semantics use [`verify=`](https://core.lazybridge.com/guides/mid/verify/index.md).
- **High-frequency probes.** Each case is a full agent run. If you need millisecond-level checks, write a deterministic check outside the LLM path.

## Example

```python
from lazybridge import Agent, LLMEngine
from lazybridge.ext.evals import (
    EvalCase,
    EvalSuite,
    contains,
    llm_judge,
)


bot = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system="You are a helpful assistant.",
    ),
)
judge = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system='Respond "approved" or "rejected: <reason>".',
    ),
    name="judge",
)


suite = EvalSuite(
    EvalCase(
        "What's the capital of France?",
        check=contains("Paris"),
    ),
    EvalCase(
        "Write a poem about bees.",
        check=llm_judge(
            judge,
            "Output must be a poem of at least 4 lines mentioning bees.",
        ),
    ),
    EvalCase(
        "hello",
        check=lambda out: len(out) < 500,
        description="brevity check",
    ),
)


report = suite.run(bot)
print(f"{report.passed}/{report.total} passed ({report.passed / report.total:.0%})")

# In CI:
assert report.passed == report.total, [
    r.case.input for r in report.results if not r.passed
]
```

## Pitfalls

- **`llm_judge` costs tokens on every case.** Use a cheap-tier agent as the judge (`claude-haiku-...` / `gpt-4o-mini` / equivalent). One judge across all cases is fine; per-case judges multiply the cost.
- **Evals see `Envelope.text()`, not the typed payload.** When testing a structured-output agent (`output=PydanticModel`) the check receives the JSON serialisation of the payload, not the model instance. Write check predicates against the JSON shape accordingly.
- **`EvalSuite.run` is synchronous.** It uses `asyncio.run`-style internals to drive the agent, so calling it inside an existing event loop fails. Use `await suite.arun(agent)` in async test harnesses (pytest-asyncio, FastAPI startup hooks, etc.).
- **A check that raises is an error, not a failure.** The report separates the two — `report.errors` is for checks that crashed, `report.failed` is for checks that returned `False`. When surfacing the report in a log, surface both counters.
- **`expected=` is metadata only.** It's stored on the case for reporting; the check is responsible for using it. Most built-in builders close over the expected value at construction time (`contains("Paris")` already encodes "Paris" in the closure).

## See also

- [verify=](https://core.lazybridge.com/guides/mid/verify/index.md) — runtime sibling: rather than gating in CI, the judge runs on every call and retries the agent up to `max_verify` times with feedback.
- [Guards](https://core.lazybridge.com/guides/mid/guards/index.md) — hard-gate filtering at run time; use guards for "this output must never pass" and evals for "this output should usually be good".
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — the surface every eval runs against.
- *Guides → Full → Testing (MockAgent)* (Phase 3) — deterministic agent doubles for unit tests of orchestration code that *contains* an agent (rather than testing the agent's response itself).

# Guards

Hard input / output filters that run **before** and **after** the engine. A blocked input never reaches the engine; a blocked output is replaced with an error envelope. Compose cheap deterministic guards with an LLM-as-judge fallback for what regex can't see.

## Signature

```python
from lazybridge import Guard, ContentGuard, GuardChain, LLMGuard, GuardAction


# The protocol every guard satisfies.
class Guard:
    async def acheck_input(self, text: str) -> GuardAction
    async def acheck_output(self, text: str) -> GuardAction


# Verdict object.
GuardAction(
    allowed=True,                 # False blocks the run
    message=None,                 # error message when blocked
    modified_text=None,           # rewrite the input or output text
    metadata={},                  # opaque dict carried into the event log
)


# Built-ins.
ContentGuard(
    input_fn=None,                # callable(text) -> GuardAction (input gate)
    output_fn=None,               # callable(text) -> GuardAction (output gate)
)

GuardChain(*guards)               # first blocker wins; short-circuits on allowed=False

LLMGuard(
    judge,                        # an Agent — its run() decides
    policy,                       # str describing what to allow / reject
    *,
    timeout=60.0,                 # deadline for the judge; None = unbounded
)


class GuardError(Exception):      # raised by some integrations on hard policy failure
    ...
```

Pass a guard to an `Agent` via `guard=...`. To stack multiple, wrap them in a `GuardChain`.

## Synopsis

A guard is a hard gate. `acheck_input` runs **before** the engine — if it returns `allowed=False`, the engine is never invoked and the agent returns an error envelope. `acheck_output` runs **after** the engine on `Envelope.text()` — if it blocks, the payload is replaced with an error envelope (type `GuardBlocked`).

Either gate can also **rewrite** instead of blocking: returning `GuardAction(allowed=True, modified_text="…")` from `acheck_input` replaces the engine's task; the same on `acheck_output` replaces the payload string.

`GuardChain` runs guards in order and short-circuits on the first `allowed=False`. The convention is **cheap first, LLM last**: a regex or substring `ContentGuard` runs in microseconds, an `LLMGuard` only fires when the cheap layer didn't decide. Saves tokens.

`Agent.stream()` enforces guards too — `acheck_input` runs before the first token; a blocked task raises `ValueError` instead of silently streaming.

## When to use it

- **Compliance / safety policies** that must hold regardless of the engine's behaviour. The agent literally can't bypass a guard because it never sees blocked inputs.
- **Layered defence in depth.** Pair a deterministic regex guard (cheap, fast, false-positive-prone) with an `LLMGuard` (slow, more nuanced) so the LLM only adjudicates the hard cases.
- **Output redaction.** Use a `ContentGuard(output_fn=...)` that returns `GuardAction(allowed=True, modified_text=redacted)` to mask PII before the user ever sees the payload.
- **Streaming workflows where you must enforce at first byte.** `acheck_input` runs synchronously before streaming begins.

## When NOT to use it

- **Soft preferences ("the model should generally avoid X").** Guards are hard gates with no feedback loop — once blocked, the run ends. For "try again with feedback" semantics, use `verify=` (Phase 3).
- **Conversation-level rules that need history context.** A guard sees only the current task / output text, not memory. Use a `verify=` agent or a custom engine wrapper for stateful policies.
- **Performance-critical paths where every microsecond counts.** A regex `ContentGuard` is essentially free; an `LLMGuard` adds a judge call. Profile before stacking too many layers.

## Example

```python
import re

from lazybridge import (
    Agent,
    ContentGuard,
    GuardAction,
    GuardChain,
    LLMEngine,
    LLMGuard,
)


# 1) Cheap regex guard — block input mentioning email addresses.
def no_emails(text: str) -> GuardAction:
    if re.search(r"[\w.+-]+@[\w-]+\.[\w.-]+", text):
        return GuardAction(
            allowed=False,
            message="Remove email addresses before submitting.",
        )
    return GuardAction(allowed=True)


# 2) LLM-as-judge for harder policy violations.
judge = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system='Respond "approved" or "rejected: <reason>".',
    ),
    name="judge",
)


# 3) Compose: cheap first, LLM last.
guard = GuardChain(
    ContentGuard(input_fn=no_emails),
    LLMGuard(
        judge,
        policy="Reject outputs that contain medical advice.",
        timeout=10.0,
    ),
)


bot = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    guard=guard,
    name="bot",
)


# Blocked by the cheap regex — engine is never invoked.
result = bot("my email is foo@bar.com, what's the weather?")
assert not result.ok
print(result.error.type, result.error.message)


# 4) Output rewrite — redact PII before the caller sees the payload.
def redact_phone_numbers(text: str) -> GuardAction:
    redacted = re.sub(r"\b\d{3}[-.\s]?\d{3}[-.\s]?\d{4}\b", "[REDACTED]", text)
    if redacted != text:
        return GuardAction(allowed=True, modified_text=redacted)
    return GuardAction(allowed=True)


sanitiser = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    guard=ContentGuard(output_fn=redact_phone_numbers),
)
```

## Pitfalls

- **A guard that raises aborts the run.** Always return `GuardAction(allowed=False, message=str(e))` on internal errors; letting an exception escape produces an unrecoverable failure instead of a structured rejection.
- **`LLMGuard.timeout` is honoured on both sync and async paths.** The sync path uses a daemon thread + `join(timeout=)`; the async path uses `asyncio.wait_for`. On timeout the guard fails closed (blocked). `timeout=None` restores unbounded behaviour — only do this if you trust your judge to be fast.
- **`LLMGuard` costs tokens on every call.** Order it last in a `GuardChain` so the cheap layer catches the obvious cases first.
- **Guards see `Envelope.text()`, not the typed payload.** If you're using structured output (`output=PydanticModel`), the output guard operates on the JSON serialisation. Check string content accordingly.
- **`modified_text` on output replaces the payload string** but does not re-validate against `output=`. If you redact a structured output's JSON, the consumer may receive invalid JSON; prefer to redact within the model's `model_dump()` output instead.
- **`GuardChain` short-circuits on the first `allowed=False`.** Subsequent guards never run, including their side-effect-free observations. Don't rely on a downstream guard to "also see" the blocked input.
- **Streaming respects guards.** `agent.stream(task)` raises `ValueError` on a blocked input rather than yielding silently. Catch it explicitly in streaming UIs.

## See also

- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — guard outcomes are emitted as events (`metadata` from `GuardAction` is preserved on the event payload).
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `guard=` is a first-class kwarg alongside `tools=`, `memory=`, `output=`.
- *Guides → Full → verify=* (Phase 3) — different placement: a judge wraps the agent's *output* with a retry feedback loop, rather than acting as a hard gate.

# HumanEngine

A drop-in replacement for `LLMEngine` whose "model" is a person at a terminal (or a custom UI). The agent prompts the human, receives their input, and treats it as the engine's response. Use it as an approval gate or as a structured form filler.

## Signature

```python
from lazybridge import Agent
from lazybridge.ext.hil import HumanEngine, human_agent

# Canonical — Agent + HumanEngine
HumanEngine(
    *,
    ui="terminal",                 # "terminal" | "web" | _UIProtocol
    timeout=None,                  # seconds; on expiry triggers default= or raises TimeoutError
    default=None,                  # str returned on timeout
)

agent = Agent(
    engine=HumanEngine(timeout=120, default="no comment"),
    output=Review,                 # Pydantic model → field-by-field prompt
    name="reviewer",
)


# Sugar — same agent, less plumbing
agent = human_agent(
    timeout=120,
    default="no comment",
    output=Review,
    name="reviewer",
)
```

The sugar `human_agent(...)` lives in `lazybridge.ext.hil` and forwards engine kwargs (`ui`, `timeout`, `default`) to `HumanEngine` and the rest (`output=`, `name=`, `session=`, …) to `Agent`. See [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) for the exhaustive comparison.

> **Status: ext.** Available out of the box once `lazybridge` is installed; lives under `lazybridge.ext.hil` to respect the core/ext import boundary.

## Synopsis

`HumanEngine` implements the same `Engine` protocol as `LLMEngine`, so `Agent(engine=HumanEngine())` swaps in cleanly anywhere an LLM agent fits. The terminal UI prompts the human with the task, captures the typed response, and returns it as the `Envelope.payload`. When `output=` is a Pydantic model, the prompt goes field-by-field instead of asking for a single string.

There are two HIL engines in `lazybridge.ext.hil`:

- **`HumanEngine`** — approval gate / form filler. The human types a string (or fills fields). No tool calls, no agent retries — this engine is the lightweight variant.
- **`SupervisorEngine`** — full REPL with tool dispatch, agent retries, and store inspection. Lands in Phase 3 (Full tier); reach for it when the human needs to *do work*, not just decide.

## When to use it

- **Approval gates in pipelines.** Drop a `HumanEngine` agent into a `Plan` step or `Agent.chain` between drafting and finalising; the pipeline halts until the human types a verdict.
- **Structured human review.** When `output=` is a Pydantic model, the terminal UI prompts field-by-field — useful for review forms (rating, comment, approved boolean) without writing a per-field prompt loop.
- **Manual data entry inside an agent flow.** Sometimes the cheapest "tool" is a human: an agent that needs an OAuth code, a CAPTCHA solution, or a domain expert's call.
- **Tests / demos that need deterministic input.** Pass a custom `ui=` adapter implementing `prompt(task, *, tools, output_type) -> str` to script the human side.

## When NOT to use it

- **The human needs to call tools.** `HumanEngine` does not dispatch tools — the human can only type a raw string. Use `SupervisorEngine` (Full tier) for that.
- **Long-running async workflows where blocking on input is wrong.** The terminal UI blocks the current process. For web apps, supply a custom `ui=` adapter that wires into your event system (queue, websocket, …).
- **As a substitute for `verify=`.** `HumanEngine` is the engine itself, not a judge wrapping an LLM agent's output. If you want "LLM produces output → human approves before returning", combine an LLM agent inside a `Plan` with a human approval step — see [as-tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) and [verify=](https://core.lazybridge.com/guides/mid/verify/index.md) for policy gating.

## Example

```python
from pydantic import BaseModel

from lazybridge import Agent, LLMEngine, Plan, Step
from lazybridge.ext.hil import HumanEngine


class Review(BaseModel):
    approved: bool
    comment: str
    rating: int                    # 1..5


# 1) Standalone — a single agent that prompts a person.
reviewer = Agent(
    engine=HumanEngine(timeout=120, default="no comment"),
    output=Review,
    name="reviewer",
)
result = reviewer("draft #42 — please review and rate")
if result.payload.approved:
    print("✓ approved", result.payload.comment)


# 2) Inside a pipeline — draft → review → finalise.
drafter = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="drafter",
)
finaliser = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="finaliser",
)

pipeline = Agent(
    engine=Plan(
        Step(target=drafter,   name=drafter.name),
        Step(target=reviewer,  name=reviewer.name),
        Step(target=finaliser, name=finaliser.name),
    ),
    name="release-pipeline",
)
pipeline("draft the v1.2 release notes")


# 3) Custom UI adapter for a web app — the prompt callable resolves
#    when the user submits a form.
class WebUI:
    def __init__(self, queue):
        self._queue = queue

    async def prompt(self, task, *, tools, output_type):
        await self._queue.publish({"task": task, "schema": output_type})
        return await self._queue.await_response()


reviewer_web = Agent(
    engine=HumanEngine(ui=WebUI(my_queue), timeout=600),
    output=Review,
    name="reviewer-web",
)
```

## Pitfalls

- **The terminal UI blocks the current process.** This is intended — a synchronous human-in-the-loop step shouldn't race the rest of the agent's tool loop. For non-blocking flows supply a custom `ui=` adapter or run the engine on a worker thread.
- **`timeout` uses the event loop, not signals.** It works in async contexts but may hang in tightly-blocking sync nests (a custom `ui` that calls `input()` inside a synchronous-only callsite). Pair with an `ainput_fn` adapter when you need cancellation.
- **`output=Model` switches the terminal UI to per-field prompting.** Without `output=`, the human types one free-form string. The model class is the trigger — there is no separate "form mode" flag.
- **`HumanEngine` is not a judge.** It produces the output, it doesn't grade an LLM's. To grade an LLM's output with a human in the loop, run the LLM in one step and a `HumanEngine` agent in the next; or use a custom callable as `verify=`.
- **`default=` is only applied on timeout.** If the human enters an empty string, that empty string is the response. If you need empty-input handling, validate after the call (or use a Pydantic `output=` model with a non-empty constraint).

## See also

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — typical pattern for inserting `HumanEngine` mid-pipeline.
- [Guards](https://core.lazybridge.com/guides/mid/guards/index.md) — hard input/output gates that don't need a human; complementary, not redundant.
- [verify=](https://core.lazybridge.com/guides/mid/verify/index.md) — judge-and-retry placement when an LLM judges instead of a human.
- *Guides → Full → SupervisorEngine* (Phase 3) — the heavier cousin: a full REPL with tools, agent retry, and store inspection.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `human_agent(...)` vs `Agent(engine=HumanEngine(...))`.

# Memory

Conversation history that survives multiple `agent(task)` calls. Per-agent by default, shareable across agents, automatically bounded by a sliding-window or LLM-summary compression strategy.

## Signature

```python
from lazybridge import Memory

Memory(
    *,
    strategy="auto",               # "auto" | "sliding" | "summary" | "none"
    max_tokens=4000,               # token budget that triggers compression
    max_turns=1000,                # hard backstop on retained turns
    store=None,                    # reserved — durable persistence (1.1+)
    summarizer=None,               # Agent or callable used by strategy="summary"
    summarizer_timeout=30.0,       # deadline for async summarisers (None = unbounded)
)

# Methods
mem.add(user, assistant, *, tokens=0)   # append a turn
mem.messages()                          # list[Message] for the LLM
mem.text()                              # current view as a plain string (live)
mem.clear()                             # wipe everything in process
```

Pass to an `Agent` via `memory=mem` (private to that agent) or `sources=[mem]` (live read-only view shared across agents).

### Strategies

| `strategy=` | What it does                                                                                        | When                                                      |
| ----------- | --------------------------------------------------------------------------------------------------- | --------------------------------------------------------- |
| `"auto"`    | Sliding window + summary of older turns once `max_tokens` is exceeded                               | General chat, the safe default                            |
| `"sliding"` | Drops oldest turns whenever > 10 are retained; works without `max_tokens`                           | Cheap, lossy, no LLM cost                                 |
| `"summary"` | Compresses whenever > 10 turns are retained, using `summarizer=` (or a keyword-extraction fallback) | Higher fidelity at the cost of summariser tokens          |
| `"none"`    | Never compress; only `max_turns` bounds the buffer                                                  | You want full history and have explicit control over size |

## Synopsis

`Memory` is "what the model should see in the next prompt". It carries conversation continuity across calls to the same agent (or across several agents that share the instance). The default `"auto"` strategy keeps the memory bounded without any tuning — sliding window first, LLM summary of the older turns once the token budget is exceeded.

`Memory` is **not** durable. The whole buffer lives in the agent's process and disappears when the process exits. For state that must survive a crash or be shared across processes, use [Store](https://core.lazybridge.com/guides/mid/store/index.md).

## When to use it

- **Multi-turn conversations** with a single agent. Without `Memory`, every `agent(task)` call starts fresh; with it, the model sees the recent history.
- **Cross-agent shared context** when you want a judge or monitor agent to read the live conversation without writing to it. Pass the same `Memory` to the chat agent's `memory=` and the judge agent's `sources=[mem]`.
- **Bounded buffers in long-running interactive sessions** — the default `"auto"` strategy keeps token usage from growing without bound, and you don't have to do the trimming yourself.

## When NOT to use it

- **Durable cross-run state.** `Memory` doesn't survive process exit and isn't shared across machines. Use `Store` instead.
- **Pipeline data passing.** A `Plan` step's output flows to the next step via the envelope and sentinels (`from_prev`, `from_step("…")`), not via memory. Memory is for conversational context, not workflow state.
- **Structured-output retry loops.** When LazyBridge re-prompts the agent to fix an invalid structured-output payload, those correction turns are *not* added to memory — and neither should you add them manually.

## Example

```python
from lazybridge import Agent, LLMEngine, Memory


# 1) Default "auto" strategy — sliding window + summary fallback.
chat_memory = Memory(
    strategy="auto",
    max_tokens=3000,
)
chat = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    memory=chat_memory,
    name="chat",
)

chat("hi, I'm Marco")
result = chat("what's my name?")
print(result.text())                  # "Marco"

print(chat_memory.text())             # current compressed view


# 2) "summary" strategy with a cheap summariser.
summariser = Agent(
    engine=LLMEngine(
        "claude-haiku-4-5-20251001",
        system="Summarize conversations concisely.",
    ),
)
high_fidelity_memory = Memory(
    strategy="summary",
    summarizer=summariser,
    summarizer_timeout=15.0,
)


# 3) Sharing live conversation across agents — chat writes, judge reads.
chat = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    memory=chat_memory,
    name="chat",
)
judge = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system="Grade the assistant's last reply on helpfulness 1-5.",
    ),
    sources=[chat_memory],            # read-only live view
    name="judge",
)

chat("explain LazyBridge in one sentence")
print(judge("grade the last turn").text())
```

## Pitfalls

- **`strategy="summary"` without a `summarizer=`** falls back to keyword extraction — bounded, but lossy. Pass a cheap agent for production-quality summaries.
- **`memory.clear()`** wipes everything *including* the in-process summary; it does not persist across restarts. For durable memory use `Store`.
- **`max_turns`** is a hard backstop, not the primary compression knob. When it fires you get a one-shot warning — that's the signal to switch from `strategy="none"` to `"auto"`.
- **`summarizer_timeout=None`** restores the legacy unbounded behaviour. Use it only if you trust your summariser to be fast and reliable; otherwise a stuck summariser will block every `add(...)` call indefinitely.
- **`memory.text()` is a live read** — every call re-materialises the current view. Don't snapshot and cache it; if you need a stable reference for diagnostics, copy the string.
- **Sync summarisers can't be cancelled mid-call.** Only async summarisers (`async def` or returning a coroutine) honour `summarizer_timeout`. On timeout the keyword fallback runs.
- **Compression happens outside the internal lock**, so concurrent `add()` calls keep progressing while a slow summariser is in flight. This means a memory snapshot taken *during* compression may reflect the pre-compression view; that's intentional, but worth knowing if you're debugging.

## See also

- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — the durable counterpart for cross-process state.
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — observability of `agent(task)` events; separate from memory.
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — the consumer (`memory=` for write access, `sources=[mem]` for live read).

# Multimodal (image / audio)

Pass images and audio clips alongside the text task. The agent forwards them to the underlying provider as native content blocks; the provider returns text (or a structured payload) the same way it does for plain text turns. Capability checking is automatic — you can either silently drop unsupported attachments (default) or raise.

## Signature

```python
result = agent(
    task,                              # str — the textual prompt
    images=[...],                      # list[str | Path | bytes | dict | ImageContent] | None
    audio=...,                         # str | Path | bytes | dict | AudioContent | None
)

# Async + streaming forms accept the same kwargs
result = await agent.run(task, images=[...], audio=...)
async for chunk in agent.stream(task, images=[...], audio=...):
    ...

# Strongly-typed content blocks (when you need control over media_type)
from lazybridge import ImageContent, AudioContent

ImageContent.from_url("https://example.com/cat.jpg")
ImageContent.from_path("/tmp/diagram.png")          # auto-detects media_type
ImageContent.from_bytes(buf, media_type="image/jpeg")
ImageContent.from_data_uri("data:image/png;base64,iVBOR…")

AudioContent.from_url("https://example.com/clip.mp3")
AudioContent.from_path("/tmp/voice.wav")
AudioContent.from_bytes(buf, media_type="audio/wav")
```

`Envelope.images` and `Envelope.audio` carry the (coerced) attachments from input through the run. In a `Plan`, attachments only ride on the **first step**; downstream steps see text and structured payloads, not the original media.

## Synopsis

LazyBridge handles three things automatically:

- **Coercion.** A bare URL string, a `Path`, raw `bytes`, or a `dict` ({"url": …} / {"base64_data": …, "media_type": …}) is turned into the appropriate `ImageContent` or `AudioContent` for you. Pass typed blocks only when you need to override the auto-detected MIME type.
- **Capability gating.** Each provider knows which of its models accept vision / audio. By default an unsupported attachment is dropped with a single `UserWarning`; pass `LLMEngine(strict_multimodal=True)` to raise `UnsupportedFeatureError` instead.
- **Wire mapping.** Anthropic, OpenAI and Google each have a different content-block format; the framework converts once per request. You see one uniform API regardless of provider.

### Provider capability matrix (current)

| Provider                                | Vision-capable models                                                        | Audio-capable models                                                               |
| --------------------------------------- | ---------------------------------------------------------------------------- | ---------------------------------------------------------------------------------- |
| **Anthropic**                           | `claude-3*`, `claude-4*`, `claude-opus*`, `claude-sonnet*`, `claude-haiku*`  | `claude-3-7*`, `claude-4*` (3.0 / 3.5 are vision-only)                             |
| **OpenAI**                              | `gpt-4-turbo`, `gpt-4o*`, `gpt-4.1*`, `gpt-5*`, reasoning `o1` / `o3` / `o4` | `gpt-4o-audio*`, `gpt-4o-realtime*`, `gpt-4o-mini-audio*`, `gpt-4o-mini-realtime*` |
| **Google**                              | `gemini-1.5*`, `gemini-2*`, `gemini-3*`                                      | same — Gemini 1.5+ is uniformly multimodal                                         |
| **DeepSeek**, **LMStudio**, **LiteLLM** | provider-default (none assumed)                                              | provider-default (none assumed)                                                    |

`provider.supports_vision(model)` and `provider.supports_audio(model)` expose the same check programmatically; both are class methods so you can ask without instantiating a client.

## When to use it

- **Vision tasks** — describe an image, OCR a document scan, classify a UI screenshot, audit a chart, locate elements in a photo.
- **Audio tasks** — transcribe, summarise a meeting clip, score pronunciation, extract entities from a voice note (provider-dependent; text-out is universal, audio-out is OpenAI's realtime models only).
- **Mixed input** — pass `task="…"` plus N images plus one audio clip; the LLM sees the whole bundle as one turn.

## When NOT to use it

- **Document parsing where you control the pipeline.** Run OCR / extraction yourself and feed text — far cheaper, more deterministic, no per-image token cost.
- **Streaming media.** The framework batches a finite list of attachments per request. For real-time audio in/out you want a provider's realtime / WebSocket API directly, not LazyBridge.
- **Provider-hosted file search.** That's `NativeTool.FILE_SEARCH` (see [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md)) — different surface; the model doesn't see the file content as an attachment, it queries an index server-side.

## Example

```python
from pathlib import Path

from lazybridge import Agent, AudioContent, ImageContent, LLMEngine


# 1) URL — bare string is coerced to ImageContent at call time.
agent = Agent(engine=LLMEngine("claude-opus-4-7"))
result = agent(
    "Describe what's in this picture in two sentences.",
    images=["https://upload.wikimedia.org/wikipedia/commons/4/47/PNG_transparency_demonstration_1.png"],
)
print(result.text())


# 2) Local files — Path is coerced to base64 + correct media_type.
result = agent(
    "Summarise the chart and call out the outlier.",
    images=[Path("/tmp/quarterly_revenue.png")],
)
print(result.text())


# 3) Multiple images in one turn — the model sees them all together.
result = agent(
    "Which of these screenshots shows the broken layout?",
    images=[
        Path("/tmp/before.png"),
        Path("/tmp/after.png"),
        Path("/tmp/expected.png"),
    ],
)


# 4) Audio — same coercion rules.
result = agent(
    "Transcribe this clip; highlight any product names mentioned.",
    audio=Path("/tmp/standup.wav"),
)
print(result.text())


# 5) Strongly-typed block when you need to override media_type or
#    embed bytes you already have in memory.
img = ImageContent.from_bytes(open("/tmp/screenshot.bin", "rb").read(), media_type="image/png")
agent("What's the error in this screenshot?", images=[img])


# 6) Strict mode — raise instead of dropping when the model can't
#    handle the modality. Useful in production, where a silent drop
#    would change semantics undetected.
strict_agent = Agent(
    engine=LLMEngine("claude-opus-4-7", strict_multimodal=True),
)
try:
    strict_agent(
        "Describe this audio.",
        audio=Path("/tmp/clip.mp3"),
    )
except Exception as exc:
    # UnsupportedFeatureError if the chosen model lacks audio support
    print(f"refused: {exc}")
```

Supported MIME types (auto-detected from extension by `from_path`):

- **Images:** `image/jpeg`, `image/png`, `image/gif`, `image/webp`, `image/bmp`, `image/tiff`
- **Audio:** `audio/wav`, `audio/mpeg`, `audio/mp3`, `audio/flac`, `audio/ogg`, `audio/webm`, `audio/aac`, `audio/mp4`

## Pitfalls

- **Default is "drop with warning", not "raise".** A plain `LLMEngine("gpt-4o-mini")` with `images=[...]` silently strips the images and emits one `UserWarning`. Pass `strict_multimodal=True` in production so a model swap that loses vision support breaks loudly instead of changing behaviour silently.
- **Attachments only ride on step 0 of a `Plan`.** Pass `images=` / `audio=` to `pipeline(task, images=...)` and only the first step sees them. Downstream steps receive text + structured payloads, never the original media. If a later step needs the bytes, embed a path or URL in the payload and re-attach in a `from_step(...)` predicate, or re-call the multimodal step with the bytes itself as input.
- **Bare `bytes` requires a typed block.** `agent(..., images=[buf])` works only because the coercer assumes `image/jpeg` for raw bytes. Use `ImageContent.from_bytes(buf, media_type="image/png")` whenever the format isn't JPEG; otherwise the provider may reject the request or render garbage.
- **`audio=` is a single value, `images=` is a list.** Most providers accept N images per turn but at most one audio clip. Pass a list of audio clips and the framework will refuse it.
- **Cost is per-image, per-tile.** Vision input is tokenised in tiles (Anthropic) or scaled blocks (OpenAI / Google). A 4K screenshot costs an order of magnitude more than a 512×512 thumbnail. Resize upstream when the model only needs a thumbnail.
- **`Envelope.images` / `Envelope.audio` after the run** carry the *input* attachments — providers that do *audio out* (OpenAI realtime) surface the response audio through provider-specific extensions, not this field. Read the run's text payload first; the field is for carry-through, not response media.
- **`UnsupportedFeatureError` subclasses `ValueError`.** Catch it precisely (`from lazybridge import UnsupportedFeatureError`) when you want to fall back to a vision-capable model rather than swallow every `ValueError` raised at construction time.

## See also

- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — `Envelope.images` and `Envelope.audio` are the carry-through fields these calls populate.
- [Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md) — `NativeTool.FILE_SEARCH` and `NativeTool.IMAGE_GENERATION` are separate features (provider-hosted retrieval / generation), not attachments.
- [Engines](https://core.lazybridge.com/reference/engines/index.md) — `strict_multimodal` and the surrounding production knobs.

# Parallel

Deterministic, scripted fan-out. The same task goes to N agents concurrently. The wrapper returns ONE `Envelope` whose `text()` is the labelled-text join of every branch's output — same shape as `Plan`'s `from_parallel_all` aggregator. For typed per-branch access call `parallel.run_branches(task)` (async). No orchestrator LLM is involved.

## Signature

```python
from lazybridge import Agent

# Canonical — Agent.parallel IS the canonical form for scripted fan-out.
multi = Agent.parallel(
    *agents,                       # one or more Agent instances
    concurrency_limit=None,        # int | None; cap on simultaneous in-flight calls
    step_timeout=None,             # float | None; per-agent asyncio.wait_for deadline (seconds)
    name="parallel",
    description=None,
    session=None,
)

env = multi(task)                  # ONE Envelope: text() = labelled join, payload = joined str
print(env.text())                  # "[a]\n<a-output>\n\n[b]\n<b-output>" ...

# Typed per-branch access (advanced):
import asyncio
branches = asyncio.run(multi.run_branches(task))   # list[Envelope] in input order
```

`Agent.parallel(...)` returns a `ParallelAgent` — a sibling class of `Agent`. Its `__call__` returns ONE folded `Envelope` (so the runner plugs into another agent's `tools=[...]` uniformly with every other agent in the framework). For the underlying `list[Envelope]`, use `run_branches(task)`. The closest from-primitives equivalent is hand-written `asyncio.gather(*[a.run(task) for a in agents])` — see [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) for the full nuance.

## Synopsis

`Agent.parallel(a, b, c)(task)` runs the same task against three agents concurrently and returns three envelopes in the order you passed them. Errors in one branch surface as `Envelope.error_envelope(...)` in the corresponding slot — the call never raises; the positional contract is preserved.

`concurrency_limit=` caps simultaneous in-flight calls (a `asyncio.Semaphore`). `step_timeout=` wraps each per-agent call in `asyncio.wait_for` so a slow branch can't stall the rest.

`Agent.parallel` is **deterministic and scripted** — every input agent runs unconditionally. If you want the LLM to decide which sub-agents to invoke (and whether in parallel), put the agents in `tools=[...]` of a regular `Agent` instead; the engine fans out tool calls automatically when the model emits more than one in a turn.

## When to use it

- **Multi-region / multi-source fan-out.** Three researchers, one per region, called against the same query.
- **Ensemble voting.** Same task, several models, then a downstream step picks the best or aggregates.
- **Independent sub-tasks that share an input.** Where you've already decided every branch must run.
- **Throttled fan-out** when downstream APIs are rate-limited — use `concurrency_limit=` to cap parallelism without serialising.

## When NOT to use it

- **LLM-directed dispatch.** Use `Agent(tools=[a, b, c])`. The engine emits parallel tool calls automatically when the model asks for more than one — so "the model decides which subset to call" comes for free. `Agent.parallel` runs every branch; you cannot opt one out at runtime.
- **Conditional / routed flows.** Use `Plan` with parallel bands (`Step(..., parallel=True)`) plus routing, or use `from_parallel_all("name")` to aggregate. `Agent.parallel` runs every branch unconditionally; if you need concurrent steps that must converge in a typed downstream `Step`, use `Plan`.
- **Anything that needs typed downstream consumption.** Each envelope's payload is whatever the corresponding agent produced — possibly several different shapes. If the next step needs a uniform typed input, normalise via a follow-up summariser or use `Plan` parallel bands with `from_parallel_all`.

## Example

```python
from lazybridge import Agent, LLMEngine


def search_us(query: str) -> str:
    """Search US sources for ``query``."""
    return "..."

def search_eu(query: str) -> str:
    """Search EU sources for ``query``."""
    return "..."

def search_asia(query: str) -> str:
    """Search Asian sources for ``query``."""
    return "..."


us = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    tools=[search_us],
    name="us",
)
eu = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    tools=[search_eu],
    name="eu",
)
asia = Agent(
    engine=LLMEngine("deepseek-v4-flash"),
    tools=[search_asia],
    name="asia",
)


# 1) Three branches, one task, results in input order.
multi = Agent.parallel(
    us,
    eu,
    asia,
    concurrency_limit=3,           # cap simultaneous in-flight calls
    step_timeout=30.0,
)
env = multi("AI policy news")
print(env.text())                       # labelled-text join across branches

# For typed per-branch access (advanced):
# branches = await multi.run_branches("AI policy news")
# for b in branches:
#     print(b.metadata.model, b.text()[:100])


# 2) Aggregate into a single answer with a follow-up agent.
def join_branches(envs: list) -> str:
    return "\n\n".join(f"[{e.metadata.model}] {e.text()}" for e in envs)


synthesiser = Agent(
    engine=LLMEngine(
        "claude-opus-4-7",
        system="Combine the regional briefings into one global summary.",
    ),
    name="synth",
)
summary = synthesiser(join_branches(results))
print(summary.text())


# 3) Failure isolation — one branch's error doesn't kill the others.
results = multi("a deliberately tricky question")
for env in results:
    if not env.ok:
        print(f"branch failed ({env.error.type}): {env.error.message}")
    else:
        print(env.text()[:100])
```

## Pitfalls

- **`__call__` returns ONE Envelope, not `list[Envelope]`** (since 0.7.9). Read `env.text()` for the labelled-text join, or call `multi.run_branches(task)` (async) for the typed list.
- **`concurrency_limit=None` (default) fires everything at once.** When the underlying providers are rate-limited or your CPU / network is the bottleneck, set a cap.
- **`step_timeout` returns an error envelope, not a raise.** The positional contract is preserved — the slot for the timed-out agent contains an error envelope; siblings keep their results. Check `env.ok` before reading `env.text()`. The first non-`None` branch error also propagates as the wrapper Envelope's `error` so downstream consumers can short-circuit.
- **Not LLM-directed.** If you want "the model decides whether to call all three", use `Agent(tools=[us, eu, asia])` instead; parallel tool dispatch happens automatically when the engine emits multiple tool calls in a single turn.
- **Automatic aggregation since 0.7.9.** The wrapper Envelope's `payload` is a labelled-text join of every branch — same shape as `Plan`'s `from_parallel_all` aggregator — and `metadata.nested_*` rolls every branch's cost up. Use `as_tool()` to plug the runner into another agent's `tools=[...]` list with no extra adapter.
- **Cost rollup.** Each branch's metadata is preserved in its envelope; the wrapper folds them into `metadata.nested_*` so parallel-of-parallel composes cleanly. Sessions aggregate cost across all branches via `usage_summary()`.

## See also

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — sequential composition; complements parallel.
- [Nested pipelines](https://core.lazybridge.com/guides/full/composition-patterns/index.md) — parallel bands of **sub-pipelines** (not just single agents) with `from_parallel_all(...)` aggregation, plus the choice between `Agent.parallel(...)`, `parallel=True` plan bands, and LLM-decided dispatch over sub-pipelines.
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — `multi.as_tool()` exposes the fan-out as a single `Tool` that delegates to `run()` (since 0.7.9 the wrapper Envelope already carries the labelled-text join).
- *Guides → Full → Plan* (Phase 3) — `Plan` parallel bands (`Step(..., parallel=True)`) and `from_parallel_all("name")` aggregation when concurrent steps must produce a single downstream input.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — why `Agent.parallel` is its own primitive, not sugar over `Agent`.

# Pool chains — a worked example

A **pool chain** is a dynamic workflow built from several bounded *local action spaces* ([pools](https://core.lazybridge.com/guides/mid/dynamic-graph/index.md)) connected by **gateway agents**. The builder defines the local worlds and the transition points; the runtime path emerges from agent decisions and ends when a legitimate terminal agent calls `conclude`.

This page works through a single scenario — a product-investigation workflow with three local worlds — first as a **progressive** chain and then as a **recombining** variant. Read [Dynamic graph (AgentPool + conclude)](https://core.lazybridge.com/guides/mid/dynamic-graph/index.md) first for the primitives and the `Plan` vs `AgentPool` (static vs dynamic) duality.

## The scenario

Three local worlds, each a pool:

```text
Discovery Pool:   scout, analyst, gateway_to_build
Build Pool:       architect, implementer, tester, gateway_to_release
Release Pool:     reviewer, approver, publisher
```

Rules:

- `gateway_to_build` belongs to Discovery but carries the **Build** pool's route tool — it is the only way into Build.
- `gateway_to_release` belongs to Build but carries the **Release** pool's route tool — it is the only way into Release.
- Only `approver` or `publisher` may `conclude`. Nobody else can end the whole chain.
- Every member reaches its own pool through `pool.as_tool()` with an explicit name: `ask_discovery_pool`, `ask_build_pool`, `ask_release_pool`.

```
stateDiagram-v2
    [*] --> Discovery
    Discovery --> Build: gateway_to_build
    Build --> Release: gateway_to_release
    Release --> [*]: conclude (approver / publisher)
```

## Progressive (non-recombining) chain

```python
from lazybridge import Agent, AgentPool, LLMEngine, conclude

discovery_pool = AgentPool(max_depth=8)
build_pool = AgentPool(max_depth=8)
release_pool = AgentPool(max_depth=8)

# --- Discovery local world -------------------------------------------------
scout = Agent(
    name="scout",
    description="Gathers raw evidence and source material for the question.",
    engine=LLMEngine("claude-haiku-4-5", max_tool_calls_per_turn=1),
    tools=[discovery_pool.as_tool("ask_discovery_pool")],
)
analyst = Agent(
    name="analyst",
    description="Turns scout evidence into structured findings and open questions.",
    engine=LLMEngine("claude-haiku-4-5", max_tool_calls_per_turn=1),
    tools=[discovery_pool.as_tool("ask_discovery_pool")],
)
gateway_to_build = Agent(
    name="gateway_to_build",
    description="Gateway from Discovery into Build. Crosses only when requirements are stable.",
    engine=LLMEngine(
        "claude-opus-4-8",
        system=(
            "You belong to Discovery but can enter Build. "
            "Stay in Discovery while requirements are unclear. "
            "Call the build pool only when findings are stable enough to "
            "design, implement, or test."
        ),
        max_tool_calls_per_turn=1,
    ),
    tools=[
        discovery_pool.as_tool("ask_discovery_pool"),
        build_pool.as_tool("ask_build_pool"),
    ],
)

# --- Build local world -----------------------------------------------------
architect = Agent(
    name="architect",
    description="Designs the solution from stable requirements.",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[build_pool.as_tool("ask_build_pool")],
)
implementer = Agent(
    name="implementer",
    description="Implements the architect's design.",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[build_pool.as_tool("ask_build_pool")],
)
tester = Agent(
    name="tester",
    description="Validates the implementation against the findings.",
    engine=LLMEngine("claude-haiku-4-5", max_tool_calls_per_turn=1),
    tools=[build_pool.as_tool("ask_build_pool")],
)
gateway_to_release = Agent(
    name="gateway_to_release",
    description="Gateway from Build into Release. Crosses only when build is tested and green.",
    engine=LLMEngine(
        "claude-opus-4-8",
        system=(
            "You belong to Build but can enter Release. "
            "Call the release pool only when the implementation is built and tested."
        ),
        max_tool_calls_per_turn=1,
    ),
    tools=[
        build_pool.as_tool("ask_build_pool"),
        release_pool.as_tool("ask_release_pool"),
    ],
)

# --- Release local world ---------------------------------------------------
reviewer = Agent(
    name="reviewer",
    description="Reviews the release candidate; routes to approver or back for fixes.",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[release_pool.as_tool("ask_release_pool")],
)
approver = Agent(
    name="approver",
    description="Approves the release. A legitimate terminal agent.",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[release_pool.as_tool("ask_release_pool"), conclude],
)
publisher = Agent(
    name="publisher",
    description="Publishes the approved result and concludes the chain.",
    engine=LLMEngine("claude-opus-4-8", max_tool_calls_per_turn=1),
    tools=[release_pool.as_tool("ask_release_pool"), conclude],
)

# --- Register each pool AFTER its members exist ----------------------------
# A forward gateway is registered ONLY in its source pool. It is reachable
# from the pool it leaves, and carries the next pool's route tool to step
# forward — but destination agents cannot select it, so there is no path back.
discovery_pool.register(scout, analyst, gateway_to_build)
build_pool.register(architect, implementer, tester, gateway_to_release)
release_pool.register(reviewer, approver, publisher)

result = scout("Should we ship a usage-based pricing tier? Investigate, build a plan, release it.")
print(result.text())   # whatever approver/publisher passed to conclude(...)
```

What this expresses:

- **The builder does not enumerate every transition.** No edge list maps `scout → analyst` or `architect → tester`. Those routes are chosen at runtime inside each pool.
- **Agents choose the next specialist by routing inside their local pool.** Discovery members only see `ask_discovery_pool`.
- **A phase transition occurs only when a gateway agent is selected** and decides to call the next pool. `gateway_to_build` is the only door from Discovery to Build; `gateway_to_release` the only door onward.
- This is a **progressive, non-recombining** chain: it flows Discovery → Build → Release and terminates at `conclude`. Because each forward gateway is registered **only in its source pool**, agents in a later world cannot select it, so there is no route back to an earlier world — which makes the chain easy to trace. (If a gateway is also registered in its destination pool, that destination becomes able to route back through it — which is exactly how the recombining variant below is built.)

## Recombining (reversible-boundary) variant

Replace the one-way `gateway_to_build` with a **reversible boundary controller** that can send work *back* to Discovery when Build exposes ambiguity. Only the boundary changes; the rest of the graph is the same.

```python
boundary_discovery_build = Agent(
    name="boundary_discovery_build",
    description=(
        "Reversible boundary between Discovery and Build. "
        "Enter Build when requirements are stable; "
        "return to Discovery when Build exposes missing or contradictory requirements."
    ),
    engine=LLMEngine(
        "claude-opus-4-8",
        system=(
            "You are the reversible boundary between Discovery and Build. "
            "Use the discovery pool for clarification and evidence. "
            "Use the build pool for design, implementation, and testing. "
            "Return to discovery if build work reveals ambiguity. "
            "Make progress on each pass; do not bounce without new information. "
            "You are not a terminal agent — do not conclude."
        ),
        max_tool_calls_per_turn=1,
    ),
    tools=[
        discovery_pool.as_tool("ask_discovery_pool"),
        build_pool.as_tool("ask_build_pool"),
    ],
)

discovery_pool.register(scout, analyst, boundary_discovery_build)
build_pool.register(boundary_discovery_build, architect, implementer, tester, gateway_to_release)
```

```
stateDiagram-v2
    [*] --> Discovery
    Discovery --> Build: boundary_discovery_build (forward)
    Build --> Discovery: boundary returns on ambiguity
    Build --> Release: gateway_to_release
    Release --> [*]: conclude (approver / publisher)
```

What this changes:

- **A reversible gateway can move forward or backward** between local worlds. `boundary_discovery_build` can re-enter Discovery when Build surfaces a gap.
- This makes the chain **recombining / recurrent**: execution can revisit Discovery, so the same pool can be a state more than once.
- It is still **bounded** — by pool membership, by `max_depth` on each pool, and by terminal `conclude` placement on `approver`/`publisher` only. The boundary explicitly is *not* a terminal agent.

> This is **not a static DAG**. The path is selected at runtime inside builder-defined local worlds. It is also **not a strict Markov process** unless the full context, memory, store, and conversation history are treated as part of the state — two visits to Discovery differ by the evidence accumulated in between.

## Notes and cautions

- A reversible boundary is a **high-authority node** — it controls movement between worlds and carries context across phases. Keep its prompt explicit about when to stay, cross, return, and (never, here) conclude, and keep its progress criteria sharp to avoid oscillation.
- Keep `conclude` on terminal roles only. If discovery or build members could conclude, the chain could end before reaching Release.
- Keep `max_depth` low on recombining chains; pair it with route tracing so a boundary that bounces without progress is visible.
- When an **outer lifecycle** must stay deterministic (fixed phase boundaries, checkpoints, typed hand-offs), wrap the pool chain in a [`Plan`](https://core.lazybridge.com/guides/full/plan/index.md) rather than expressing that lifecycle as more gateways.

## See also

- [Dynamic graph (AgentPool + conclude)](https://core.lazybridge.com/guides/mid/dynamic-graph/index.md) — the primitives, gateway and reversible-gateway patterns, the state-process model, and the structural-scoping caveat.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the static workflow runtime for when the path must be known, validated, typed, and checkpointed.
- [Multi-agent graphs](https://core.lazybridge.com/reference/multi-agent/index.md) — API reference for `AgentPool`, `conclude`, `ConcludeSignal`.

# Session

The observability container for an agent run — an event log plus a list of exporters (console, JSON file, OpenTelemetry, your own). Every engine emits the same event schema; nested agents inherit the caller's session, so cost / token / latency rollup works transitively across the whole tree.

## Signature

```python
from lazybridge import Session

Session(
    *,
    db=None,                       # None = in-memory SQLite (lost at close)
    exporters=None,                # list[EventExporter]; None = []
    # Redaction
    redact=...,                    # default: redact_secrets — sk-/ghp-/Bearer/JWT/etc. masked
    redact_on_error="strict",      # "strict" (drop on redact failure) | "fallback" (warn + record raw)
    unsafe_log_payloads=False,     # set True to disable default secret redaction; redact=None has same effect
    console=False,                 # convenience: append a ConsoleExporter
    # Batched-writer (opt-in) — emit() becomes non-blocking
    batched=False,
    batch_size=100,
    batch_interval=1.0,
    max_queue_size=10_000,
    on_full="hybrid",              # "hybrid" (default) | "block" | "drop"
    critical_events=None,          # frozenset[str] — overrides hybrid set
)

# Methods
session.emit(event_type, payload, *, run_id=None)
session.add_exporter(exporter)
session.remove_exporter(exporter)
session.flush(timeout=5.0)         # drain the batched writer
session.close()                    # flush + release SQLite
session.usage_summary()            # {"total": {...}, "by_agent": {...}, "by_run": {...}}

# Live members
session.events                     # EventLog — session.events.query(...) for raw rows
session.graph                      # GraphSchema — agent topology, auto-populated
```

### EventType (StrEnum)

| Member                                     | Emitted by                                         |
| ------------------------------------------ | -------------------------------------------------- |
| `AGENT_START` / `AGENT_FINISH`             | every `Agent` run, including nested                |
| `LOOP_STEP`                                | each iteration of an `LLMEngine` tool-calling loop |
| `MODEL_REQUEST` / `MODEL_RESPONSE`         | every provider call                                |
| `TOOL_CALL` / `TOOL_RESULT` / `TOOL_ERROR` | every tool dispatch                                |
| `HIL_DECISION`                             | `HumanEngine` / `SupervisorEngine` decisions       |

`Agent(verbose=True)` creates a private `Session(console=True)` for that agent — useful for one-off debugging without wiring an explicit session.

## Synopsis

A `Session` does three things:

1. **Persists events** to an SQLite-backed `EventLog`. Every engine emits the same enum, so a single query returns a full per-run trace.
1. **Fans events out to exporters** in registration order — Console, `JsonFileExporter`, `OTelExporter`, custom sinks.
1. **Aggregates cost / tokens / latency** across the whole agent tree via `usage_summary()`, including transitive rollup from nested sub-agents.

Pass a `Session` once at the top-level agent. Nested agents (`Agent A` with `Agent B` in `tools=[...]`) inherit it automatically; the graph view shows the whole tree, the cost rollup includes every child.

## When to use it

- **You want any of**: cost tracking across multiple `agent(task)` calls, a JSON-line event log for offline analysis, OpenTelemetry spans, a graph view of an agent topology.
- **Production deployments** where the hot path can't block on disk / network — pair `batched=True` with `JsonFileExporter` / `OTelExporter`.
- **Multi-agent pipelines.** The session at the outermost agent collects events from every nested agent and tool call without any per-agent plumbing.
- **PII / sensitive-data scrubbing.** Pass a `redact=` callable that rewrites payloads before they reach exporters; the default `redact_on_error="strict"` fails closed if the redactor itself errors. By default Session already runs `redact_secrets` — well-known credential shapes (`sk-...`, `ghp_...`, `AIza...`, JWT, `Bearer ...`) are stripped from every event payload before it leaves the bus. Set `unsafe_log_payloads=True` (or `redact=None`) to disable it; pass your own `redact=` to replace it.

## When NOT to use it

- **Single one-off `agent(task)` call where you just want stdout.** Use `Agent(verbose=True)` instead — it's a private console session and saves you the import.
- **Real-time analytics with sub-millisecond latency requirements.** The default sync writer fits most workloads; for the hot path use `batched=True`. If even that's too slow, write a custom `EventExporter` that pushes to your own pipeline.

## Example

```python
from lazybridge import (
    Agent,
    JsonFileExporter,
    LLMEngine,
    Session,
)
from lazybridge.session import EventType


# 1) Dev-mode tracing — one flag.
sess = Session(console=True)
chat = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    session=sess,
    name="chat",
)
chat("hello")


# 2) Production shape — multi-sink, batched, redacted.
def mask_pii(payload: dict) -> dict:
    if "task" in payload:
        return {**payload, "task": payload["task"].replace("foo@bar.com", "[REDACTED]")}
    return payload


sess = Session(
    db="events.sqlite",
    batched=True,
    on_full="hybrid",                 # default; explicit for clarity
    exporters=[
        JsonFileExporter(path="events.jsonl"),
    ],
    redact=mask_pii,
)

researcher = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="research",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="write",
)
pipeline = Agent.chain(researcher, writer, session=sess)
pipeline("summarise AI trends")


# 3) Cost / token roll-up across the whole tree.
print(sess.usage_summary()["total"]["cost_usd"])


# 4) Drain the batched writer before reading the log.
sess.flush()
errors = sess.events.query(event_type=EventType.TOOL_ERROR)


# 5) Topology for a UI / report.
print(sess.graph.to_json())


# 6) OpenTelemetry — install lazybridge[otel].
from lazybridge.ext.otel import OTelExporter

sess.add_exporter(OTelExporter(endpoint="http://otelcol:4318"))
```

## Pitfalls

- **`Session(db=":memory:")` behaves like `Session()`** — both are in-memory. Pass a real filename to persist.
- **Exporter failures warn once per instance.** Subsequent failures from the same exporter are suppressed. While debugging a noisy exporter, wrap it in `CallbackExporter(fn=lambda e: print(e))` so you see every emission attempt.
- **`Agent(verbose=True)` creates a fresh private Session.** If you also pass `session=another`, `verbose` is ignored — the explicit session wins.
- **`batched=True` makes reads stale.** `session.events.query(...)` may not reflect events still in the writer queue; call `session.flush()` (or use `Session` as a context manager that auto-closes) before querying.
- **`on_full="drop"` was the pre-1.0.x default.** The new `"hybrid"` default holds critical events (`AGENT_*` / `TOOL_*` / `HIL_DECISION`) and only drops cheap telemetry (`LOOP_STEP` / `MODEL_REQUEST` / `MODEL_RESPONSE`) under saturation. Set `on_full="block"` if you need every event, no exceptions; set `on_full="drop"` to opt back into the old behaviour.
- **`redact_on_error="strict"` (default) drops the event** if the redactor raises or returns a non-dict. Use `"fallback"` only when you want the unredacted payload as a backup; the trade-off is the potential to log raw PII.
- **Default secret redaction is on.** `Session()` with no `redact=` argument wires `redact_secrets` which masks `sk-...` / `ghp_...` / `AIza...` / JWT / `Bearer ...` shapes in payload strings. It does *not* mask emails, phone numbers, or other PII — compose your own redactor on top if you need that. Disable entirely with `unsafe_log_payloads=True` (or `redact=None`); pass your own `redact=` to replace it (LazyBridge does not stack the default in front of a user redactor).
- **Nested agents inherit `session=`** unless they pass their own. This is what gives you transitive cost rollup; pass an explicit `session=None` on a sub-agent only when you genuinely want it invisible.

## See also

- *Guides → Full → Exporters* (Phase 3) — the sinks that consume session events (`ConsoleExporter`, `JsonFileExporter`, `StructuredLogExporter`, `FilteredExporter`, `CallbackExporter`, `OTelExporter`).
- *Guides → Full → GraphSchema* (Phase 3) — the topology view exposed via `session.graph`.
- [Memory](https://core.lazybridge.com/guides/mid/memory/index.md) — separate concept (conversation context, not observability).
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `session=` is a first-class kwarg; `verbose=True` is the convenience shortcut.

# Store

A thread-safe key-value blackboard, in-memory by default and SQLite-backed when you want durability. The Store is what `Plan` checkpoints land in, what `from_agent("alias")` reads from, and what you reach for whenever state must survive a crash or a process boundary.

## Signature

```python
from lazybridge import Store

Store(db=None)
  # db=None    → in-memory (deep-copied dict; lost on exit)
  # db="path"  → SQLite, WAL mode, thread-safe, persistent

# Methods
store.write(key, value, *, agent_id=None)
store.read(key, default=None)
store.read_entry(key)            # StoreEntry | None — value plus metadata
store.read_all()                 # dict[str, Any]
store.keys()                     # list[str]
store.delete(key)
store.clear()                    # drop every key (irreversible, no resume)
store.to_text(keys=None)         # render as "key: <json>" lines for sources=
store.compare_and_swap(key, expected, new)   # atomic CAS — internal to checkpoint, also useful for cross-process locks
```

`StoreEntry` is a dataclass `(key, value, written_at, agent_id)`.

## Synopsis

`Store` is the durable / shared counterpart to [`Memory`](https://core.lazybridge.com/guides/mid/memory/index.md). Where `Memory` is "what the model should see in the next prompt", `Store` is "what should survive a crash, or be read by another agent / process / machine".

Three things flow through it automatically:

- **Pipeline checkpoints.** A `Plan` with `store=` and a `checkpoint_key=` writes step results so a crashed run can resume.
- **Agent outputs.** Every agent writes its last result under `"__agent_output__:{name}"` after a successful run, so `from_agent("name")` sentinels (and code that wants to peek out-of-band) can read it.
- **Plan step `writes=` declarations.** `Step("research", writes="hits")` copies the step's payload to the key `"hits"` once it succeeds.

You can also use it as a plain blackboard — write whatever keys you like, read them from anywhere that holds the same `Store`. SQLite mode makes it safe to share across threads and concurrent agent runs.

## When to use it

- **Crash-resumable pipelines.** Pass `store=Store(db="run.sqlite")` and a `checkpoint_key=` to a `Plan`; `resume=True` on the next run picks up at the failed step.
- **Cross-agent shared blackboard.** A fan-out of researchers writes intermediate facts; a downstream synthesiser reads them via `sources=[store]` (live view) or `from_agent("…")` (per-agent output).
- **Out-of-band inspection.** A monitor / dashboard process opens the same SQLite file read-only and queries the live state without joining the agent run.
- **Cross-process artefact handoff.** One service writes computed artefacts; another service reads them. Use a shared filesystem path or a pointer (URL, file path) as the value.

## When NOT to use it

- **Conversation context for an LLM call.** That's `Memory`'s job. Don't shovel the whole `Store` into prompts every turn — pass the store as a `sources=` (live view) or read specific keys with `from_agent` / `from_step`.
- **High-throughput counters or queues.** Each write commits immediately; there's no transactional batch. For that workload reach for a real database or queue.
- **Large binary blobs.** Values are JSON-encoded on write. Store a filesystem path / URL as the value and let the consumer read the blob directly.

## Example

```python
from lazybridge import (
    Agent,
    LLMEngine,
    Plan,
    Step,
    Store,
    from_agent,
)


store = Store(db="research.sqlite")


# 1) Plan step writes a result via Step(writes=).
researcher = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    store=store,                       # required for from_agent later
    name="research",
)
writer = Agent(
    engine=LLMEngine("gpt-5.4-mini"),
    name="write",
)

pipeline = Agent(
    engine=Plan(
        Step("research", writes="hits"),   # store["hits"] = step payload
        Step("write"),
    ),
    tools=[researcher, writer],
    store=store,
)
pipeline("AI trends in 2026")
print(store.read("hits"))


# 2) Agents auto-write their output to Store after each run.
#    Key: "__agent_output__:{name}".
researcher("AI trends 2026")
print(store.read("__agent_output__:research"))


# 3) from_agent("name") reads that output in a Plan step.
#    IMPORTANT: store= must be on the SOURCE AGENT (researcher), not
#    just the pipeline.
editor = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    name="edit",
)
plan_with_handoff = Agent(
    engine=Plan(
        Step("research"),
        Step("edit", context=from_agent("research")),
    ),
    tools=[researcher, editor],
)
plan_with_handoff("Topic: bees")


# 4) Agent with sources=[store] sees the live store on every call.
monitor = Agent(
    engine=LLMEngine("gemini-3-flash-preview"),
    sources=[store],
)
print(monitor("what's the current state?").text())
```

## Pitfalls

- **`Store(db=":memory:")`** is **not** the same as `Store()`. The former opens an in-memory SQLite (connection-scoped); the latter uses a Python dict. Use `Store()` for in-process state and the filename form for durability.
- **Auto-write key uses the alias, not `agent.name`.** When you wrap an agent as `agent.as_tool("alias")`, the auto-write key is `"__agent_output__:alias"`. `from_agent("alias")` reads the same key. If you query the store directly, use the alias.
- **`store=` must be on the source agent**, not just on the pipeline. `from_agent("research")` reads `__agent_output__:research`; that key is only written if the researcher itself was constructed with `store=store`. PlanCompiler rejects `from_agent(...)` references whose source agent has no store attached.
- **Reads return deep copies.** Mutating `store.read("k")` does not change the stored value. The in-memory backend matches the SQLite copy-on-write semantics on purpose.
- **Values are JSON-encoded** via `json.dumps(default=str)`. Non-JSON types are stringified on the way in. Prefer primitives and `pydantic_model.model_dump()` over raw class instances.
- **`store.to_text()` materialises the entire keyspace** by default; pass `keys=[...]` to limit the slice when the store has thousands of keys.
- **Each write commits immediately.** There's no transactional batch. If you need atomic multi-key updates, layer them yourself or use a real RDBMS.

## See also

- [Memory](https://core.lazybridge.com/guides/mid/memory/index.md) — the in-prompt counterpart.
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — observability of writes (events emitted on agent finish carry the auto-write metadata).
- *Guides → Full → Checkpoint & resume* (Phase 3) — how `Plan` uses `Store` for crash-resilient pipelines.
- *Guides → Full → Sentinels* (Phase 3) — `from_agent("name")` and `from_step("name")` resolve against the Store at run time.

# verify=

A judge-and-retry loop wrapped around an agent's output (or around a specific tool call). Each output is graded; rejection feeds the judge's reason back into the next attempt as feedback, capped at `max_verify` attempts.

The hard sibling of `verify=` is [Guards](https://core.lazybridge.com/guides/mid/guards/index.md): a guard blocks on policy violation; `verify=` accepts feedback and retries. Pick the right tool for the policy.

## Signature

```python
# Three placements, same judge contract.

# 1. Agent-level — final output gate
Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    verify=judge_agent,            # Agent or Callable[[str], Any]
    max_verify=3,                  # max attempts; default 3, must be >= 1
)

# 2. Tool-level — every call through the wrapped agent is gated
agent.as_tool(
    name="...",
    description="...",
    verify=judge_agent,
    max_verify=3,
)

# 3. Plan-level — wrap the step's agent with its own verify=
Plan(
    Step(
        target=Agent(
            engine=LLMEngine("claude-haiku-4-5"),
            verify=judge_agent,
            max_verify=3,
            name="summarise",
        ),
        name="summarise",
    ),
)
```

### Judge contract

- The judge receives the agent's output text (and the original task for context) and must respond with a string starting with `"approved"` (case-insensitive) to accept.
- Anything else is treated as rejection; the verdict text is injected as feedback on the next retry: `f"{original_task}\n\nFeedback: {judge_verdict}"`.
- Judges may be `Agent` instances or plain callables (`Callable[[str], Any]`).

## Synopsis

`verify=` is the soft, retryable counterpart to `Guards`. Where a guard is a hard yes / no gate that ends the run on failure, a verifier *re-asks* the agent with the judge's feedback baked into the next prompt. After `max_verify` attempts the last result is returned as-is, even if still rejected — there's no infinite loop.

There are three placements, all with the same contract:

- **Agent-level** (`Agent(engine=…, verify=judge)`) gates the agent's final output. Whatever tool chain the engine chose internally, the judge sees the eventual text and may force a retry.
- **Tool-level** (`agent.as_tool(verify=judge)`) gates every invocation of one specific wrapped agent — useful when one sub-agent is the risky one and the rest is fine. Already documented in passing in [as-tool](https://core.lazybridge.com/guides/mid/as-tool/index.md); `verify=` is the dedicated reference for the loop semantics.
- **Plan-level** is just a special case of agent-level: wrap the step's agent with its own `verify=`. No special primitive.

## When to use it

- **High-stakes outputs that should not ship "first try".** Drafts, summaries of regulated content, customer-facing replies.
- **Quality control with feedback.** Unlike a guard, `verify=` gives the agent a chance to fix what the judge complained about. Use it when "wrong answer" is recoverable, not just blockable.
- **Per-tool gating.** When one sub-agent in a hierarchy is noisier than the rest, gate just that one with `agent.as_tool(verify=judge)` instead of taxing the whole pipeline.
- **Targeted Plan steps.** When one step in a `Plan` is the quality-critical one (the summary, the final draft), wrap just its agent with `verify=` — leave the rest unchecked.

## When NOT to use it

- **Hard policy violations.** PII leakage, content-safety failures, schema violations — those need [Guards](https://core.lazybridge.com/guides/mid/guards/index.md). `verify=` retries; a guard refuses and ends the run.
- **CI / batch grading.** When you want to test "does this agent generally produce good output?" offline, use [Evals](https://core.lazybridge.com/guides/mid/evals/index.md) instead. `verify=` runs every single time the agent is invoked.
- **As a structured-output validator.** That's what `output=` does (with `max_output_retries=`). The framework already re-prompts on Pydantic validation errors; you don't need `verify=` on top.
- **Multi-criteria judges.** A judge that grades fluency, accuracy, and tone simultaneously produces vague feedback. Either run two separate `verify=` loops (expensive) or pick one criterion and accept the rest will need a different mechanism.

## Example

```python
from lazybridge import Agent, LLMEngine, Plan, Step


judge = Agent(
    engine=LLMEngine(
        "claude-haiku-4-5-20251001",   # cheap-tier judge
        system='Respond "approved" or "rejected: <short reason>".',
    ),
    name="judge",
)


# 1) Agent-level — final output gated, up to 2 attempts.
writer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    verify=judge,
    max_verify=2,
    name="writer",
)
result = writer("write a haiku about bees")
print(result.text())


# 2) Tool-level (Option B) — every call of synthesizer is gated;
#    the orchestrator's other tools run unchecked.
synthesizer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="synthesizer",
)
orchestrator = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    tools=[
        synthesizer.as_tool(
            name="synth",
            verify=judge,
            max_verify=2,
        ),
    ],
)


# 3) Plan-level — only the summarise step is gated.
fetcher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="fetch",
)
publisher = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    name="publish",
)
summariser = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    verify=judge,
    max_verify=2,
    name="summarise",
)

plan = Agent(
    engine=Plan(
        Step(target=fetcher,    name=fetcher.name),
        Step(target=summariser, name=summariser.name),
        Step(target=publisher,  name=publisher.name),
    ),
)


# 4) Callable judge — boolean verdict, no feedback loop.
def at_least_three_lines(output: str) -> bool:
    return output.count("\n") >= 2

short_writer = Agent(
    engine=LLMEngine("claude-haiku-4-5"),
    verify=at_least_three_lines,
    max_verify=2,
)
```

## Pitfalls

- **A strict judge + small `max_verify` silently returns poor output.** After the cap is hit, the last attempt is returned even if still rejected. Log the retry feedback during development so you know when you're hitting the cap; consider raising `max_verify` or relaxing the policy.
- **Callable judges returning booleans don't produce feedback.** Retries reuse the same task. Return a string verdict (`"approved"` or `"rejected: <reason>"`) if you want the feedback loop. A `bool`-returning callable is acceptable when the failure mode is binary and "try again" alone is enough.
- **Nested verify is allowed but expensive.** Agent-level + tool-level + Plan-level on the same path stacks the loops. Pick one per agent unless you're intentionally building defence in depth.
- **Keep judges cheap and specific.** Use a smaller / faster model. One criterion per judge — multi-criteria judges conflate failure modes and produce vague feedback. Two single-criterion judges chained at different levels often produce better results than one multi-criterion judge at one level.
- **`verify=` operates on `Envelope.text()`.** When the agent has structured output (`output=Model`), the judge sees the JSON serialisation, not the model instance. Phrase the judge's policy accordingly, or write a callable judge that does `json.loads(...)` first.
- **`max_verify=1` disables the retry but keeps the gate.** The agent runs once, the judge grades, and the result is returned whatever the verdict. Effectively a non-blocking quality warning. Use a real `max_verify=2` or `3` if you actually want the retry semantics.
- **Cost rollup.** Each retry is a full agent run plus a judge run. With a `Session=`, the cost rollup includes every attempt; check `session.usage_summary()` for the true cost of a verify-heavy workflow.

## See also

- [Guards](https://core.lazybridge.com/guides/mid/guards/index.md) — the hard counterpart: blocks on policy failure with no retry. Pair them: hard guards for "must never pass", `verify=` for "should usually be good".
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — the surface for tool-level `verify=` placement.
- [Evals](https://core.lazybridge.com/guides/mid/evals/index.md) — offline / CI grading; `verify=` is the runtime sibling.
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `verify=` is a first-class `Agent` kwarg alongside `guard=`, `output=`, and `tools=`.
# Recipes

# Recipes

Each recipe is a runnable example from the `examples/` directory in the repository, embedded verbatim and walked through. Run any of them directly with `python examples/<file>.py` once you have the relevant provider key in your environment.

The recipes are roughly ordered by the [progressive complexity ladder](https://core.lazybridge.com/concepts/progressive-complexity/index.md): single agent → tools → composition → planning → production.

## Single agent + tools

- [React agent](https://core.lazybridge.com/recipes/react-agent/index.md) — `examples/langgraph/01_react_agent_weather.py`
- [Researcher (single agent)](https://core.lazybridge.com/recipes/researcher-single/index.md) — `examples/crewai/01_research_crew_single_agent.py`

## Sequential composition

- [Researcher → reporter](https://core.lazybridge.com/recipes/researcher-reporter/index.md) — `examples/crewai/02_research_and_report.py`

## Hierarchical / supervisor

- [Supervisor pattern](https://core.lazybridge.com/recipes/supervisor-pattern/index.md) — `examples/langgraph/02_supervisor_research_math.py`

## Planning patterns

- [Plan tool](https://core.lazybridge.com/recipes/plan-tool/index.md) — `examples/patterns/plan_tool.py`
- [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md) — `examples/patterns/agent_builds_plan.py`
- [Blackboard planner](https://core.lazybridge.com/recipes/blackboard-planner/index.md) — `examples/patterns/blackboard_planner.py`
- [Dynamic re-planning](https://core.lazybridge.com/recipes/dynamic-replanning/index.md) — `examples/patterns/dynamic_planner.py`

## Composition shapes

- [Nested pipelines (horizontal)](https://core.lazybridge.com/guides/full/composition-patterns/index.md) — Plan-of-Plans, parallel bands of sub-pipelines, and LLM-decided dispatch over sub-pipelines. Companion to the vertical `chain` / `Plan` recipes above.

## Observability

- [Live visualization](https://core.lazybridge.com/recipes/live-visualization/index.md) — `examples/viz_demo.py`
- [Visualization mock](https://core.lazybridge.com/recipes/visualization-mock/index.md) — `examples/viz_mock_demo.py`

# Agent builds a plan

A planning agent emits a structured `PlanSpec` (Pydantic) that's materialised into a live `Plan` with typed steps. Demonstrates the "Plan as data" pattern — the LLM decides the topology, the runtime validates it at construction (compile-time DAG checks), then executes deterministically.

Supports a one-shot mode and a re-planning loop.

## Source

```python
"""Planner agent emits a typed PlanSpec; we materialise it into a real Plan.

Why this is the cleanest "planner builds a plan" pattern in LazyBridge
---------------------------------------------------------------------
- The planner returns a ``PlanSpec`` (Pydantic). No free-form text to parse.
- ``materialize()`` turns the spec into a real ``Plan(Step(...), ...)`` with
  live agents bound to step targets. ``PlanCompiler`` validates the DAG when
  the Plan is wrapped by ``Agent(engine=plan)`` — broken DAGs (forward
  ``from_step`` references, unknown step names, duplicates) surface as
  ``PlanCompileError`` *before* any LLM call.
- ``Step(parallel=True)`` runs siblings concurrently. No asyncio loop in user
  code: ``Plan`` does the dispatch.
- Re-planning is opt-in (``solve(replan=True)``): after the Plan finishes, the
  planner sees the result and either declares done or builds a new Plan.

When to prefer the "asyncio loop" pattern in ``dynamic_planner.py`` instead:
when you need to feed *partial* results back mid-round (the planner inspects
results before the round finishes). Plan-based execution is round-atomic.
"""

from typing import Literal

from pydantic import BaseModel, Field

from lazybridge import Agent, LLMEngine, Plan, Step, from_parallel, from_prev, from_step

# ---------------------------------------------------------------------------
# 1. Sub-agents (the registry the planner is allowed to pick from)
# ---------------------------------------------------------------------------


def web_search(query: str) -> str:
    """Look up current facts (stub)."""
    return f"[stub web result for {query!r}]"


def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b


def multiply(a: float, b: float) -> float:
    """Multiply two numbers."""
    return a * b


research_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini", system="Look up facts via web_search."),
    tools=[web_search],
    name="research",
)
math_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini", system="Solve arithmetic with add/multiply."),
    tools=[add, multiply],
    name="math",
)
writer_agent = Agent(
    engine=LLMEngine("gpt-5.4-mini", system="Synthesise prior results into prose."),
    name="writer",
)

REGISTRY: dict[str, Agent] = {
    "research": research_agent,
    "math": math_agent,
    "writer": writer_agent,
}


# ---------------------------------------------------------------------------
# 2. PlanSpec — what the planner emits (structured output)
# ---------------------------------------------------------------------------


class StepSpec(BaseModel):
    name: str = Field(..., description="Unique step name; used by from_step references.")
    agent: Literal["research", "math", "writer"] = Field(..., description="Which sub-agent runs this step.")
    task_kind: Literal["literal", "from_prev", "from_step", "from_parallel"] = Field(
        default="from_prev",
        description=(
            "How this step receives its task: literal=use task_text, "
            "from_prev=output of previous step, from_step=output of named step, "
            "from_parallel=list of envelopes from a parallel sibling group."
        ),
    )
    task_text: str | None = Field(default=None, description="Required when task_kind='literal'.")
    task_step: str | None = Field(
        default=None,
        description="Required when task_kind='from_step' or 'from_parallel'.",
    )
    parallel: bool = Field(default=False, description="If True, run concurrently with adjacent parallel siblings.")


class PlanSpec(BaseModel):
    reasoning: str = Field(..., description="Why this DAG was chosen.")
    steps: list[StepSpec] = Field(default_factory=list, description="DAG steps in order.")
    done: bool = Field(default=False, description="Set True with no steps to short-circuit.")
    final_answer: str | None = Field(default=None, description="If done=True, the user-facing answer.")


PLANNER_SYSTEM = f"""\
You are a planner that produces a small DAG of work.

Available sub-agents (pick exactly one per step):
- research : web lookups. Cannot do math.
- math     : arithmetic only.
- writer   : synthesise prior results into prose. Adds no new facts.

You emit a PlanSpec with ordered steps. Each step has:
  - name      : unique short identifier (snake_case)
  - agent     : one of {sorted(REGISTRY)}
  - task_kind : 'literal' (provide task_text), 'from_prev' (default — uses
                previous step's output as task), 'from_step' (uses named
                step's output; provide task_step), 'from_parallel' (gather
                list of envelopes from a parallel sibling group)
  - parallel  : True to run concurrently with adjacent parallel siblings

Rules:
1. The first step must use task_kind='literal' (no previous step exists).
2. To fan out, mark several adjacent steps parallel=True; their join step
   should use task_kind='from_parallel' and task_step=<name of first parallel sibling>.
3. If the question can be answered without any work, set done=true and put
   the answer in final_answer (leave steps empty).
4. End the DAG with a 'writer' step when prose is required.
"""


planner = Agent(
    engine=LLMEngine("gpt-5.4-mini", system=PLANNER_SYSTEM),
    output=PlanSpec,
    name="planner",
)


# ---------------------------------------------------------------------------
# 3. Materialise a PlanSpec into a real Plan (compile-time validation)
# ---------------------------------------------------------------------------


def materialize(spec: PlanSpec) -> Plan:
    steps: list[Step] = []
    for s in spec.steps:
        if s.task_kind == "literal":
            if not s.task_text:
                raise ValueError(f"step {s.name!r}: task_kind='literal' needs task_text")
            task = s.task_text
        elif s.task_kind == "from_step":
            if not s.task_step:
                raise ValueError(f"step {s.name!r}: task_kind='from_step' needs task_step")
            task = from_step(s.task_step)
        elif s.task_kind == "from_parallel":
            if not s.task_step:
                raise ValueError(f"step {s.name!r}: task_kind='from_parallel' needs task_step")
            task = from_parallel(s.task_step)
        else:  # from_prev
            task = from_prev

        steps.append(
            Step(
                target=REGISTRY[s.agent],
                name=s.name,
                task=task,
                parallel=s.parallel,
            )
        )
    # Plan() runs PlanCompiler — bad references raise PlanCompileError here.
    return Plan(*steps, max_iterations=max(20, len(steps) * 3))


# ---------------------------------------------------------------------------
# 4. solve() — one-shot or replan
# ---------------------------------------------------------------------------


def _format_with_history(query: str, history: list[str]) -> str:
    if not history:
        return query
    rounds = "\n\n".join(f"--- prior round {i + 1} ---\n{h}" for i, h in enumerate(history))
    return f"User query: {query}\n\nPrior plan results:\n{rounds}\n\nDecide next plan or set done=true."


def solve(query: str, *, replan: bool = False, max_rounds: int = 5) -> str:
    """Execute one Plan. If replan=True, loop until the planner sets done=true."""
    history: list[str] = []
    for round_num in range(1, max_rounds + 1):
        env = planner(_format_with_history(query, history))
        spec: PlanSpec | None = env.payload
        if spec is None:
            # Typical when ``ANTHROPIC_API_KEY`` is missing — the
            # request errored and the typed payload is None.
            err = env.error or RuntimeError("planner returned no payload")
            return f"planner unavailable: {type(err).__name__}: {err}"
        print(f"\n=== plan round {round_num} ===")
        print("reasoning:", spec.reasoning)

        if spec.done:
            print("planner: DONE")
            return spec.final_answer or (history[-1] if history else "")

        if not spec.steps:
            return spec.final_answer or "planner emitted empty plan; aborting"

        plan = materialize(spec)  # ← compile-time validated
        print(f"materialised plan with {len(spec.steps)} step(s); running…")

        result = Agent(engine=plan)(query).text()
        history.append(result)

        if not replan:
            return result

    return f"max_rounds reached; partial: {history[-1] if history else ''}"


# ---------------------------------------------------------------------------
# 5. Entry point
# ---------------------------------------------------------------------------


def main() -> None:
    query = (
        "Get the headcounts of Apple, Google, and Meta in 2024 (you can look "
        "these up in parallel). Then compute the total. Finally, write a "
        "short paragraph commenting on the total."
    )
    answer = solve(query, replan=False)  # set replan=True for adaptive multi-round
    print("\n=== FINAL ANSWER ===\n" + answer)


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`PlanSpec` is a Pydantic schema** — the planning agent emits a JSON-serialisable spec (steps, sentinels, parallel flags) and a builder function compiles it into a real `Plan`. `PlanCompiler` catches dangling references / unknown targets at construction, before any LLM call runs.
- **Re-planning loop**: when `replan=True`, the planner sees the results of the previous round and emits a new `PlanSpec`. Bound the loop with a counter or a "done" predicate — there's no built-in safety net beyond `Plan(max_iterations=...)`.
- **Sentinels in the spec** (`from_prev`, `from_step("name")`, `from_parallel("name")`) translate from JSON strings back into the Python sentinel objects.

## Variations

- Drop the re-planning loop and run the materialised plan once — cheaper, deterministic.
- Persist the `PlanSpec` (`json.dumps(spec.model_dump())`) for audit / replay. See [Plan serialization](https://core.lazybridge.com/guides/advanced/plan-serialize/index.md) for the runtime side.
- Constrain the planner's choice of targets via the schema (`Literal[...]` field for step `target`) so the LLM can't fabricate non-existent agents.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the runtime side.
- [Plan serialization](https://core.lazybridge.com/guides/advanced/plan-serialize/index.md) — for persisting the spec across processes.
- [Dynamic re-planning](https://core.lazybridge.com/recipes/dynamic-replanning/index.md) — sibling pattern using a flat round-of-tasks shape.
- [Plan tool](https://core.lazybridge.com/recipes/plan-tool/index.md) — the pre-built factory alternative.

# Blackboard planner

`make_blackboard_planner` returns an agent whose state is a flat to-do list (`set_plan` / `get_plan` / `mark_done` tools). Simpler than a DAG for exploratory work where the structure isn't known up front and the LLM iterates on a checklist.

> **Canonical name**: `make_blackboard_planner` is a backward-compat alias for `blackboard_orchestrator_agent` (in `lazybridge.ext.planners`). New code can use either; both resolve to the same factory.

## Source

```python
"""Demo: ``lazybridge.ext.planners.make_blackboard_planner`` with three sub-agents.

The factory itself lives in :mod:`lazybridge.ext.planners.blackboard` — this
file just shows a minimal usage pattern. The blackboard planner manages
a flat to-do list (``set_plan`` / ``get_plan`` / ``mark_done``) instead
of composing a DAG; less precise but easier to prompt for exploratory work.
"""

from lazybridge import Agent, LLMEngine
from lazybridge.ext.planners import make_blackboard_planner


def web_search(query: str) -> str:
    """Look up current facts (stub — wire to a real search API)."""
    return f"[stub web result for {query!r}]"


def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b


def main() -> None:
    research = Agent(
        engine=LLMEngine("gemini-3-flash-preview", system="Look up facts via web_search."),
        tools=[web_search],
        name="research",
        description="Web lookups. No math.",
    )
    math = Agent(
        engine=LLMEngine("gemini-3-flash-preview", system="Solve arithmetic with add."),
        tools=[add],
        name="math",
        description="Arithmetic only.",
    )
    writer = Agent(
        engine=LLMEngine("gemini-3-flash-preview", system="Synthesise prior results into prose."),
        name="writer",
        description="Turns prior results into a short paragraph.",
    )

    planner = make_blackboard_planner([research, math, writer], verbose=True)

    queries = [
        "What does FAANG stand for?",
        "What is 17 * 23 + 5?",
        "Research recent agent frameworks and write a one-paragraph summary.",
    ]
    for q in queries:
        print(f"\n>>> {q}")
        print(planner(q).text())


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`make_blackboard_planner()`** wires up an `Agent` with three tools (`set_plan`, `get_plan`, `mark_done`) and a `Store`-backed scratch space. The LLM treats the to-do list as state it can read and update.
- **No DAG validation** — the LLM is free to revise the plan, reorder tasks, or insert new ones at any point. This is what makes it "exploratory" relative to the typed `PlanSpec` pattern in [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md).
- **Pair with `Memory`** to give the planner a running record of what's been tried, why a previous attempt failed, what the current state of the world looks like — useful for tasks that span many iterations.

## Variations

- Persist the blackboard via `Store(db="planner.sqlite")` so a long-running task can be paused and resumed. The to-do list survives across runs.
- Add a `verify=judge` to gate `mark_done` — useful when the planner tends to over-claim completion.
- Wrap as a sub-agent (`tools=[planner]`) of a higher-level orchestrator that dispatches between blackboard planning and other strategies.

## See also

- [Plan tool](https://core.lazybridge.com/recipes/plan-tool/index.md) — sibling factory; structured decision tree instead of a flat list.
- [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md) — typed `PlanSpec` alternative when the structure IS known.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — backs the blackboard state.

# Dynamic re-planning

A planner that emits one *round* of independent tasks at a time, runs them in parallel, sees the results, then emits the next round. Adaptive: each round is informed by the previous one. The LangGraph-equivalent shape is "ReAct-on-tasks", but the planning unit is a batch of tasks rather than a single tool call.

## Source

```python
"""Dynamic planner with re-planning, parallel execution, and checkpoint/resume.

Pattern
-------
A planner agent reasons about the user's query and emits a *round* of
independent tasks (run in parallel). After each round it sees the results
and either emits another round or declares the work done. This is "ReAct
on tasks" — the planner re-plans every round, adapting to intermediate
findings instead of committing to a fixed task list up front.

Why not :class:`lazybridge.Plan`?
    ``Plan`` is compiled at construction time — its DAG is fixed. This file
    targets the case where the *shape* of the work depends on the query and
    on intermediate results.

Why :class:`lazybridge.ReplanEngine` instead of a raw Python loop?
    ``ReplanEngine`` is the guardian: it checkpoints after every round so a
    restart continues from the correct round rather than re-executing completed
    work.  Pass ``store=`` and ``checkpoint_key=`` to enable persistence.

Architecture (LazyBridge "everything is a tool")
-------------------------------------------------
The planner and all workers are tools in the guardian Agent's tool_map:

    guardian.tools = [planner, research_agent, math_agent, writer_agent]
                       ↑ (output=PlanRound)   ↑ workers dispatched by ReplanEngine

The planner receives the available tool schemas + history dynamically — its
system prompt does not need to hardcode worker names.
"""

from __future__ import annotations

from lazybridge import Agent, LLMEngine, ReplanEngine
from lazybridge.engines.replan import PlanRound

# ---------------------------------------------------------------------------
# 1. Sub-agents — each owns its own tool set / system prompt
# ---------------------------------------------------------------------------


def web_search(query: str) -> str:
    """Look up current facts on the web (stub)."""
    return f"[stub web result for {query!r}]"


def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b


def multiply(a: float, b: float) -> float:
    """Multiply two numbers."""
    return a * b


research_agent = Agent(
    engine=LLMEngine(
        "deepseek-v4-flash",
        system="You look up current facts via web_search. You do not do math.",
    ),
    tools=[web_search],
    name="research",
    description="Web lookups, facts, news. Cannot do math.",
)

math_agent = Agent(
    engine=LLMEngine(
        "deepseek-v4-flash",
        system="You solve arithmetic with the add/multiply tools. One tool at a time.",
    ),
    tools=[add, multiply],
    name="math",
    description="Arithmetic only.",
)

writer_agent = Agent(
    engine=LLMEngine(
        "deepseek-v4-flash",
        system="You synthesise prior results into clear prose. No new facts.",
    ),
    name="writer",
    description="Synthesise prior results into prose. Adds no new facts.",
)


# ---------------------------------------------------------------------------
# 2. Planner — emits PlanRound each turn
#
# The system prompt is minimal: ReplanEngine injects the available tool
# schemas and the accumulated history into every planner call dynamically.
# ---------------------------------------------------------------------------

PLANNER_SYSTEM = """\
You are a task planner. Each turn you receive:
  - "Available tools:" — the tool names, signatures, and descriptions
  - "Task:" — the original user query
  - "History:" — outputs from prior rounds

Produce ONE PlanRound. Rules:
1. Tasks within a round run IN PARALLEL — put dependent tasks in the next round.
2. Use tool names and kwargs exactly as listed in "Available tools".
3. When the question is answered, set done=true and put the answer in final_answer.
4. Be greedy with parallelism: independent lookups belong in the same round.
"""

planner = Agent(
    engine=LLMEngine("deepseek-v4-flash", system=PLANNER_SYSTEM),
    output=PlanRound,
    name="planner",
)


# ---------------------------------------------------------------------------
# 3. Guardian — ReplanEngine wraps the replan loop with checkpoint/resume
# ---------------------------------------------------------------------------

guardian = Agent(
    engine=ReplanEngine(max_rounds=10),  # add store= + checkpoint_key= for persistence
    tools=[planner, research_agent, math_agent, writer_agent],
    name="guardian",
)


# ---------------------------------------------------------------------------
# 4. Entry point
# ---------------------------------------------------------------------------


def main() -> None:
    query = (
        "What is the combined headcount of Apple and Google in 2024, and "
        "write a one-paragraph note on what those numbers say about the "
        "two companies' staffing strategies?"
    )
    env = guardian(query)
    if env.error:
        print(f"ERROR: {env.error.message}")
    else:
        print("\n=== FINAL ANSWER ===\n" + (env.text() or ""))


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`PlanRound` is a Pydantic schema** — the planner emits a list of tasks for the next round plus a `done: bool` flag. The outer loop dispatches all tasks concurrently via `asyncio.gather`, collects results, feeds them back to the planner.
- **`max_rounds`** is the safety net for bad termination logic — if `done=False` keeps firing forever, the loop bails. Set it defensively.
- **Per-task sequential flag** lets a round mix parallel and sequential tasks: the planner can declare that a specific task must wait for the others to finish before running.

## Variations

- Add a `verify=judge` on the planner agent itself to gate termination — the judge sees the latest round's results and decides whether `done=True` is justified.
- Persist round results to a `Store` so a debugger / dashboard can watch progress in real time.
- Replace the manual `asyncio.gather` with `Plan` parallel bands (`Step(parallel=True)`) if the round structure is fixed enough to declare up-front.

## Variations — anti-patterns

- **Pathological case**: planner emits `done=False` and an empty task list — the loop spins. The example file's source comment (line 196 in the upstream version) documents this; mitigate by adding a "no-tasks → final answer" branch.

## See also

- [ReplanEngine](https://core.lazybridge.com/guides/full/replan-engine/index.md) — the engine that wraps this loop with structured rounds and checkpoint/resume, so you don't hand-roll the `asyncio.gather` orchestration.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — declared alternative when the structure is known up front.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — `Agent.parallel` is the application-layer fan-out used inside each round.
- [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md) — typed-spec alternative when the topology is decided once rather than re-emitted every round.

# Live visualization

A 3-agent chain (researcher → analyst → writer) wrapped in a `Visualizer` context manager. The browser tab opens automatically; pulses travel along graph edges as agents call tools, the inspector lets you click into every event, the store viewer highlights writes as they happen.

The same UI replays a finished session via `Visualizer.replay(db="...").open()`.

## Source

```python
"""Pipeline Visualizer demo — a small chain of three agents that calls
two tools, all wrapped in a live :class:`Visualizer` so you can watch
the data flow in your browser.

Run::

    python examples/viz_demo.py

The browser tab opens automatically on a local URL. To replay the
recorded run later::

    python -c "from lazybridge.ext.viz import Visualizer; \\
               Visualizer.replay('examples/viz_demo.db').open()"
"""

from __future__ import annotations

import time

from lazybridge import Agent, LLMEngine, Session, Tool
from lazybridge.ext.viz import Visualizer

DB = "examples/viz_demo.db"


def search(query: str) -> str:
    """Stub web search tool — returns a fake result with a short delay."""
    time.sleep(0.4)
    return f"[search] results for '{query}': 5 hits, top is example.com"


def summarise(text: str) -> str:
    """Stub summariser tool that pretends to compress text."""
    time.sleep(0.3)
    return f"[summary] {text[:120]}..."


def main() -> None:
    sess = Session(db=DB, console=False)

    researcher = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Find facts. Cite sources."),
        tools=[Tool.wrap(search, name="search")],
        name="researcher",
        session=sess,
    )
    analyst = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Summarise findings."),
        tools=[Tool.wrap(summarise, name="summarise")],
        name="analyst",
        session=sess,
    )
    writer = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Write a short brief."),
        name="writer",
        session=sess,
    )
    pipeline = Agent.chain(researcher, analyst, writer)

    with Visualizer(sess) as viz:
        print(f"[viz] open -> {viz.url}")
        print("[viz] running pipeline…")
        envelope = pipeline("Brief me on the state of fusion energy in 2026.")
        print("[viz] pipeline done")
        print("--- result ---")
        print(envelope.text())
        print("---")
        print("[viz] press Ctrl+C to stop the server")
        try:
            while True:
                time.sleep(3600)
        except KeyboardInterrupt:
            pass


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`with Visualizer(sess) as viz`** — context-manager pattern is what the recipe shows. The HTTP server starts on `__enter__`, shuts down on `__exit__`. The browser tab survives the boundary; close it yourself.
- **`session=sess` on every agent** — the live `GraphSchema` is populated from the agents that register with the session. An agent without `session=sess` is invisible in the topology even if it runs.
- **`viz.url`** is the bound URL (`127.0.0.1:<ephemeral>` by default). Print it before the pipeline runs so you can switch to the browser before tools start firing.

## Variations

- For a fixed port (e.g. to share with a teammate over a tunnel), pass `port=8765` to `Visualizer(sess, port=8765)`.
- For headless / CI runs that should NOT open a browser, pass `auto_open=False`.
- For a recorded run replayed at half speed: `Visualizer.replay(db="demo.db", speed=0.5).open()`.

## See also

- [Visualizer](https://core.lazybridge.com/guides/advanced/visualizer/index.md) — the deep reference for live + replay modes, control endpoints, and custom-UI hooks.
- [Visualization mock](https://core.lazybridge.com/recipes/visualization-mock/index.md) — the same UI driven by a synthetic event stream (no LLM calls).
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — the bus the Visualizer reads from; covers `db=`, `batched=`, exporters.

# Plan tool

`make_planner` is a pre-built factory that wires up an LLM-driven planner: it picks between a direct sub-agent call or a multi-step plan automatically, based on the input. Use it as the orchestrator when you want planning logic without writing it from scratch.

> **Canonical name**: `make_planner` is a backward-compat alias for `orchestrator_agent` (in `lazybridge.ext.planners`). New code can use either; the alias signals "this orchestrates sub-agents", the canonical name distinguishes from `lazybridge.Plan` (the *static* DAG engine) since both involve "planning".

## Source

```python
"""Demo: ``lazybridge.ext.planners.make_planner`` with three sub-agents.

The factory itself lives in :mod:`lazybridge.ext.planners.builder` — this file
just shows a minimal usage pattern. Run it with provider credentials in
the environment to see the planner pick between direct sub-agent calls
and ``execute_plan`` for multi-step work.
"""

from lazybridge import Agent, LLMEngine
from lazybridge.ext.planners import make_planner


def web_search(query: str) -> str:
    """Look up current facts (stub — wire to a real search API)."""
    return f"[stub web result for {query!r}]"


def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b


def multiply(a: float, b: float) -> float:
    """Multiply two numbers."""
    return a * b


def main() -> None:
    research = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Look up facts via web_search."),
        tools=[web_search],
        name="research",
        description="Web lookups for current facts. No math.",
    )
    math = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Solve arithmetic with add/multiply."),
        tools=[add, multiply],
        name="math",
        description="Arithmetic (add, multiply). No facts.",
    )
    writer = Agent(
        engine=LLMEngine("claude-haiku-4-5", system="Synthesise prior results into prose."),
        name="writer",
        description="Turns prior results into a short paragraph.",
    )

    planner = make_planner([research, math, writer], verbose=True)

    queries = [
        "What does FAANG stand for?",  # trivial
        "What is 17 * 23 + 5?",  # one agent
        "Research quantum networking and write a one-paragraph brief.",  # multi-step plan
        "Look up the FAANG headcounts in parallel and write a summary.",  # parallel band + N-branch synth
    ]
    for q in queries:
        print(f"\n>>> {q}")
        print(planner(q).text())


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`make_planner([research, math, writer])`** returns an `Agent`. The factory under `lazybridge.ext.planners.builder` constructs an inner `Plan` + dispatch logic; you supply the specialists.
- **Specialist `description=`** drives the planner's choice of which agent to call; precise descriptions matter as much as for the supervisor pattern.
- **Four query styles** exercise the planner's decision tree: trivial (no agent needed), single-agent (one specialist), multi-step plan (chained calls), parallel + synth (fan-out then combine).

## Variations

- Use `make_blackboard_planner` (see [Blackboard planner](https://core.lazybridge.com/recipes/blackboard-planner/index.md)) for a flat to-do list shape instead of a DAG.
- For full control over the planning shape, build the `Plan` yourself — see [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md) and [Dynamic re-planning](https://core.lazybridge.com/recipes/dynamic-replanning/index.md).
- The factory accepts `verbose=True` to surface planner decisions on stdout; pair with a `Session` for structured event logs.

## See also

- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the engine the factory wraps.
- [Blackboard planner](https://core.lazybridge.com/recipes/blackboard-planner/index.md) — sibling factory for flat task lists.
- [Agent builds a plan](https://core.lazybridge.com/recipes/agent-builds-plan/index.md) — the manual alternative when you want to write the planning logic yourself.

# React agent

A single agent that uses one tool. The agent's `LLMEngine` runs a ReAct loop natively: read the task, decide whether to call a tool, observe the result, decide again, return the final answer.

This is the LazyBridge equivalent of LangGraph's `create_react_agent` — no graph DSL, no `@tool` decorator. Tools are plain functions; the schema comes from type hints + docstring.

## Source

```python
"""LangGraph `create_react_agent` weather example, ported to LazyBridge.

Original (LangGraph):

    from langgraph.prebuilt import create_react_agent

    def check_weather(location: str) -> str:
        '''Return the weather forecast for the specified location.'''
        return f"It's always sunny in {location}"

    graph = create_react_agent(
        "anthropic:claude-3-7-sonnet-latest",
        tools=[check_weather],
        prompt="You are a helpful assistant",
    )
    inputs = {"messages": [{"role": "user", "content": "what is the weather in sf"}]}
    for chunk in graph.stream(inputs, stream_mode="updates"):
        print(chunk)

LazyBridge equivalent: a plain ``Agent`` is already a ReAct loop. Tools are
passed by reference — schemas are derived from type hints + docstring, no
``@tool`` decorator. ``verbose=True`` prints turn-by-turn updates to stdout,
which is the equivalent of LangGraph's ``graph.stream(stream_mode="updates")``.
"""

from lazybridge import Agent, LLMEngine


def check_weather(location: str) -> str:
    """Return the weather forecast for ``location``."""
    return f"It's always sunny in {location}"


def main() -> None:
    agent = Agent(
        engine=LLMEngine("gpt-5.4-mini", system="You are a helpful assistant"),
        tools=[check_weather],
        verbose=True,
    )
    result = agent("what is the weather in sf")
    print("\nFinal answer:", result.text())


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`LLMEngine("claude-opus-4-7", system=...)`** is the engine. The `system` prompt sets the persona; everything else (tool dispatch, observation, retry on bad tool call) happens inside the engine's loop.
- **`tools=[check_weather]`** — the function is passed by reference; LazyBridge auto-wraps it as a `Tool` and infers the JSON schema from the type hints and docstring. No `@tool` decorator.
- **`verbose=True`** prints turn-by-turn updates to stdout — the equivalent of LangGraph's `graph.stream(stream_mode="updates")`.
- **`agent("what is the weather in sf")`** runs the loop synchronously and returns an `Envelope`. `.text()` extracts the final answer.

## Variations

- Add more tools by extending `tools=[...]` — the engine emits parallel tool calls automatically when the model asks for several in one turn.
- Swap the model for a different provider (`gpt-4o`, `gemini-3-flash-preview`) with no other changes; LazyBridge infers the provider from the model string.
- For typed structured output, pass `output=PydanticModel` — the payload becomes a model instance and the framework re-prompts on validation errors.

## See also

- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — the constructor surface this recipe uses.
- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — how plain functions become tools, and when to construct `Tool(...)` explicitly.
- [Researcher → reporter](https://core.lazybridge.com/recipes/researcher-reporter/index.md) — the next rung: two agents in sequence.

# Researcher → reporter

Two agents in sequence: a researcher gathers facts, a reporter turns them into a markdown brief. The handoff is text — the researcher's output `Envelope.text()` becomes the reporter's task. The LazyBridge equivalent of CrewAI's `Process.sequential` crew, without `@CrewBase`, decorators, or YAML.

## Source

````python
"""CrewAI canonical two-agent crew (researcher + reporting_analyst) — ported.

Original (CrewAI README):

    # agents.yaml
    researcher:
      role: "{topic} Senior Data Researcher"
      goal: "Uncover cutting-edge developments in {topic}"
      backstory: "You're a seasoned researcher..."
    reporting_analyst:
      role: "{topic} Reporting Analyst"
      goal: "Create detailed reports based on {topic} data analysis..."
      backstory: "You're a meticulous analyst..."

    # tasks.yaml
    research_task:
      description: "Conduct a thorough research about {topic}..."
      expected_output: "A list with 10 bullet points..."
      agent: researcher
    reporting_task:
      description: "Review the context you got and expand each topic..."
      expected_output: "A fully fledge reports... Formatted as markdown."
      agent: reporting_analyst
      output_file: report.md

    # crew.py
    @CrewBase
    class LatestAiDevelopmentCrew:
        @agent
        def researcher(self) -> Agent:
            return Agent(config=self.agents_config["researcher"],
                         verbose=True, tools=[SerperDevTool()])
        @agent
        def reporting_analyst(self) -> Agent:
            return Agent(config=self.agents_config["reporting_analyst"],
                         verbose=True)
        @task
        def research_task(self) -> Task: ...
        @task
        def reporting_task(self) -> Task: ...
        @crew
        def crew(self) -> Crew:
            return Crew(agents=self.agents, tasks=self.tasks,
                        process=Process.sequential, verbose=True)

    # main.py
    LatestAiDevelopmentCrew().crew().kickoff(inputs={"topic": "AI Agents"})

LazyBridge equivalent: ``Agent.chain(researcher, reporter)`` runs them
sequentially and feeds the researcher's output as the reporter's task —
exactly what CrewAI's ``Process.sequential`` does, minus the YAML, the
decorators, and the @CrewBase metaclass.
"""

from __future__ import annotations

from pathlib import Path

from lazybridge import Agent, LLMEngine

OUTPUT_FILE = Path("report.md")


def serper_search(query: str) -> str:
    """Stub web search; swap for Serper / Tavily in production."""
    return f"[stub results for {query!r}]"


def build_researcher(topic: str) -> Agent:
    system = (
        f"# Role\n{topic} Senior Data Researcher\n\n"
        f"# Goal\nUncover cutting-edge developments in {topic}\n\n"
        "# Backstory\nYou're a seasoned researcher with a knack for uncovering "
        f"the latest developments in {topic}. Known for your ability to find "
        "the most relevant information and present it in a clear and concise "
        "manner.\n\n"
        "# Task contract\n"
        f"Conduct a thorough research about {topic}. Make sure you find any "
        "interesting and relevant information given the current year is 2026.\n"
        "Expected output: a list with 10 bullet points of the most relevant "
        f"information about {topic}."
    )
    return Agent(
        engine=LLMEngine("claude-opus-4-7", system=system),
        tools=[serper_search],
        name="researcher",
        verbose=True,
    )


def build_reporting_analyst(topic: str) -> Agent:
    system = (
        f"# Role\n{topic} Reporting Analyst\n\n"
        f"# Goal\nCreate detailed reports based on {topic} data analysis "
        "and research findings.\n\n"
        "# Backstory\nYou're a meticulous analyst with a keen eye for detail. "
        "You're known for your ability to turn complex data into clear and "
        "concise reports, making it easy for others to understand and act on "
        "the information you provide.\n\n"
        "# Task contract\n"
        "Review the context you got and expand each topic into a full section "
        "for a report. Make sure the report is detailed and contains any and "
        "all relevant information.\n"
        "Expected output: a full report with the main topics, each with a full "
        "section of information. Formatted as markdown without ``` fences."
    )
    return Agent(
        engine=LLMEngine("claude-opus-4-7", system=system),
        name="reporting_analyst",
        verbose=True,
    )


def kickoff(topic: str = "AI Agents") -> str:
    crew = Agent.chain(build_researcher(topic), build_reporting_analyst(topic))
    report = crew(f"Topic: {topic}").text()

    OUTPUT_FILE.write_text(report, encoding="utf-8")
    print(f"Report written to {OUTPUT_FILE}")
    return report


if __name__ == "__main__":
    kickoff()
````

## Walkthrough

- **`Agent.chain(researcher, reporter)`** is sugar for the canonical `Agent(engine=Plan(Step(target=researcher, name=researcher.name), Step(target=reporter, name=reporter.name)), name="chain")` form. Use the sugar for purely linear handoffs; reach for the explicit `Plan` form when you need typed handoffs, routing, or checkpoints.
- **`crew(f"Topic: {topic}")`** is the canonical sync call — returns the final agent's `Envelope`. `.text()` extracts the markdown report.
- **Each builder function (`build_researcher` / `build_reporting_analyst`)** is a plain Python factory; the role / goal / backstory go in the `system` prompt instead of an external YAML file.

## Variations

- Replace the chain with a `Plan` for typed Pydantic handoff: `Step(researcher, output=ResearchPayload)` then `Step(writer, context=from_step("researcher"))` — the writer sees the typed payload.
- Add a `verify=judge` to the reporter to gate the final output on policy (length, format, citation presence).
- For a third agent in the pipeline — fact-checker, copy-editor — just append to `Agent.chain(...)` or to the underlying `Plan`.

## See also

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — sugar for sequential pipelines.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the canonical orchestration primitive, for typed handoffs and conditional routing.
- [Researcher (single agent)](https://core.lazybridge.com/recipes/researcher-single/index.md) — the previous rung.
- [Supervisor pattern](https://core.lazybridge.com/recipes/supervisor-pattern/index.md) — the next rung: hierarchical dispatch, where a supervisor decides which agent to call.

# Researcher (single agent)

A single research agent with one stub web-search tool — the LazyBridge equivalent of CrewAI's "single-agent crew" quickstart. Demonstrates that you don't need a multi-agent crew, decorators, or YAML configs to get a researcher up and running.

## Source

```python
"""CrewAI quickstart (single researcher) — ported to LazyBridge.

Original (CrewAI quickstart):

    # config/agents.yaml
    researcher:
      role: "{topic} Senior Data Researcher"
      goal: "Uncover cutting-edge developments in {topic}"
      backstory: "You're a seasoned researcher..."

    # config/tasks.yaml
    research_task:
      description: "Conduct thorough research about {topic}..."
      expected_output: "A markdown report with clear sections..."
      agent: researcher
      output_file: output/report.md

    # content_crew.py
    @CrewBase
    class ResearchCrew:
        agents_config = "config/agents.yaml"
        tasks_config = "config/tasks.yaml"

        @agent
        def researcher(self) -> Agent:
            return Agent(config=self.agents_config["researcher"],
                         verbose=True, tools=[SerperDevTool()])

        @task
        def research_task(self) -> Task: ...

        @crew
        def crew(self) -> Crew:
            return Crew(agents=self.agents, tasks=self.tasks,
                        process=Process.sequential, verbose=True)

    # Plus a Flow class with @start / @listen decorators that calls
    # ResearchCrew().crew().kickoff(inputs={"topic": "AI Agents"}) and writes
    # report.md.

LazyBridge equivalent: no YAML, no decorators, no @CrewBase / @agent / @task
boilerplate, no separate Crew or Flow. The "agent" is an :class:`Agent`,
the "task" is the string passed to it, and the "tool" is just a Python
function. The output file is a one-liner at the end.
"""

from __future__ import annotations

from pathlib import Path

from lazybridge import Agent, LLMEngine

OUTPUT_FILE = Path("output/report.md")


def serper_search(query: str) -> str:
    """Search the web (stub — wire to Serper / Tavily / your search API).

    The CrewAI original uses ``SerperDevTool()``. Replace this body with the
    real call once you have an API key; the rest of the example is unchanged.
    """
    return (
        f"[stub results for {query!r}] "
        "Top sources: latest agent frameworks comparisons, multi-agent "
        "patterns, observability tooling, hosted runtimes."
    )


def build_researcher(topic: str) -> Agent:
    """Mirror of the YAML role/goal/backstory — just three Python strings."""
    role = f"{topic} Senior Data Researcher"
    goal = f"Uncover cutting-edge developments in {topic}"
    backstory = (
        "You're a seasoned researcher with a knack for uncovering the latest "
        f"developments in {topic}. You find the most relevant information and "
        "present it clearly."
    )
    system = f"# Role\n{role}\n\n# Goal\n{goal}\n\n# Backstory\n{backstory}"
    return Agent(
        engine=LLMEngine("claude-opus-4-7", system=system),
        tools=[serper_search],
        name="researcher",
        verbose=True,
    )


def kickoff(topic: str = "AI Agents") -> str:
    researcher = build_researcher(topic)
    task = (
        f"Conduct thorough research about {topic}. Use web search to find "
        "current, credible information. The current year is 2026.\n\n"
        "Expected output: a markdown report with clear sections — key trends, "
        "notable tools or companies, and implications. Aim for 800-1200 words. "
        "Do not wrap the document in fenced code blocks."
    )
    report = researcher(task).text()

    OUTPUT_FILE.parent.mkdir(parents=True, exist_ok=True)
    OUTPUT_FILE.write_text(report, encoding="utf-8")
    print(f"Report written to {OUTPUT_FILE}")
    return report


if __name__ == "__main__":
    kickoff()
```

## Walkthrough

- **No `@CrewBase`, no YAML.** The agent is a plain `Agent(engine= LLMEngine(...), tools=[...])` constructor call. The role / goal / backstory live in the `system` prompt directly.
- **`serper_search`** is a stub function — swap for Serper, Tavily, or any web-search API in production. The signature stays the same because the agent doesn't care about implementation details.
- **`output=Path(...)`-style file write** is plain Python around the `result.text()` call — there's no framework hook for "write the output to a file"; that's just code you put after the agent runs.

## Variations

- Add a structured `output=PydanticModel` to validate the report shape (title, bullets, sources) instead of free text.
- Wrap with `verify=judge_agent` for a judge-and-retry loop on the output (see [verify=](https://core.lazybridge.com/guides/mid/verify/index.md)).
- Move to a two-agent pipeline by chaining a reporter agent — see [Researcher → reporter](https://core.lazybridge.com/recipes/researcher-reporter/index.md).

## See also

- [Researcher → reporter](https://core.lazybridge.com/recipes/researcher-reporter/index.md) — the next rung: add a reporter agent in sequence via `Agent.chain`.
- [Tool](https://core.lazybridge.com/guides/basic/tool/index.md) — schema-from-signature semantics for the search function.
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `Agent(engine=...)` canonical constructor.

# Supervisor pattern

A supervisor agent dispatches to specialist sub-agents (research, math) by including them in `tools=[...]`. There's no separate `create_supervisor` primitive — agents-as-tools is the same mechanism. The supervisor's LLM picks which specialist to call (and may call them sequentially or in parallel — that decision is the model's, not the framework's).

The LazyBridge equivalent of LangGraph's `create_supervisor` recipe.

## Source

```python
"""LangGraph multi-agent supervisor (research + math), ported to LazyBridge.

Original (langgraph-supervisor-py README):

    from langchain_openai import ChatOpenAI
    from langgraph_supervisor import create_supervisor
    from langgraph.prebuilt import create_react_agent

    model = ChatOpenAI(model="gpt-4o")

    def add(a: float, b: float) -> float: ...
    def multiply(a: float, b: float) -> float: ...
    def web_search(query: str) -> str: ...

    math_agent = create_react_agent(
        model=model, tools=[add, multiply], name="math_expert",
        prompt="You are a math expert. Always use one tool at a time.",
    )
    research_agent = create_react_agent(
        model=model, tools=[web_search], name="research_expert",
        prompt="You are a world class researcher with access to web search...",
    )
    workflow = create_supervisor(
        [research_agent, math_agent], model=model,
        prompt="You are a team supervisor managing a research expert and a math expert...",
    )
    app = workflow.compile()
    result = app.invoke({"messages": [{"role": "user",
        "content": "what's the combined headcount of the FAANG companies in 2024?"}]})

LazyBridge equivalent: there's no separate ``create_supervisor`` primitive —
specialist agents are simply tools of the supervisor agent. The supervisor's
LLM picks which specialist to call (and may call them sequentially or in
parallel — that decision is the model's, not the framework's).
"""

from lazybridge import Agent, LLMEngine


def add(a: float, b: float) -> float:
    """Add two numbers."""
    return a + b


def multiply(a: float, b: float) -> float:
    """Multiply two numbers."""
    return a * b


def web_search(query: str) -> str:
    """Search the web for ``query`` (stub matching the original example)."""
    return (
        "Here are the headcounts for each of the FAANG companies in 2024:\n"
        "1. **Facebook (Meta)**: 67,317 employees.\n"
        "2. **Apple**: 164,000 employees.\n"
        "3. **Amazon**: 1,551,000 employees.\n"
        "4. **Netflix**: 14,000 employees.\n"
        "5. **Google (Alphabet)**: 181,269 employees."
    )


def main() -> None:
    math_agent = Agent(
        engine=LLMEngine(
            "gpt-4o",
            system="You are a math expert. Always use one tool at a time.",
        ),
        tools=[add, multiply],
        name="math_expert",
        description="Solves arithmetic problems using add/multiply tools.",
    )

    research_agent = Agent(
        engine=LLMEngine(
            "gpt-4o",
            system=("You are a world class researcher with access to web search. Do not do any math."),
        ),
        tools=[web_search],
        name="research_expert",
        description="Looks up current facts via web_search; never does math.",
    )

    supervisor = Agent(
        engine=LLMEngine(
            "gpt-4o",
            system=(
                "You are a team supervisor managing a research expert and a "
                "math expert. For current events, use research_expert. "
                "For math problems, use math_expert."
            ),
        ),
        tools=[research_agent, math_agent],  # agents-as-tools, no special primitive
        name="supervisor",
        verbose=True,
    )

    result = supervisor("what's the combined headcount of the FAANG companies in 2024?")
    print("\nFinal answer:", result.text())


if __name__ == "__main__":
    main()
```

## Walkthrough

- **`tools=[research_agent, math_agent]`** on the supervisor — agents are passed directly. Each specialist's `name=` becomes the surface tool name the supervisor's LLM sees. No `as_tool()` call required.
- **Per-agent `description=`** is what the supervisor's model reads when deciding which specialist fits the request. Distinct, precise descriptions are the lever — vague ones cause routing mistakes.
- **Each specialist has its own `system` prompt** scoped to its job ("You are a math expert. Always use one tool at a time."). The supervisor's prompt focuses on dispatch, not domain.
- **The model decides parallelism.** When the supervisor's LLM emits two tool calls in one turn, the engine dispatches them concurrently via `asyncio.gather` — no config knob.

## Variations

- Add a `verify=judge` on a specialist via `agent.as_tool(name="research", verify=judge)` to gate every research invocation through a judge-and-retry loop.
- Promote the supervisor to a `Plan` if you want explicit ordering or a parallel band — `tools=[]` is for LLM-decided dispatch; `Plan(Step(...))` is for declared dispatch.
- Cap the supervisor's loop with `LLMEngine(max_turns=N)` if a bad model decision risks unbounded re-dispatch.

## See also

- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — implicit `tools=[agent]` vs explicit `agent.as_tool(...)`, and when each is right.
- [Researcher → reporter](https://core.lazybridge.com/recipes/researcher-reporter/index.md) — the previous rung: fixed-order chain instead of LLM-directed dispatch.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — the alternative when you want declared ordering instead of LLM choice.

# Visualization mock

The same Visualizer UI as [Live visualization](https://core.lazybridge.com/recipes/live-visualization/index.md), but driven by a synthetic event stream — no LLM calls, no provider keys required. Useful for demos, screen recordings, or for exercising the UI without spending tokens.

Demonstrates a complex topology: planner → parallel researchers (each with a nested sub-researcher) → merger → writer, with store writes and explicit graph registrations along the way.

## Source

```python
"""Pipeline Visualizer — mock demo (no LLM calls).

Pipeline topology:

  planner (plan_task, allocate_work)
      ├──► researcher_a (web_search, news_api)  ──┐
      │                                            ├──► merger (combine, deduplicate) ──► writer (format_text)
      └──► researcher_b (database_query)          ──┘
                └── nested: sub_researcher (fact_check, verify_url)

Features demonstrated:
  • Planner agent that produces a structured plan before the pipeline runs
  • Parallel agents (researcher_a and researcher_b share planner context)
  • Nested agent (sub_researcher called as tool by researcher_b)
  • Multiple tools per agent
  • Live store writes (visible in Store tab + Node card)
  • Click START → see pipeline task; click END → see final output
  • Click any node → D&D character sheet with tools, stats, store entries
  • Drag to pin nodes, double-click to unpin
  • Play/step through events (Space / J / K)

Run::

    python examples/viz_mock_demo.py
"""

from __future__ import annotations

import threading
import time
import webbrowser

from lazybridge import Session
from lazybridge.ext.viz.exporter import EventHub, HubExporter
from lazybridge.ext.viz.server import VizServer
from lazybridge.graph.schema import EdgeType
from lazybridge.session import EventType

# ---------------------------------------------------------------------------
# Mock store — plain dict served via /api/store
# ---------------------------------------------------------------------------

_store: dict = {}


def _store_provider() -> dict:
    return dict(_store)


def _write(key: str, value, agent: str, sess: Session, run: str) -> None:
    _store[key] = {"value": value, "agent": agent, "ts": time.time()}
    sess.emit(EventType.STORE_WRITE, {"agent_name": agent, "key": key, "value": str(value)[:500]}, run_id=run)


# ---------------------------------------------------------------------------
# Agent definitions (metadata only — no real engines)
# ---------------------------------------------------------------------------

AGENTS = [
    {
        "name": "planner",
        "provider": "anthropic",
        "model": "claude-opus-4-7",
        "system": "Decompose the user request into a structured research plan. Assign tasks to agents.",
        "tools": ["plan_task", "allocate_work"],
    },
    {
        "name": "researcher_a",
        "provider": "anthropic",
        "model": "claude-haiku-4-5",
        "system": "Search the web for recent news. Cite sources precisely.",
        "tools": ["web_search", "news_api"],
    },
    {
        "name": "researcher_b",
        "provider": "anthropic",
        "model": "claude-haiku-4-5",
        "system": "Query internal databases and verify facts via sub-agents.",
        "tools": ["database_query"],
    },
    {
        "name": "sub_researcher",
        "provider": "anthropic",
        "model": "claude-haiku-4-5",
        "system": "Verify individual claims. Return confidence scores.",
        "tools": ["fact_check", "verify_url"],
    },
    {
        "name": "merger",
        "provider": "anthropic",
        "model": "claude-haiku-4-5",
        "system": "Combine research from multiple sources, removing duplicates.",
        "tools": ["combine_results", "deduplicate"],
    },
    {
        "name": "writer",
        "provider": "anthropic",
        "model": "claude-haiku-4-5",
        "system": "Write polished executive briefs from structured data.",
        "tools": ["format_text"],
    },
]

# ---------------------------------------------------------------------------
# Mock pipeline event sequence
# ---------------------------------------------------------------------------


def _emit_planner(sess: Session, run: str, query: str) -> str:
    """Planner: receives user query, produces a structured research plan."""
    p = time.sleep
    sess.emit(EventType.AGENT_START, {"agent_name": "planner", "task": query}, run_id=run)
    p(0.3)

    # Step 1 — think about approach
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 1,
            "messages": [{"role": "user", "content": query}],
        },
        run_id=run,
    )
    p(0.9)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 1,
            "content": "Breaking query into sub-tasks and preparing research plan...",
            "usage": {"input_tokens": 98, "output_tokens": 31},
        },
        run_id=run,
    )

    # Step 2 — call plan_task tool
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "planner",
            "name": "plan_task",
            "arguments": {"query": query, "depth": "comprehensive"},
        },
        run_id=run,
    )
    p(0.5)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "planner",
            "name": "plan_task",
            "result": (
                "Plan: (1) Web + news search for recent breakthroughs. "
                "(2) DB query for project counts and funding data. "
                "(3) Fact-check key claims. (4) Merge findings. (5) Write brief."
            ),
        },
        run_id=run,
    )
    p(0.3)

    # Step 3 — allocate work to agents
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 2,
            "messages": [{"role": "user", "content": query}],
        },
        run_id=run,
    )
    p(0.7)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 2,
            "content": "Allocating sub-tasks to researcher_a and researcher_b...",
            "usage": {"input_tokens": 175, "output_tokens": 44},
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "planner",
            "name": "allocate_work",
            "arguments": {
                "researcher_a": "Web search + news headlines (last 30 days)",
                "researcher_b": "Internal DB query + fact verification via sub-agent",
            },
        },
        run_id=run,
    )
    p(0.35)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "planner",
            "name": "allocate_work",
            "result": "Work allocated. researcher_a and researcher_b notified.",
        },
        run_id=run,
    )
    p(0.4)

    # Final plan output
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 3,
        },
        run_id=run,
    )
    p(0.65)
    plan_out = (
        "PLAN: 5-step research pipeline on fusion energy 2026. "
        "researcher_a: web + news. researcher_b: DB + verification. "
        "merger: consolidate. writer: executive brief."
    )
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "planner",
            "model": "claude-opus-4-7",
            "step": 3,
            "content": plan_out,
            "usage": {"input_tokens": 220, "output_tokens": 52},
        },
        run_id=run,
    )
    _write("research_plan", plan_out, "planner", sess, run)
    sess.emit(EventType.AGENT_FINISH, {"agent_name": "planner", "result": plan_out}, run_id=run)
    return plan_out


def _emit_parallel_research(sess: Session, run: str, plan: str) -> None:
    """Parallel researcher_a and researcher_b (interleaved events)."""
    task = f"Web search + news headlines (last 30 days). Context: {plan[:120]}"

    def p(s):
        time.sleep(s)

    # Both agents start roughly simultaneously, sharing the planner's context
    sess.emit(EventType.AGENT_START, {"agent_name": "researcher_a", "task": task}, run_id=run)
    p(0.15)
    sess.emit(
        EventType.AGENT_START,
        {
            "agent_name": "researcher_b",
            "task": f"Internal DB query + fact verification via sub-agent. Context: {plan[:120]}",
        },
        run_id=run,
    )
    p(0.3)

    # Both send MODEL_REQUEST
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 1,
            "messages": [{"role": "user", "content": task}],
        },
        run_id=run,
    )
    p(0.1)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 1,
            "messages": [{"role": "user", "content": task}],
        },
        run_id=run,
    )
    p(0.6)

    # researcher_a responds first → web_search
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 1,
            "content": "Searching news for fusion energy...",
            "usage": {"input_tokens": 115, "output_tokens": 22},
        },
        run_id=run,
    )
    p(0.1)
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "researcher_a",
            "name": "web_search",
            "arguments": {"query": "ITER fusion energy Q>1 2026"},
        },
        run_id=run,
    )
    p(0.4)

    # researcher_b responds → database_query (still in parallel)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 1,
            "content": "Querying internal DB for fusion R&D entries...",
            "usage": {"input_tokens": 118, "output_tokens": 24},
        },
        run_id=run,
    )
    p(0.1)
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "researcher_b",
            "name": "database_query",
            "arguments": {"table": "fusion_projects", "filter": "year=2026"},
        },
        run_id=run,
    )
    p(0.5)

    # researcher_a tool_result → news_api
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "researcher_a",
            "name": "web_search",
            "result": "ITER Q=1.2 confirmed Jan 2026. Helion Series D closed at $2.8B.",
        },
        run_id=run,
    )
    p(0.1)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 2,
            "messages": [{"role": "user", "content": task}],
        },
        run_id=run,
    )
    p(0.5)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 2,
            "content": "Getting latest headlines from news_api...",
            "usage": {"input_tokens": 220, "output_tokens": 18},
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "researcher_a",
            "name": "news_api",
            "arguments": {"topic": "fusion energy", "days": 30},
        },
        run_id=run,
    )

    # researcher_b tool_result → calls sub_researcher (nested agent)
    p(0.3)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "researcher_b",
            "name": "database_query",
            "result": "47 active fusion projects, 6 reached ignition threshold in 2026.",
        },
        run_id=run,
    )
    p(0.2)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 2,
            "messages": [{"role": "user", "content": task}],
        },
        run_id=run,
    )
    p(0.55)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 2,
            "content": "Delegating fact verification to sub_researcher...",
            "usage": {"input_tokens": 195, "output_tokens": 20},
        },
        run_id=run,
    )
    # researcher_b calls sub_researcher as a tool
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "researcher_b",
            "name": "sub_researcher",
            "arguments": {"claim": "ITER achieved Q=1.2 in January 2026"},
        },
        run_id=run,
    )

    # --- sub_researcher runs (nested) ---
    p(0.2)
    sess.emit(
        EventType.AGENT_START,
        {
            "agent_name": "sub_researcher",
            "task": "Verify: ITER achieved Q=1.2 in January 2026",
        },
        run_id=run,
    )
    p(0.3)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "sub_researcher",
            "model": "claude-haiku-4-5",
            "step": 1,
            "messages": [{"role": "user", "content": "Verify: ITER Q=1.2 Jan 2026"}],
        },
        run_id=run,
    )
    p(0.6)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "sub_researcher",
            "model": "claude-haiku-4-5",
            "step": 1,
            "content": "Running fact_check tool...",
            "usage": {"input_tokens": 88, "output_tokens": 14},
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "sub_researcher",
            "name": "fact_check",
            "arguments": {"claim": "ITER Q=1.2 January 2026", "sources": ["nature.com", "iter.org"]},
        },
        run_id=run,
    )
    p(0.5)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "sub_researcher",
            "name": "fact_check",
            "result": "VERIFIED (confidence 0.97). ITER press release Jan 14 2026 confirms Q=1.2.",
        },
        run_id=run,
    )
    p(0.15)
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "sub_researcher",
            "name": "verify_url",
            "arguments": {"url": "https://iter.org/news/2026-01-14-q12"},
        },
        run_id=run,
    )
    p(0.35)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "sub_researcher",
            "name": "verify_url",
            "result": "URL valid. Content matches claim. Confidence: 0.99.",
        },
        run_id=run,
    )
    p(0.2)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "sub_researcher",
            "model": "claude-haiku-4-5",
            "step": 2,
            "messages": [{"role": "user", "content": "Verify: ITER Q=1.2 Jan 2026"}],
        },
        run_id=run,
    )
    p(0.55)
    sub_result = "VERIFIED: ITER Q=1.2 confirmed (confidence 0.98). Two independent sources."
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "sub_researcher",
            "model": "claude-haiku-4-5",
            "step": 2,
            "content": sub_result,
            "usage": {"input_tokens": 152, "output_tokens": 28},
        },
        run_id=run,
    )
    _write("verification_result", sub_result, "sub_researcher", sess, run)
    sess.emit(
        EventType.AGENT_FINISH,
        {
            "agent_name": "sub_researcher",
            "result": sub_result,
        },
        run_id=run,
    )

    # researcher_b gets sub_researcher result
    p(0.2)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "researcher_b",
            "name": "sub_researcher",
            "result": sub_result,
        },
        run_id=run,
    )

    # researcher_a news_api result arrives
    p(0.1)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "researcher_a",
            "name": "news_api",
            "result": "30 articles found. Top: 'Fusion Era Begins' (Nature, Jan 15), 'CFS magnet record' (Science, Feb 3).",
        },
        run_id=run,
    )
    p(0.25)

    # Both finalize
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 3,
        },
        run_id=run,
    )
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 3,
        },
        run_id=run,
    )
    p(0.7)

    ra_out = (
        "ITER Q=1.2 confirmed Jan 2026. Helion raised $2.8B. CFS set magnet field record. 30+ news articles in 30 days."
    )
    rb_out = (
        "DB: 47 active projects, 6 reached ignition. "
        "Verification: ITER claim confirmed 0.98 confidence. "
        "Private funding +340% YoY."
    )

    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_a",
            "model": "claude-haiku-4-5",
            "step": 3,
            "content": ra_out,
            "usage": {"input_tokens": 288, "output_tokens": 52},
        },
        run_id=run,
    )
    _write("research_a_findings", ra_out, "researcher_a", sess, run)
    sess.emit(EventType.AGENT_FINISH, {"agent_name": "researcher_a", "result": ra_out}, run_id=run)
    p(0.1)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "researcher_b",
            "model": "claude-haiku-4-5",
            "step": 3,
            "content": rb_out,
            "usage": {"input_tokens": 305, "output_tokens": 58},
        },
        run_id=run,
    )
    _write("research_b_findings", rb_out, "researcher_b", sess, run)
    sess.emit(EventType.AGENT_FINISH, {"agent_name": "researcher_b", "result": rb_out}, run_id=run)

    return ra_out, rb_out


def _emit_merger(sess: Session, run: str, ra: str, rb: str) -> str:
    p = time.sleep
    combined = f"{ra} | {rb}"
    sess.emit(EventType.AGENT_START, {"agent_name": "merger", "task": combined}, run_id=run)
    sess.emit(EventType.STORE_READ, {"agent_name": "merger", "key": "research_a_findings"}, run_id=run)
    sess.emit(EventType.STORE_READ, {"agent_name": "merger", "key": "research_b_findings"}, run_id=run)
    p(0.4)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "merger",
            "model": "claude-haiku-4-5",
            "step": 1,
            "messages": [{"role": "user", "content": combined}],
        },
        run_id=run,
    )
    p(0.65)
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "merger",
            "model": "claude-haiku-4-5",
            "step": 1,
            "content": "Combining and deduplicating...",
            "usage": {"input_tokens": 245, "output_tokens": 19},
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "merger",
            "name": "combine_results",
            "arguments": {"sources": ["researcher_a", "researcher_b"]},
        },
        run_id=run,
    )
    p(0.45)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "merger",
            "name": "combine_results",
            "result": "Merged 9 unique facts from 2 sources.",
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "merger",
            "name": "deduplicate",
            "arguments": {"threshold": 0.85},
        },
        run_id=run,
    )
    p(0.35)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "merger",
            "name": "deduplicate",
            "result": "Removed 2 duplicate entries. 7 unique facts remain.",
        },
        run_id=run,
    )
    p(0.3)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "merger",
            "model": "claude-haiku-4-5",
            "step": 2,
        },
        run_id=run,
    )
    p(0.6)
    merger_out = (
        "7 verified facts: ITER Q=1.2 (verified 0.98), 47 active projects, "
        "6 ignition events, Helion $2.8B, CFS magnet record, Nature+Science coverage, "
        "private funding +340% YoY."
    )
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "merger",
            "model": "claude-haiku-4-5",
            "step": 2,
            "content": merger_out,
            "usage": {"input_tokens": 320, "output_tokens": 61},
        },
        run_id=run,
    )
    _write("merged_facts", merger_out, "merger", sess, run)
    sess.emit(EventType.AGENT_FINISH, {"agent_name": "merger", "result": merger_out}, run_id=run)
    return merger_out


def _emit_writer(sess: Session, run: str, facts: str) -> None:
    p = time.sleep
    sess.emit(EventType.AGENT_START, {"agent_name": "writer", "task": facts}, run_id=run)
    sess.emit(EventType.STORE_READ, {"agent_name": "writer", "key": "merged_facts"}, run_id=run)
    p(0.4)
    sess.emit(
        EventType.MODEL_REQUEST,
        {
            "agent_name": "writer",
            "model": "claude-haiku-4-5",
            "step": 1,
            "messages": [{"role": "user", "content": facts}],
        },
        run_id=run,
    )
    p(1.0)
    final = (
        "EXECUTIVE BRIEF — Fusion Energy 2026\n\n"
        "2026 marks the beginning of the fusion era. ITER achieved energy gain Q=1.2 "
        "in January, confirmed with 0.98 confidence from two independent sources. "
        "Among 47 tracked projects, 6 reached ignition threshold this year. "
        "Private investment surged +340% YoY: Helion raised $2.8B, Commonwealth Fusion "
        "set a new magnet field record. Pilot commercial plants are projected for 2028-2035."
    )
    sess.emit(
        EventType.MODEL_RESPONSE,
        {
            "agent_name": "writer",
            "model": "claude-haiku-4-5",
            "step": 1,
            "content": final,
            "usage": {"input_tokens": 198, "output_tokens": 95},
        },
        run_id=run,
    )
    sess.emit(
        EventType.TOOL_CALL,
        {
            "agent_name": "writer",
            "name": "format_text",
            "arguments": {"style": "executive_brief", "max_words": 120},
        },
        run_id=run,
    )
    p(0.4)
    sess.emit(
        EventType.TOOL_RESULT,
        {
            "agent_name": "writer",
            "name": "format_text",
            "result": "Formatted. Word count: 89. Readability score: A.",
        },
        run_id=run,
    )
    _write("final_brief", final, "writer", sess, run)
    sess.emit(EventType.AGENT_FINISH, {"agent_name": "writer", "result": final}, run_id=run)


def _run_pipeline(sess: Session) -> None:
    run = "mock-run-001"
    query = "Brief me on fusion energy breakthroughs in 2026."
    print("[mock] pipeline starting...")
    plan = _emit_planner(sess, run, query)
    ra, rb = _emit_parallel_research(sess, run, plan)
    merged = _emit_merger(sess, run, ra, rb)
    _emit_writer(sess, run, merged)
    print(f"[mock] pipeline complete — store has {len(_store)} entries")
    print("[viz]  step through events with Space / J / K")
    print("[viz]  click START or END node to see task and output")
    print("[viz]  click any node to see its character sheet")
    print("[viz]  press Ctrl+C to stop")


# ---------------------------------------------------------------------------
# Graph registration
# ---------------------------------------------------------------------------


def _build_graph(sess):
    g = sess.graph

    class _M:
        def __init__(self, meta):
            self.name = meta["name"]
            self._provider_name = meta["provider"]
            self._model_name = meta["model"]
            self.engine = None
            self.system = meta.get("system", "")
            self.description = self.system
            # Fake _tool_map so add_agent() pre-registers tool nodes
            self._tool_map = {t: object() for t in meta.get("tools", [])}

    for meta in AGENTS:
        g.add_agent(_M(meta))

    # Pipeline edges: planner fans out to parallel researchers, then merge → write
    g.add_edge("planner", "researcher_a", label="plan", kind=EdgeType.CONTEXT)
    g.add_edge("planner", "researcher_b", label="plan", kind=EdgeType.CONTEXT)
    g.add_edge("researcher_a", "merger", label="findings_a", kind=EdgeType.CONTEXT)
    g.add_edge("researcher_b", "merger", label="findings_b", kind=EdgeType.CONTEXT)
    g.add_edge("researcher_b", "sub_researcher", label="verify", kind=EdgeType.TOOL)
    g.add_edge("merger", "writer", label="merged_facts", kind=EdgeType.CONTEXT)


# ---------------------------------------------------------------------------
# Entry point
# ---------------------------------------------------------------------------


def main():
    sess = Session(db="examples/viz_mock.db", console=False)
    _build_graph(sess)

    hub = EventHub()
    exporter = HubExporter(hub)
    sess.add_exporter(exporter)

    server = VizServer(
        hub,
        graph_provider=lambda: sess.graph.to_dict(),
        store_provider=_store_provider,
        meta_provider=lambda: {"mode": "live", "session_id": sess.session_id},
        host="127.0.0.1",
        port=0,
    )
    server.start()
    url = server.url
    webbrowser.open(url, new=2)
    print(f"[viz] open -> {url}")

    # Let the browser connect before events start
    time.sleep(2.2)

    t = threading.Thread(target=_run_pipeline, args=(sess,), daemon=True)
    t.start()

    try:
        while True:
            time.sleep(3600)
    except KeyboardInterrupt:
        pass
    finally:
        server.stop()
        sess.close()


if __name__ == "__main__":
    main()
```

## Walkthrough

- **No real agents.** The script registers nodes / edges directly on `session.graph` and emits events via `session.emit(...)` to simulate a multi-agent pipeline. The Visualizer reads the same surfaces it does in live mode.
- **Nested topology.** Each parallel researcher has a sub-researcher in its own `tools=[...]` — the graph has two levels of `as_tool` edges. Useful for stress-testing the renderer.
- **Store writes.** The mock pushes values into the `Store` to exercise the side-panel store viewer that highlights writes as they happen.

## Variations

- Drive the mock from a recorded production trace (load events from a JSONL file, replay them through `session.emit`) — useful for triaging an issue without re-running the production pipeline.
- Slow the event timeline down (`time.sleep(0.5)` between emits) for screen-recording demos where you want viewers to follow along.

## See also

- [Live visualization](https://core.lazybridge.com/recipes/live-visualization/index.md) — the real-LLM sibling.
- [Visualizer](https://core.lazybridge.com/guides/advanced/visualizer/index.md) — the deep reference, including the `EventHub` / `HubExporter` surface for custom UIs.
- [GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md) — the topology representation the mock builds explicitly.
# Decisions

# Decisions

Short reference pages, one per recurring "which one do I use?" question. Each page has a decision tree, a quick-reference table, and pointers to the deep guides for the chosen option.

## Picking your starting point

- [Which tier?](https://core.lazybridge.com/decisions/pick-tier/index.md) — Basic, Mid, Full, or Advanced.

## Inside an Agent

- [Return type](https://core.lazybridge.com/decisions/return-type/index.md) — `.text()` vs `.payload` vs `.metadata`.
- [State layer](https://core.lazybridge.com/decisions/state-layer/index.md) — `Memory`, `Store`, or `sources=`.

## Composing Agents

- [Composition](https://core.lazybridge.com/decisions/composition/index.md) — `Agent.chain` vs `Agent.parallel` vs `Agent(tools=[...])` vs `Plan`.
- [Parallelism](https://core.lazybridge.com/decisions/parallelism/index.md) — automatic (LLM-decided) vs declared.

## Human / verifier placement

- [HumanEngine vs SupervisorEngine](https://core.lazybridge.com/decisions/human-engine-vs-supervisor/index.md)
- [`verify=` placement](https://core.lazybridge.com/decisions/verify-placement/index.md) — Agent / tool / Plan step.

## Production-grade Plan

- [Checkpoint & resume](https://core.lazybridge.com/decisions/checkpoint/index.md) — when is the storage overhead worth it?

## Extension surface

- [Do I need Advanced?](https://core.lazybridge.com/decisions/need-advanced/index.md) — smell test for the framework-author tier.

# Checkpoint & resume

> **When is the storage overhead worth it?**

Checkpointing writes one JSON snapshot per step. Worth it when re-running an early step costs more than the storage round-trip; not worth it for short, idempotent pipelines.

## Decision tree

```text
Short-running pipeline, idempotent if rerun from scratch?
    → No checkpoint. Plan(*steps)  — without store= / checkpoint_key=.

Long or expensive pipeline, partial-run survival matters?
    → Plan(*steps,
           store=Store(db="run.sqlite"),
           checkpoint_key="run-2026-04-30",
           resume=True)
      # Failed step retries on resume; "done" pipeline short-
      # circuits to the cached writes-bucket.

Pipeline waits on external events (webhook, human approval, retry queue)?
    → Same pattern; you split the run across processes and re-enter
      with resume=True after the event is delivered.

Dev loop iterating on a specific step?
    → Pin upstream steps via the same checkpoint_key so you don't
      re-pay for them on every iteration.

Need a user-visible history of every step's Envelope?
    → Checkpoint is minimal — only writes-bucket + next_step + status.
      For full history use Session(exporters=[JsonFileExporter("…")])
      and query session.events.query(...).
```

## Quick reference

| Situation                                         | Use checkpoint?                                        |
| ------------------------------------------------- | ------------------------------------------------------ |
| Short, idempotent pipeline                        | **No**                                                 |
| Expensive, crash-prone, long-running              | **Yes** — `store=` + `checkpoint_key=` + `resume=True` |
| Async / event-driven (re-enters across processes) | **Yes** — same pattern                                 |
| Dev iteration loop on a specific step             | **Yes** — pin upstream via checkpoint                  |
| Want a full run trace for audit                   | **No** — use `Session` + `JsonFileExporter`            |
| Concurrent fan-out runs sharing one Plan shape    | `on_concurrent="fork"` (no resume)                     |

## Notes

- **Checkpoint is minimal.** One JSON write per step: the `writes`-bucket payload, the next step name, the status (`claimed` / `running` / `failed` / `done`), the run UID, and (v2 only) the serialised step-result history. Not a full audit trail — for that pair `Plan` with `Session`.
- **Concurrent runs sharing a key are serialised.** The default `on_concurrent="fail"` raises `ConcurrentPlanRunError` on collision. Use `on_concurrent="fork"` to give each run its own keyspace (incompatible with `resume=True`).
- **Checkpoint writes happen *before* durable Store writes.** Eliminates double-writes on resume; the inverse trade-off is that a crash in the gap loses the durable Store value. The value still lives in the checkpoint's `kv` so the Plan continues correctly — but sidecar consumers reading the Store directly should reconcile against the checkpoint snapshot.
- **A failed parallel band points the checkpoint at the band's *first* step.** The whole band re-runs cleanly so all sibling `writes` are produced consistently. Branches with non- idempotent side effects need idempotency keys.

## See also

- [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — full reference: persisted shape, state transitions, sidecar consumer rules.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — the durable layer behind checkpoints; SQLite WAL mode for thread-safe concurrent access.
- [Parallel plan steps](https://core.lazybridge.com/guides/full/parallel-plan-steps/index.md) — band atomicity rules that drive the "next_step points to the band's first step" behaviour on failure.

# Composition

> **Composing agents: `chain`, `Agent.parallel`, `Plan`, or `tools=[...]`?**

Pick by **who decides what runs when** — pre-scripted by you, LLM-driven, or declared with compile-time validation.

## Decision tree

```text
Linear pipeline, output of step N becomes task of N+1?
    → Agent.chain(a, b, c)
      # sugar for Agent(engine=Plan(Step(target=a, name=a.name),
      #                              Step(target=b, name=b.name), …))

Run the same task on N agents concurrently, get the joined output?
    → Agent.parallel(a, b, c)
      # asyncio.gather over a/b/c — returns ONE Envelope whose
      # text() is the labelled-text join.  Call .run_branches(task)
      # (async) if you need typed per-branch list[Envelope].

Let the LLM decide which sub-agent(s) to call (and when, including
in parallel)?
    → Agent(engine=LLMEngine("…"), tools=[a, b, c])
      # the engine fans out automatically when the model emits
      # multiple tool calls in one turn — no config needed

Declared workflow with typed hand-offs, routing, parallel bands,
or crash-resume?
    → Agent(engine=Plan(Step("…", output=…), Step("…", routes={…}), …),
            tools=[…])
```

## Quick reference

| Who decides the flow?                           | Use                       |
| ----------------------------------------------- | ------------------------- |
| You, linear and fixed                           | `Agent.chain(a, b, c)`    |
| You, deterministic fan-out → list               | `Agent.parallel(a, b, c)` |
| You, declared DAG with types / routing / resume | `Plan(Step(…), …)`        |
| The LLM (which to call, when, in parallel)      | `Agent(tools=[a, b, c])`  |

## Notes

- **`Agent.chain` is sugar for a linear `Plan`.** Use it for one-liner sequential handoffs; reach for the explicit `Plan` the moment you can see a router or a parallel band coming.
- **`Agent.parallel` returns ONE Envelope (since 0.7.9)** whose `payload` is the labelled-text join of every branch. For typed per-branch access call `parallel.run_branches(task)` (async) → `list[Envelope]`.
- **`tools=[a, b, c]` is the LLM-driven path.** When the model emits multiple tool calls in one turn, LazyBridge dispatches them concurrently via `asyncio.gather` — automatic parallelism, no flag.
- **All four shapes compose recursively.** A `Plan` step's `target` can be an agent built from `Agent.chain` or `Agent.parallel`; an `Agent.parallel` branch can itself contain a `Plan`. The unit at every level is the same `Agent`.

## See also: scripted composition

- [Chain](https://core.lazybridge.com/guides/mid/chain/index.md) — sequential composition reference.
- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — `Agent.parallel` semantics; `ParallelAgent` return type.
- [As tool](https://core.lazybridge.com/guides/mid/as-tool/index.md) — when to pass an agent in `tools=[…]` directly vs `agent.as_tool("alias")`.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — declared orchestration with compile-time DAG validation.
- [Parallelism decision](https://core.lazybridge.com/decisions/parallelism/index.md) — automatic vs declared.

## See also: LLM-driven orchestrators

When the structure of the work is decided at runtime, two factory helpers in `lazybridge.ext.planners` ship a pre-built orchestrator agent:

- `orchestrator_agent(...)` (alias `make_planner`) — DAG builder; the LLM composes a `Plan` step by step via builder tools, with compile-time validation.
- `blackboard_orchestrator_agent(...)` (alias `make_blackboard_planner`) — flat to-do list; the LLM manages tasks via `set_plan` / `get_plan` / `mark_done` tools without DAG structure.

Both wrap an LLM with the planning toolkit. Use them when you want LLM-driven dispatch but don't want to author the planning prompt from scratch. See the [Plan tool](https://core.lazybridge.com/recipes/plan-tool/index.md) and [Blackboard planner](https://core.lazybridge.com/recipes/blackboard-planner/index.md) recipes.

# HumanEngine vs SupervisorEngine

> **Which human-in-the-loop engine?**

Two HIL engines under `lazybridge.ext.hil`. The split is by *what the human is allowed to do* during the gate, not by integration shape.

## Decision tree

```text
Simple "wait for input / approve / fill a form"?
    → HumanEngine
      Agent(engine=HumanEngine(timeout=120, default="approve"),
            output=ReviewForm)
      # one prompt, one answer (or per-field prompts when output= is
      # a Pydantic model)

Interactive REPL where the operator can call tools, retry agents
with feedback, and inspect the store?
    → SupervisorEngine
      Agent(engine=SupervisorEngine(tools=[search],
                                    agents=[researcher],
                                    store=store))
      # commands: continue [text] | retry <agent>: <feedback> |
      #           store <key>     | <tool>(<args>)

Automated verification at runtime (no human, LLM judge)?
    → verify=judge_agent
      # NOT human-in-the-loop — see verify-placement.md
```

## Quick reference

| Need                                                     | Use                            |
| -------------------------------------------------------- | ------------------------------ |
| Approve / reject / fill form                             | **`HumanEngine`**              |
| REPL with tool dispatch + agent retry + store inspection | **`SupervisorEngine`**         |
| LLM-as-judge with retry feedback                         | `verify=judge_agent` (not HIL) |

## Notes

- **`HumanEngine` is the lighter variant.** One prompt, one answer; with `output=PydanticModel` the terminal UI prompts field-by-field for a structured form.
- **`SupervisorEngine` is the REPL.** The operator calls tools registered on the engine, retries any agent passed in `agents=[…]` with feedback, and inspects `store=` keys via the `store <key>` command. Heavier but interactive.
- **Both are drop-in `Engine`s.** `Agent(engine=HumanEngine(...))` / `Agent(engine=SupervisorEngine(...))` plug into any pipeline that takes an `Engine`. Use them as a `Plan` step like any other agent.
- **Sugar exists.** `human_agent(...)` and `supervisor_agent(...)` in `lazybridge.ext.hil` skip the explicit `Agent(engine=…)` wrap. Use the sugar for one-liners; the canonical form when the engine choice should be visible at the call site.
- **Set `timeout=` for unattended pipelines.** Both engines hang forever on `timeout=None` (the default) when no human shows up. Pair `timeout=` with `default=` for graceful fallback.

## See also

- [HumanEngine](https://core.lazybridge.com/guides/mid/human-engine/index.md) — full reference, custom UI adapter via `ui=`.
- [SupervisorEngine](https://core.lazybridge.com/guides/full/supervisor/index.md) — REPL command surface, scripted-input tests, async UI via `ainput_fn`.
- [`verify=` placement](https://core.lazybridge.com/decisions/verify-placement/index.md) — automated LLM judge, distinct from HIL.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — `human_agent(...)` / `supervisor_agent(...)` mapped to their `Agent(engine=...)` canonical equivalents.

# Do I need Advanced?

> **Smell test for the framework-author tier.**

Advanced is for framework authorship, not application development. If you're tweaking prompts, swapping models, or building pipelines, you're in Basic / Mid / Full.

## Decision tree

```text
Adding support for a new LLM vendor (Mistral, Cohere, Bedrock,
in-house model, …)?
    → Yes, Advanced.   BaseProvider + LLMEngine.register_provider_rule

Replacing the execution loop with a non-LLM strategy (rules,
deterministic dispatch, RL, recorded-script driver, …)?
    → Yes, Advanced.   Engine protocol

Serialising a Plan to disk / over the wire / between services?
    → Yes, Advanced.   Plan.to_dict / Plan.from_dict

Wiring deeper OpenTelemetry — custom TracerProvider, in-memory
exporter for tests, custom resource attributes?
    → Yes, Advanced.   OTelExporter(exporter=…)

Building a custom UI on top of the live event stream?
    → Yes, Advanced.   Visualizer / EventHub / HubExporter

Importing from `lazybridge.core.*` directly in app code?
    → STOP. Smell test failed — you probably want the public surface
      from `lazybridge import …`.

Just tweaking prompts / swapping models / building pipelines?
    → STOP. Basic / Mid / Full cover this.
```

## Quick reference

| Are you writing…                           | Tier                                      |
| ------------------------------------------ | ----------------------------------------- |
| A provider adapter                         | **Advanced** — `BaseProvider`             |
| A new execution strategy                   | **Advanced** — `Engine` protocol          |
| A plan serialiser / worker pool            | **Advanced** — `Plan.to_dict`             |
| A custom OTel pipeline                     | **Advanced** — `OTelExporter(exporter=…)` |
| A custom UI on session events              | **Advanced** — `EventHub` / `HubExporter` |
| App code importing `lazybridge.core.types` | **Probably not** — reconsider             |
| Pipelines / prompts / models               | **Basic / Mid / Full**                    |

## Notes

- **Advanced is opt-in.** None of the Basic / Mid / Full surface references Advanced primitives — you can ship a pipeline to production without ever opening the Advanced docs.
- **Smell test: imports from `lazybridge.core.*` in app code.** `from lazybridge import Agent, Tool, Envelope, Memory, Store, Session, Plan, Step, …` covers 99% of real use. If you find yourself reaching for `lazybridge.core.types`, step back — the public `Envelope` / `Agent` / `Tool` usually has what you need with a friendlier surface.
- **Stable contracts.** The Advanced surface (`BaseProvider`, `Engine` protocol, `Plan.to_dict`) is stable across minor versions; breaking changes follow a deprecation cycle plus a minor-version bump. Depend on it confidently when you need it.
- **Phase upgrades carefully.** Adding a custom provider or engine is a code-review-worthy change because it bypasses the framework's tested integrations. Pair with regression tests against your custom path.

## See also

- [BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md) — the stable extension point for new LLM backends.
- [Engine protocol](https://core.lazybridge.com/guides/advanced/engine-protocol/index.md) — how to write a non-LLM execution layer.
- [Plan serialization](https://core.lazybridge.com/guides/advanced/plan-serialize/index.md) — `to_dict` / `from_dict` round-trip.
- [OpenTelemetry](https://core.lazybridge.com/guides/advanced/otel/index.md) — deep dive on semantic conventions and custom tracer setup.
- [Visualizer](https://core.lazybridge.com/guides/advanced/visualizer/index.md) — live and replay UI; custom UI hooks via `EventHub`.

# Parallelism

> **Automatic or declared?**

There's no serial / parallel mode switch. Automatic parallelism is always on when the model emits multiple tool calls; declared parallelism is when you fix the shape yourself.

## Decision tree

```text
Want the LLM to decide whether to run things in parallel?
    → Pass them in tools=[...] on a regular Agent. When the model
      emits multiple tool calls in one turn, the engine runs them
      concurrently via asyncio.gather. No config.

Want to declare that N agents run at once on the same task?
    → Agent.parallel(a, b, c)(task)            # → list[Envelope]

Want declared concurrent branches inside a typed workflow?
    → Plan(
          Step(a, parallel=True),
          Step(b, parallel=True),
          Step(join,
               task="Aggregate the branches.",
               context=[from_parallel("a"), from_parallel("b")]),
      )
```

## Quick reference

| Who decides the parallelism shape? | Use                               |
| ---------------------------------- | --------------------------------- |
| The LLM (emergent, per turn)       | `Agent(tools=[a, b, c])`          |
| You (deterministic fan-out → list) | `Agent.parallel(a, b, c)`         |
| You (typed workflow with bands)    | `Plan(Step(…, parallel=True), …)` |

## Notes

- **Automatic parallelism is the default.** When the engine sees multiple `tool_call` messages in one model response, it dispatches them concurrently. There is no setting that turns this off.
- **`Agent.parallel(...)` is scripted fan-out.** Every input agent runs unconditionally on the same task; the result is `list[Envelope]` in input order. Use this when you *know* you want N things to happen.
- **`Step(parallel=True)` bands are atomic.** If any branch errors, no `writes=` from the band are applied — a future `resume=True` re-runs the whole band cleanly. This is why cross-branch side effects (Store writes, external POSTs) need idempotency keys.
- **Only consecutive `parallel=True` steps form a band.** A non-parallel step in between starts a new band; keep parallel siblings contiguous in the declaration.

## See also

- [Parallel](https://core.lazybridge.com/guides/mid/parallel/index.md) — `Agent.parallel` reference (returns `ParallelAgent`, not `Agent`).
- [Parallel plan steps](https://core.lazybridge.com/guides/full/parallel-plan-steps/index.md) — band semantics, `from_parallel_all`, atomicity rules.
- [Composition decision](https://core.lazybridge.com/decisions/composition/index.md) — which composition shape to pick.

# Pick your tier

> **Where do I start: Basic, Mid, Full, or Advanced?**

Start as low as possible. Tiers are additive — moving up never requires changing the code you already wrote.

## Decision tree

```text
Single agent, one call, maybe with one tool?
    → Basic.

Need conversation memory, observability, guardrails, or simple
chain / parallel composition?
    → Mid.

Declared multi-step workflow with typed hand-offs, routing,
crash-resume, or per-step verifiers?
    → Full.

Writing framework code — new provider, new engine,
cross-process Plan serialisation?
    → Advanced.
```

## Quick reference

| You need                                                                                                          | Start at     | Key surface (import path)                                                                                                                                                                                                                                         |
| ----------------------------------------------------------------------------------------------------------------- | ------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| One agent, one tool, one call                                                                                     | **Basic**    | `Agent`, `LLMEngine`, `Envelope`, `Tool`, `NativeTool` — all `from lazybridge`                                                                                                                                                                                    |
| Memory, Store, Session, Guards, `chain` / `parallel` / `as_tool`, MCP, Evals, HumanEngine, `verify=`              | **Mid**      | `Memory`, `Store`, `Session`, `Agent.chain`, `Agent.parallel`, `verify=` from `lazybridge`; `MCP` from `lazytools.connectors.mcp`; `HumanEngine` from `lazybridge.ext.hil`                                                                                        |
| Plan + Step + sentinels, routing, parallel bands, checkpoint / resume, exporters, GraphSchema, `SupervisorEngine` | **Full**     | `Plan`, `Step`, `from_prev` / `from_step` / `from_start` / `from_agent`, `Step(routes=…)`, `Step(parallel=True)`, `GraphSchema`, `ConsoleExporter` / `JsonFileExporter` / `StructuredLogExporter` from `lazybridge`; `SupervisorEngine` from `lazybridge.ext.hil` |
| Custom engine, custom provider, Plan persistence, OTel deep, Visualizer                                           | **Advanced** | `BaseProvider` from `lazybridge.core.providers.base`; `Engine` Protocol from `lazybridge.engines.base`; `Plan.to_dict` / `Plan.from_dict` on the `Plan` class; `OTelExporter` from `lazybridge.ext.otel`; `Visualizer` from `lazybridge.ext.viz`                  |

## Notes

- **Tiers are additive.** Adding a `Memory` to a Basic agent doesn't require restructuring; you just pass it as a kwarg. Same for `Session`, `output=`, `verify=`, `tools=[...]`.
- **Don't reach up too early.** A Plan is overkill for a 3-step text pipeline (use `Agent.chain`); `verify=` is overkill for a policy a regex `Guard` already enforces; a custom engine is overkill if `LLMEngine(system="...", max_turns=N)` covers what you need.
- **Advanced is framework authorship**, not application development. If you're tweaking prompts / swapping models / building pipelines, you're in Basic / Mid / Full.

## See also

- [Mental model](https://core.lazybridge.com/concepts/mental-model/index.md) — Engine + Tools + State, the decomposition every tier shares.
- [Progressive complexity](https://core.lazybridge.com/concepts/progressive-complexity/index.md) — the twelve-rung ladder from one-liner to checkpointed pipeline.
- [Canonical vs sugar](https://core.lazybridge.com/concepts/canonical-vs-sugar/index.md) — the full table of factory shortcuts mapped back to canonical forms.

# Return type

> **What does my agent return — text, typed object, or metadata?**

Every `agent(task)` returns an `Envelope`. What you read off it depends on what you asked for.

## Decision tree

```text
Plain string response?
    → result.text()                # str, always, regardless of payload type

Pydantic model / structured result?
    → Agent(engine=LLMEngine("…"), output=MyModel)
      ...
      result.payload               # MyModel instance

Token count, cost, latency, run id?
    → result.metadata              # EnvelopeMetadata

Need to check for errors before reading payload?
    → if result.ok:
          read result.payload
      else:
          read result.error.message
```

## Quick reference

| You want                                          | Read                                           |
| ------------------------------------------------- | ---------------------------------------------- |
| Stringified response (works for any payload type) | `result.text()`                                |
| Validated typed payload                           | `result.payload` (with `output=PydanticModel`) |
| Token / cost / latency / model / provider         | `result.metadata`                              |
| Did this run succeed?                             | `result.ok`                                    |
| Why did it fail?                                  | `result.error.type` / `result.error.message`   |
| Original task                                     | `result.task`                                  |

## Notes

- **`.text()` is always safe.** It serialises Pydantic payloads as JSON, returns `""` for `None`, and returns the string verbatim for `str` payloads. Use it whenever you want a string regardless of the payload shape.
- **`output=Model` + `.text()` returns JSON, not human-readable text.** With structured output, read `.payload` directly to get the model instance. Calling `.text()` on a structured envelope is a common confusion — see [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) pitfalls.
- **`metadata.nested_*` aren't authoritative for cross-agent rollup.** They reflect what flowed through *this* envelope's lineage, not the entire run. For an authoritative cross-agent cost, query `session.usage_summary()`.
- **Always check `.ok` before reading `.payload` in production.** An error envelope's payload is whatever was last produced (or `None`); it's `result.error` that tells you something went wrong.

## See also

- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — full reference for the `Envelope` shape, `EnvelopeMetadata`, `ErrorInfo`.
- [Agent](https://core.lazybridge.com/guides/basic/agent/index.md) — `output=` kwarg behaviour and structured-output retry.

# State layer

> **State: `Memory`, `Store`, or `sources=`?**

Three composable mechanisms. Pick by *scope* and *lifetime*; they also stack freely on the same agent.

## Decision tree

```text
Conversation history for one agent across multiple calls?
    → Memory(strategy="auto")
      Agent(engine=LLMEngine("…"), memory=memory)

Shared key-value blackboard across multiple agents or runs?
    → Store(db="path.sqlite")    # or Store() for in-process
      Agent(engine=LLMEngine("…"), store=store)
      # writes via Step(writes="key") or agent auto-write

Static documents / live views injected into context at every call?
    → Agent(engine=LLMEngine("…"), sources=[memory, store, "policy text"])

Multiple patterns at once?
    → Yes — they compose freely.
      Agent(
          engine=LLMEngine("…"),
          memory=conversation_memory,
          store=team_store,
          sources=[policy_text, live_metric_provider],
      )
```

## Quick reference

| Use           | When                                                                                         |
| ------------- | -------------------------------------------------------------------------------------------- |
| `Memory`      | Per-agent conversation history; auto-compresses with `strategy="auto"`                       |
| `Store`       | Cross-agent / cross-run state; `db=` for SQLite persistence                                  |
| `sources=[…]` | Live view injected into the system prompt; accepts callables, `Memory`, `Store`, raw strings |

## Notes

- **`Memory` is per-agent by default.** Pass the same `Memory` instance to two agents to share their conversation; the second agent typically reads via `sources=[shared_memory]` so it doesn't accidentally write turns it didn't author.
- **`Store` is the durable counterpart to `Memory`.** Memory is "what should the model see in the next prompt"; Store is "what should survive a crash". They're complementary, not substitutable.
- **`sources=` is a live view.** Each item with `.text()` is re-evaluated at call time, so a `Store` passed in `sources=` reflects the most-recent values without any explicit refresh.
- \*\*Inside a `Plan`, prefer step writes (`Step(writes="key")`)
- `from_step("step_name")` sentinels\*\* for in-pipeline data flow. Keep `Memory` and `sources=` for cross-call state and live document injection.

## See also

- [Memory](https://core.lazybridge.com/guides/mid/memory/index.md) — strategies (auto / sliding / summary / none), summariser configuration.
- [Store](https://core.lazybridge.com/guides/mid/store/index.md) — in-memory vs SQLite, auto-write keys, `from_agent("…")` reads.
- [Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) — `from_step` / `from_agent` / `from_memory` for plumbing state into Plan steps.

# `verify=` placement

> **Agent level, tool level, or Plan step level?**

`verify=` retries with judge feedback. Three placements, same judge contract — pick the narrowest scope that covers the policy.

## Decision tree

```text
Gating the final output of a single agent?
    → Agent(engine=LLMEngine("…"), verify=judge, max_verify=3)

One specific sub-agent (used as a tool by a parent) is the risky
one; rest of the run is fine?
    → risky_subagent.as_tool(verify=judge, max_verify=2)
      Agent(engine=LLMEngine("…"),
            tools=[risky_subagent.as_tool(verify=judge)])

One step of a declared Plan needs a judge; other steps don't?
    → Plan(
          Step(target=Agent(engine=LLMEngine("…"),
                            verify=judge,
                            name="summarise")),
          …,
      )

Want a judge on every tool call (not output)?
    → That isn't what verify= does — see Guards instead.
      Agent(engine=LLMEngine("…"),
            tools=[…],
            guard=ContentGuard(input_fn=…, output_fn=…))
```

## Quick reference

| Scope                                       | Placement                                                                                 |
| ------------------------------------------- | ----------------------------------------------------------------------------------------- |
| Whole agent's final output                  | `Agent(verify=judge)`                                                                     |
| One sub-agent's invocations only            | `subagent.as_tool(verify=judge)`                                                          |
| One Plan step only                          | `Step(target=Agent(verify=judge, name=…))`                                                |
| Every tool invocation (input/output filter) | **Not `verify=`** — use a [Guard](https://core.lazybridge.com/guides/mid/guards/index.md) |

## Notes

- **`verify=` is the soft sibling of Guards.** A `Guard` is a hard yes / no gate that ends the run on failure; `verify=` re-prompts with the judge's reason and retries up to `max_verify` times. Pick by what should happen on failure.
- **The judge contract.** Returns a string starting with `"approved"` (case-insensitive) to accept; anything else is rejection, and the verdict text is appended to the next attempt's task as feedback.
- **Callable judges that return `bool` don't produce feedback.** Retries reuse the same task. Return a string verdict if you want the feedback loop.
- **Nested verify is allowed but expensive.** Agent-level + tool-level + Plan-level on the same path stacks the loops; pick one per agent unless you're intentionally building defence in depth.
- **Keep judges cheap and specific.** Use a smaller / faster model and one criterion per judge — multi-criteria judges conflate failure modes and produce vague feedback.

## See also

- [`verify=`](https://core.lazybridge.com/guides/mid/verify/index.md) — full reference for the three placements, retry-budget tradeoffs, callable vs Agent judges.
- [Guards](https://core.lazybridge.com/guides/mid/guards/index.md) — the hard-gate alternative for input / output filtering.
- [Evals](https://core.lazybridge.com/guides/mid/evals/index.md) — the offline / CI sibling of `verify=` (batch grading instead of live retries).
# Reference

# API reference

Auto-generated from the live docstrings in `lazybridge.*`. Pair with the [Guides](https://core.lazybridge.com/guides/basic/agent/index.md) for narrative explanations and the [Recipes](https://core.lazybridge.com/recipes/index.md) for runnable examples.

## Primary

- [Agent + Envelope](https://core.lazybridge.com/reference/agent/index.md) — the universal wrapper and the typed result object every run produces.

## Tools

- [Tool family](https://core.lazybridge.com/reference/tools/index.md) — `Tool` + the `Tool.wrap()` classmethod factory, the `ToolProvider` protocol, `NativeTool` enum.

## State

- [State primitives](https://core.lazybridge.com/reference/state/index.md) — `Memory`, `Store`.
- [Session & observability](https://core.lazybridge.com/reference/session/index.md) — `Session`, `EventLog`, `EventType`, `GraphSchema`, exporters.
- [Guards](https://core.lazybridge.com/reference/guards/index.md) — `Guard`, `ContentGuard`, `GuardChain`, `LLMGuard`, `GuardError`.

## Engines & orchestration

- [Engines](https://core.lazybridge.com/reference/engines/index.md) — `LLMEngine`, `Plan`, `Step`, `PlanCompileError`, `ToolTimeoutError`, `StreamStallError`.
- [Multi-agent graphs](https://core.lazybridge.com/reference/multi-agent/index.md) — `AgentPool`, `conclude`, `ConcludeSignal` for dynamic, LLM-routed agent systems.
- [Sentinels & predicates](https://core.lazybridge.com/reference/sentinels/index.md) — `from_*` sentinels and the `when` DSL for routing.

## Extensions

- [Extension engines](https://core.lazybridge.com/reference/extensions/index.md) — `HumanEngine`, `SupervisorEngine`, MCP integration, `EvalSuite`, `Visualizer`, `OTelExporter`.
- [Custom providers](https://core.lazybridge.com/reference/providers/index.md) — `BaseProvider` + the `LLMEngine.register_provider_*` registry.

## Configuration & testing

- [Runtime configs & testing](https://core.lazybridge.com/reference/configs/index.md) — `CacheConfig` (kept) and `MockAgent`. The 0.7-era `AgentRuntimeConfig` / `ResilienceConfig` / `ObservabilityConfig` were deleted in 0.7.9; fleet config uses a flat-kwarg dict spread now.

# Agent + Envelope

The universal wrapper and the typed result object every run produces. For narrative usage see [Guides → Basic → Agent](https://core.lazybridge.com/guides/basic/agent/index.md) and [Guides → Basic → Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md). If you're migrating from a 0.7-era surface (`Agent.from_*` factories, config dataclasses, `_ParallelAgent`), see the [0.7 → 0.7.9 migration guide](https://core.lazybridge.com/migrations/0.7-to-0.79/index.md).

### lazybridge.Agent

```python
Agent(engine: str | Any | None = None, tools: list[Tool | Callable | Agent] | None = None, output: type = str, memory: Any | None = None, store: Any | None = None, sources: list[Any] | None = None, guard: Any | None = None, verify: Agent | Callable[[str], Any] | None = None, max_verify: int = 3, name: str | None = None, description: str | None = None, session: Any | None = None, verbose: bool = False, model: str | None = None, native_tools: list[Any] | None = None, allow_dangerous_native_tools: bool = False, output_validator: Callable[[Any], Any] | None = None, max_output_retries: int = 2, timeout: float | None = None, max_retries: int = 3, retry_delay: float = 1.0, fallback: Agent | None = None, cache: bool | Any = False)
```

Universal agent — `Agent(engine, tools, state)`.

Every Agent has the same shape, regardless of what it does:

- `engine` — the brain: decides what happens (LLM, Plan, Human, …)
- `tools` — the capabilities: what the agent can invoke
- state — `memory`, `session`, `guard`, `verify`, `output`

**Canonical composition** — give each sub-agent an explicit `name=` and pass it directly in `tools=[...]`::

```text
from lazybridge import Agent, LLMEngine, Plan, Step, tool, from_prev, from_step

search = tool(search_web, name="search", description="Search the web.")

researcher = Agent(
    name="research",
    engine=LLMEngine("claude-haiku-4-5", system="You are a research expert."),
    tools=[search],
)
writer = Agent(
    name="write",
    engine=LLMEngine("gpt-5.4-mini", system="You are a concise writer."),
)

# Deterministic orchestrator — Plan engine
pipeline = Agent(
    name="pipeline",
    engine=Plan(
        Step("research"),
        Step("write", task=from_prev, context=from_step("research")),
    ),
    tools=[researcher, writer],   # agents passed directly
    session=sess,
)

# Dynamic orchestrator — LLM engine
orchestrator = Agent(
    name="orchestrator",
    engine=LLMEngine("claude-opus-4-8"),
    tools=[researcher, writer],
    session=sess,
)
```

The engine is the only thing that changes. Everything else — tools, memory, session, guard, output — is the same surface on every Agent.

**String shortcut** — `Agent("claude-opus-4-8")` is sugar for `Agent(engine=LLMEngine("claude-opus-4-8"))`. Use the explicit form when you need to configure the engine (`system=`, `max_turns=`, `thinking=`, etc.).

**The name chain** — `Agent(name=...)` is the authoritative key that connects every part of the system::

```text
Agent(name="research")      →  tool map key when passed in tools=[researcher]
Step("research")            →  looks up "research" in tool map ✓
from_step("research")       →  reads output of step "research" (in-Plan) ✓
from_agent("research")      →  reads stored output of "research" (cross-run) ✓
from_memory("research")     →  reads live memory of "research" ✓
```

**Advanced alias / backward compat** — `.as_tool("alias")` remains available when you need a different name than the agent's own::

```text
tools=[researcher.as_tool("deep_research")]
```

**Factory methods** that build real structure (not pure aliases) live on the class:

- `Agent.chain(a, b)` — sequential: builds a `Plan` of one `Step` per agent.
- `Agent.parallel(*agents)` — scripted fan-out: returns a `ParallelAgent` whose `__call__` yields one `Envelope` (labelled-text join across every branch, with transitive cost rollup). For typed per-branch `list[Envelope]` access call `parallel.run_branches(task)` (async).
- `Agent.from_provider(provider, tier="medium")` — resolves a tier alias (`cheap` / `medium` / `top` / …) to that provider's current model.

Extension engines live in :mod:`lazybridge.ext` to respect the core/ext import boundary::

```text
from lazybridge.ext.hil import HumanEngine, SupervisorEngine
Agent(engine=HumanEngine(timeout=60), tools=[approve])
Agent(engine=SupervisorEngine(tools=[...]))
```

Source code in `lazybridge/agent.py`

```python
def __init__(
    self,
    engine: str | Any | None = None,
    tools: list[Tool | Callable | Agent] | None = None,
    output: type = str,
    memory: Any | None = None,
    store: Any | None = None,
    sources: list[Any] | None = None,
    guard: Any | None = None,
    verify: Agent | Callable[[str], Any] | None = None,
    max_verify: int = 3,
    name: str | None = None,
    description: str | None = None,
    session: Any | None = None,
    verbose: bool = False,
    # Convenience: pin a specific model id via ``model=`` on the
    # auto-LLMEngine path.  Only consumed when ``engine=None`` or
    # ``engine=<model_string>``; passing ``model=`` alongside a
    # pre-built engine raises (see the model-vs-engine check below).
    # For tier-alias model selection use ``Agent.from_provider(
    # "anthropic", tier="top")`` — the bare-provider-name shortcut
    # ``Agent("anthropic", ...)`` was removed in 0.7.9.x because it
    # left the model id ambiguous at request time.
    model: str | None = None,
    # Provider-native server-side tools (WEB_SEARCH, CODE_EXECUTION, …).
    # Accepted directly on Agent as a shortcut for
    # ``Agent(engine=LLMEngine(..., native_tools=[...]))``.  Ignored when
    # ``engine=`` is a non-LLM engine.
    native_tools: list[Any] | None = None,
    # Required opt-in for capabilities with broad access (CODE_EXECUTION,
    # COMPUTER_USE).  Forwarded to LLMEngine when auto-created and used
    # to gate the pre-built engine path so callers can't silently bypass
    # the LLMEngine.__init__ check by passing engine= separately.
    allow_dangerous_native_tools: bool = False,
    # --- Resilience kwargs ---
    # Optional post-parse validator.  Runs on the structured ``payload``
    # after schema validation; may raise ValueError to force a
    # retry-with-feedback loop (up to ``max_output_retries``).
    output_validator: Callable[[Any], Any] | None = None,
    max_output_retries: int = 2,
    # Total deadline (seconds) for ``run()``.  ``None`` disables.
    timeout: float | None = None,
    # Provider retry/backoff — forwarded to LLMEngine when the engine
    # is auto-created from a model string.  Ignored when ``engine=``
    # is supplied explicitly (configure on ``LLMEngine`` directly).
    max_retries: int = 3,
    retry_delay: float = 1.0,
    # Fallback agent tried when the primary engine returns an error.
    # ``Agent("claude-opus-4-7", fallback=Agent("gpt-4o"))``.)
    fallback: Agent | None = None,
    # Prompt caching — when True, marks the static prefix (system
    # prompt + tools) as cacheable so providers that support it
    # (Anthropic today; OpenAI/DeepSeek auto-cache; Google uses a
    # different API) serve cache hits at ~10% of input token cost.
    # Pass a ``CacheConfig(ttl="1h")`` instance for the longer
    # Anthropic TTL.  Forwarded to LLMEngine when the engine is
    # auto-created.  Ignored when ``engine=`` is supplied explicitly
    # (configure ``LLMEngine(cache=...)`` directly).
    cache: bool | Any = False,
) -> None:
    # ``name`` is "explicit" when the caller supplied a real string
    # value (not None / blank).  Used downstream to require a name
    # when the agent is later passed in ``tools=[...]``.
    _name_explicit_flag: bool = name is not None and str(name).strip() != ""
    from lazybridge.engines.llm import LLMEngine

    # Phase-3 Block H, T6 — ``model=`` is only meaningful on the LLM-engine
    # construction path (engine is None or a model-string).  Passing both
    # ``model=`` and a non-string ``engine=`` was silently dropped pre-0.8;
    # 0.7.9 raises so the typo / misconfiguration is visible.
    if model is not None and engine is not None and not isinstance(engine, str):
        raise ValueError(
            f"Agent(model={model!r}, engine={type(engine).__name__}(...)): "
            f"the ``model=`` kwarg is only consumed when ``engine=None`` or "
            f"``engine=<model_string>`` (in which case Agent auto-builds an "
            f"``LLMEngine``).  When you pass a pre-built engine, configure "
            f"the model on that engine itself.\n"
            f"  Fix: drop ``model=`` (engine controls the model), or pass "
            f"the model string directly: ``Agent({model!r}, ...)``."
        )

    # Canonical: Agent(engine=LLMEngine(...)) or Agent(engine=Plan(...))
    # Sugar:     Agent("claude-opus-4-7") → engine is a model string → auto-builds LLMEngine
    #            Agent() → engine is None → defaults to "claude-opus-4-7"
    if engine is None or isinstance(engine, str):
        model_str = model or engine or "claude-opus-4-7"
        self.engine: Any = LLMEngine(
            model_str,
            native_tools=native_tools,
            allow_dangerous_native_tools=allow_dangerous_native_tools,
            max_retries=max_retries,
            retry_delay=retry_delay,
            cache=cache,
        )
    else:
        self.engine = engine
        # Phase-3 Block H, T7 — when the engine isn't an LLM (Plan,
        # SupervisorEngine, HumanEngine, custom), the auto-name fallback
        # to the engine's ``model`` attribute (or to the literal
        # ``"agent"`` placeholder) silently produces ambiguous names that
        # collide once the agent is used as a tool or referenced by a
        # ``Step``.  Require ``name=`` upfront so the failure is at the
        # construction point rather than at first composition.
        if not _name_explicit_flag and not hasattr(self.engine, "model"):
            engine_kind = type(self.engine).__name__
            raise ValueError(
                f"Agent(engine={engine_kind}(...)) requires an explicit ``name=``.\n"
                f"  Engines other than ``LLMEngine`` have no ``.model`` attribute to derive\n"
                f"  a default name from, so the agent would silently get the placeholder\n"
                f"  ``'agent'`` and collide the moment another agent is built or composed.\n"
                f"  Fix: pass ``name=`` (e.g. ``Agent(engine={engine_kind}(...), name='pipeline')``)."
            )

    # If the caller passed native_tools but also supplied a pre-built
    # engine, push the list onto the engine if it has the attribute.
    # This lets ``Agent(engine=LLMEngine("claude"), native_tools=[...])``
    # work the same as ``Agent("claude", native_tools=[...])``.)
    if native_tools and hasattr(self.engine, "native_tools"):
        from lazybridge.core.types import NativeTool

        resolved = [NativeTool(t) if isinstance(t, str) else t for t in native_tools]
        # Run the same dangerous-tools gate that LLMEngine.__init__ would
        # run — prevents bypassing it by passing engine= separately.
        _DANGEROUS = {NativeTool.CODE_EXECUTION, NativeTool.COMPUTER_USE}
        found = [t for t in resolved if t in _DANGEROUS]
        if found and not allow_dangerous_native_tools:
            names = ", ".join(t.value for t in found)
            raise ValueError(
                f"Native tools {names} have broad system access. Pass allow_dangerous_native_tools=True to opt in."
            )
        # Merge without dup — preserve order of existing + append new.
        existing = list(getattr(self.engine, "native_tools", []) or [])
        for t in resolved:
            if t not in existing:
                existing.append(t)
        self.engine.native_tools = existing

    self._tools_raw = list(tools or [])
    # Validate before building the tool map so errors surface early with
    # the agent's current name rather than a wrapped Tool name.
    for _raw in self._tools_raw:
        # Default True so duck-typed agents (MockAgent, custom subclasses)
        # that predate _name_explicit are not rejected.  Only real Agent
        # instances explicitly set this to False when no name= was given.
        if getattr(_raw, "_is_lazy_agent", False) and getattr(_raw, "_name_explicit", True) is False:
            _raw_name = getattr(_raw, "name", repr(_raw))
            raise ValueError(
                f"Agent used as a tool must have an explicit name=...\n"
                f"The agent currently has name={_raw_name!r} "
                f"(derived from the model or left as the default).\n\n"
                f"Set an explicit name:\n"
                f'    Agent(name="research", engine=LLMEngine(...))\n\n'
                f"Or use an alias:\n"
                f'    agent.as_tool("research")\n'
                f'    tool(agent, name="research")'
            )
    self._tool_map: dict[str, Tool] = build_tool_map(self._tools_raw)
    self.output = output
    self.output_validator = output_validator
    self.max_output_retries = max_output_retries
    self.timeout = timeout
    self.memory = memory
    self.store = store
    self.sources = list(sources or [])
    if max_verify < 1:
        raise ValueError(f"max_verify must be >= 1, got {max_verify!r}")
    if max_output_retries < 0:
        raise ValueError(f"max_output_retries must be >= 0, got {max_output_retries!r}")
    self.guard = guard
    self.verify = verify
    self.max_verify = max_verify
    self.fallback = fallback
    if self.fallback is not None:
        seen: set[int] = {id(self)}
        fb: Agent | None = self.fallback
        while fb is not None:
            if id(fb) in seen:
                raise ValueError("fallback= chain contains a cycle. Check your Agent(fallback=...) configuration.")
            seen.add(id(fb))
            fb = getattr(fb, "fallback", None)
    self.name: str = str(name or getattr(self.engine, "model", None) or "agent")
    self.description = description
    #: True when the caller supplied an explicit ``name=``.  False
    #: when the name was derived from the model string or left as
    #: the ``"agent"`` default.
    #: Used by ``build_tool_map`` and the ``tool()`` factory to require
    #: an explicit identity before an Agent is used as a sub-agent tool.
    self._name_explicit: bool = _name_explicit_flag

    # Private per-agent console exporter when verbose= is set without
    # an explicit Session. Attached when we bind to a session below.
    self._verbose = verbose

    # Bind to session — create an implicit private Session if verbose=
    # is requested without one, so events print to stdout out of the box.
    if session is None and verbose:
        from lazybridge.session import Session

        session = Session(console=True)
    self.session = session

    self.engine._agent_name = self.name

    # Register with session graph so it's visible in session.graph.to_json()
    _safe_register_agent(self.session, self)

    # Propagate session to nested Agents passed as tools (they become
    # part of the same observability surface — events from B called
    # via A flow into A's EventLog). Agents that already have a
    # session keep it. This is the fix for the "as_tool" observability
    # paradox: without it, calling B through A's tool loop recorded
    # nothing anywhere.
    if self.session is not None:
        for raw in self._tools_raw:
            if not getattr(raw, "_is_lazy_agent", False):
                continue
            agent_raw = cast("Agent", raw)
            child_session = getattr(agent_raw, "session", None)

            if child_session is None:
                # Propagate parent session down to child and register both
                # the agent node and the parent → child edge.
                agent_raw.session = self.session
                _safe_register_agent(self.session, agent_raw)
                _safe_register_tool_edge(self.session, self, agent_raw, label=agent_raw.name)
            elif child_session is self.session:
                # Child already shares the same session (canonical pattern:
                # all agents built with session= up front).  Register the
                # edge — it was missing because the old guard checked for
                # ``session is None`` only.
                _safe_register_agent(self.session, agent_raw)
                _safe_register_tool_edge(self.session, self, agent_raw, label=agent_raw.name)
            # else: child belongs to a different session — don't steal it.
        # ``fallback=`` and ``verify=`` Agents inherit the same
        # session + graph-registration the tools list gets, so any
        # events they produce (errors handled by the fallback, judge
        # verdicts from verify) flow into the outer EventLog.  Edge
        # labels distinguish provenance.
        for related, label in (
            (self.fallback, "fallback"),
            (self.verify, "verify"),
        ):
            if (
                related is not None
                and getattr(related, "_is_lazy_agent", False)
                and getattr(related, "session", None) is None
            ):
                # Duck-typed ``_is_lazy_agent`` covers Agent and MockAgent
                # (and any other agent-shaped object); cast for the typed
                # graph helpers since the static type of ``related`` now
                # admits a plain Callable via the widened ``verify=``.
                related_agent = cast("Agent", related)
                related_agent.session = self.session
                _safe_register_agent(self.session, related_agent)
                _safe_register_tool_edge(self.session, self, related_agent, label=label)

    # PlanCompiler runs at construction time
    if hasattr(self.engine, "_validate"):
        self.engine._validate(self._tool_map)
```

#### stream

```python
stream(task: str | Envelope, *, images: list[Any] | None = None, audio: Any | None = None) -> AsyncGenerator[str, None]
```

Stream LLM tokens across the full tool-calling loop.

**Guard enforcement.** `self.guard` is checked via `acheck_input` before the first token is emitted. A blocked input raises :class:`ValueError` immediately; a modified input (`GuardAction.modify`) replaces the task in the envelope sent to the provider. This is identical to the guard contract in :meth:`run` — streaming mode does not bypass the guard.

Honours `self.timeout` between chunks so a stalled provider can't hang the caller. Each `__anext__` is wrapped in `asyncio.wait_for` (per-chunk, not whole-stream) so short chunks don't consume the deadline budget.

Multimodal: pass `images=` / `audio=` to attach blocks to the streamed turn — same coercion semantics as :meth:`run`.

Source code in `lazybridge/agent.py`

```python
async def stream(
    self,
    task: str | Envelope,
    *,
    images: list[Any] | None = None,
    audio: Any | None = None,
) -> AsyncGenerator[str, None]:
    """Stream LLM tokens across the full tool-calling loop.

    **Guard enforcement.** ``self.guard`` is checked via
    ``acheck_input`` before the first token is emitted.  A blocked
    input raises :class:`ValueError` immediately; a modified input
    (``GuardAction.modify``) replaces the task in the envelope sent
    to the provider.  This is identical to the guard contract in
    :meth:`run` — streaming mode does not bypass the guard.

    Honours ``self.timeout`` between chunks so a stalled provider
    can't hang the caller.  Each ``__anext__`` is wrapped in
    ``asyncio.wait_for`` (per-chunk, not whole-stream) so short
    chunks don't consume the deadline budget.

    Multimodal: pass ``images=`` / ``audio=`` to attach blocks to
    the streamed turn — same coercion semantics as :meth:`run`.
    """
    env = self._to_envelope(task, images=images, audio=audio)
    env = self._inject_sources(env)
    timeout = getattr(self, "timeout", None)

    # Apply input guard before the first token is emitted.  A blocked
    # task must never reach the provider even in streaming mode.
    if self.guard:
        action = await self.guard.acheck_input(env.task or "")
        if not action.allowed:
            raise ValueError(action.message or "Blocked by guard")
        if action.modified_text is not None:
            env = env.model_copy(update={"task": action.modified_text, "payload": action.modified_text})

    gen = self.engine.stream(
        env,
        tools=list(self._tool_map.values()),
        output_type=self.output,
        memory=self.memory,
        session=self.session,
    ).__aiter__()
    chunks: list[str] = []
    _completed = False
    try:
        while True:
            try:
                if timeout is None:
                    chunk = await gen.__anext__()
                else:
                    chunk = await asyncio.wait_for(gen.__anext__(), timeout=timeout)
            except StopAsyncIteration:
                _completed = True
                return
            yield chunk
            chunks.append(chunk)
    finally:
        aclose = getattr(gen, "aclose", None)
        if aclose is not None:
            try:
                await aclose()
            except asyncio.CancelledError:
                # Cancellation is BaseException; let it propagate so
                # structured cancellation works through the boundary.
                raise
            except Exception as exc:
                # Surface non-cancellation errors as a UserWarning so
                # buggy provider cleanup paths don't disappear; we
                # still don't re-raise (the consumer has either
                # finished or already abandoned the stream).
                import warnings as _w

                _w.warn(
                    f"stream() aclose raised {type(exc).__name__}: {exc}.",
                    stacklevel=2,
                )
        if _completed:
            _store = getattr(self, "store", None)
            if _store is not None and self.name and chunks:
                from lazybridge.sentinels import _AGENT_OUTPUT_KEY_PREFIX

                _store.write(_AGENT_OUTPUT_KEY_PREFIX + self.name, "".join(chunks))
```

#### as_tool

```python
as_tool(name: str | None = None, description: str | None = None, *, verify: Agent | Callable[[str], Any] | None = None, max_verify: int = 3) -> Tool
```

Wrap this agent as a :class:`Tool` (advanced / compatibility API).

The canonical way to use a sub-agent is to give it an explicit `name=` and pass it directly in `tools=[...]`::

```text
# Canonical
researcher = Agent(name="research", engine=LLMEngine(...))
orchestrator = Agent(..., tools=[researcher])
```

`.as_tool()` remains available for **local aliases** and **backward compatibility**::

```text
# Advanced alias — use a different name than the agent's own
tools=[researcher.as_tool("deep_research")]

# Backward compat — existing code that already calls as_tool()
tools=[researcher.as_tool("research")]
```

The tool schema is `(task: str) -> Envelope`.

Verify (Option B) — wrap the call in a judge/retry loop so every invocation is vetted before returning::

```text
judge = Agent(engine=LLMEngine(
    "claude-opus-4-7",
    system="Reply 'approved' or 'rejected: <reason>'.",
))
synth = Agent(name="synth", engine=LLMEngine(...))
orchestrator = Agent(
    ...,
    tools=[synth.as_tool("synthesize", verify=judge, max_verify=2)],
)
```

`verify` can be either an :class:`Agent` (its `run` method is called with the output) or a plain callable taking the output text and returning a verdict string / bool. On rejection, the judge's feedback is injected into the next attempt's task.

Source code in `lazybridge/agent.py`

```python
def as_tool(
    self,
    name: str | None = None,
    description: str | None = None,
    *,
    verify: Agent | Callable[[str], Any] | None = None,
    max_verify: int = 3,
) -> Tool:
    """Wrap this agent as a :class:`Tool` (advanced / compatibility API).

    The canonical way to use a sub-agent is to give it an explicit
    ``name=`` and pass it directly in ``tools=[...]``::

        # Canonical
        researcher = Agent(name="research", engine=LLMEngine(...))
        orchestrator = Agent(..., tools=[researcher])

    ``.as_tool()`` remains available for **local aliases** and
    **backward compatibility**::

        # Advanced alias — use a different name than the agent's own
        tools=[researcher.as_tool("deep_research")]

        # Backward compat — existing code that already calls as_tool()
        tools=[researcher.as_tool("research")]

    The tool schema is ``(task: str) -> Envelope``.

    Verify (Option B) — wrap the call in a judge/retry loop so every
    invocation is vetted before returning::

        judge = Agent(engine=LLMEngine(
            "claude-opus-4-7",
            system="Reply 'approved' or 'rejected: <reason>'.",
        ))
        synth = Agent(name="synth", engine=LLMEngine(...))
        orchestrator = Agent(
            ...,
            tools=[synth.as_tool("synthesize", verify=judge, max_verify=2)],
        )

    ``verify`` can be either an :class:`Agent` (its ``run`` method is
    called with the output) or a plain callable taking the output text
    and returning a verdict string / bool. On rejection, the judge's
    feedback is injected into the next attempt's task.
    """
    if max_verify < 1:
        raise ValueError(f"max_verify must be >= 1, got {max_verify!r}")
    agent = self
    effective_name = name or self.name
    effective_desc = description or self.description or f"Run the {effective_name} agent."

    if verify is None:

        async def _run(task: str) -> Envelope:
            # ``_run_as_tool`` (not ``run``) so a ``conclude`` raised inside
            # the sub-agent propagates up to the top-level caller instead of
            # being absorbed here.  Duck-typed doubles without it fall back
            # to ``run``.
            runner = getattr(agent, "_run_as_tool", agent.run)
            result = await runner(task)
            # Always write under the alias so from_agent("alias") can find
            # the output regardless of agent.name.  _run_body also writes
            # under agent.name (for standalone callers); the alias write
            # here is the authoritative key for Plan sentinel resolution.
            _store = getattr(agent, "store", None)
            if _store is not None and result.ok and effective_name != getattr(agent, "name", None):
                from lazybridge.sentinels import _AGENT_OUTPUT_KEY_PREFIX

                _store.write(_AGENT_OUTPUT_KEY_PREFIX + effective_name, result.text())
            return result
    else:

        async def _run(task: str) -> Envelope:  # type: ignore[misc]
            from lazybridge._verify import verify_with_retry
            from lazybridge.envelope import Envelope as _Env

            env = _Env.from_task(str(task))
            result = await verify_with_retry(
                agent,
                env,
                verify,
                max_verify=max_verify,
            )
            _store = getattr(agent, "store", None)
            if _store is not None and result.ok and effective_name != getattr(agent, "name", None):
                from lazybridge.sentinels import _AGENT_OUTPUT_KEY_PREFIX

                _store.write(_AGENT_OUTPUT_KEY_PREFIX + effective_name, result.text())
            return result

    _run.__name__ = effective_name
    _run.__doc__ = effective_desc

    return Tool(
        _run,
        name=effective_name,
        description=effective_desc,
        mode="signature",
        returns_envelope=True,
        agent_memory=getattr(self, "memory", None),
        agent_store=getattr(self, "store", None),
    )
```

#### definition

```python
definition() -> Any
```

ToolDefinition for this agent — used when passed in tools=[] of another agent.

Source code in `lazybridge/agent.py`

```python
def definition(self) -> Any:
    """ToolDefinition for this agent — used when passed in tools=[] of another agent."""
    return self.as_tool().definition()
```

#### derive

```python
derive(*, tools: list[Any] | None = None, **overrides: Any) -> Agent
```

Return a NEW Agent: a clone of self plus extra tools and/or overrides.

Does not mutate self. The returned Agent runs full constructor validation, so the build-time guarantee (construction implies a valid graph) is preserved unchanged.

Use when an existing agent needs context-specific capabilities — a pool handle, a one-shot tool, a different name in a sub-context — without rebuilding it field by field.

```text
debater = base.derive(tools=[pool.as_tool(), conclude, vote_tool])
renamed = base.derive(name="alias")
```

`tools=` are APPENDED to the agent's existing tools. Any other keyword in `overrides` replaces the corresponding constructor argument.

Source code in `lazybridge/agent.py`

```python
def derive(self, *, tools: list[Any] | None = None, **overrides: Any) -> Agent:
    """Return a NEW Agent: a clone of self plus extra tools and/or overrides.

    Does not mutate self. The returned Agent runs full constructor
    validation, so the build-time guarantee (construction implies a valid
    graph) is preserved unchanged.

    Use when an existing agent needs context-specific capabilities — a pool
    handle, a one-shot tool, a different name in a sub-context — without
    rebuilding it field by field.

        debater = base.derive(tools=[pool.as_tool(), conclude, vote_tool])
        renamed = base.derive(name="alias")

    ``tools=`` are APPENDED to the agent's existing tools. Any other keyword
    in ``overrides`` replaces the corresponding constructor argument.
    """
    base_kwargs = dict(
        engine=self.engine,
        tools=list(self._tools_raw) + list(tools or []),
        output=self.output,
        memory=self.memory,
        store=self.store,
        sources=list(self.sources),
        guard=self.guard,
        verify=self.verify,
        max_verify=self.max_verify,
        name=self.name,
        description=self.description,
        session=self.session,
        verbose=self._verbose,
        timeout=self.timeout,
        output_validator=self.output_validator,
        max_output_retries=self.max_output_retries,
        fallback=self.fallback,
    )
    base_kwargs.update(overrides)
    derived = Agent(**base_kwargs)
    # __init__ stamps engine._agent_name on every construction; since the engine
    # object is shared, restore the original so base-agent runs continue to
    # log/emit under the correct identity.
    if hasattr(self.engine, "_agent_name"):
        self.engine._agent_name = self.name
    # Preserve _name_explicit unless name= was explicitly overridden — passing
    # name=self.name through __init__ would otherwise silently promote an
    # implicitly-named agent to explicitly-named, bypassing the guard.
    if "name" not in overrides:
        derived._name_explicit = self._name_explicit
    return derived
```

#### from_provider

```python
from_provider(provider: str, *, tier: Tier | str = 'medium', **kwargs: Any) -> Agent
```

Construct an Agent for `provider` using its tier alias for model selection.

Tiers (`super_cheap` / `cheap` / `medium` / `expensive` / `top`) resolve to each provider's current lineup, so preview and date-pinned model names stay in one place::

```text
Agent.from_provider("anthropic", tier="top")
Agent.from_provider("openai", tier="cheap", tools=[search])
```

Source code in `lazybridge/agent.py`

```python
@classmethod
def from_provider(
    cls,
    provider: str,
    *,
    tier: Tier | str = "medium",
    **kwargs: Any,
) -> Agent:
    """Construct an Agent for ``provider`` using its tier alias for model selection.

    Tiers (``super_cheap`` / ``cheap`` / ``medium`` / ``expensive`` /
    ``top``) resolve to each provider's current lineup, so preview and
    date-pinned model names stay in one place::

        Agent.from_provider("anthropic", tier="top")
        Agent.from_provider("openai", tier="cheap", tools=[search])
    """
    from lazybridge.engines.llm import LLMEngine

    # Pass both the tier (as the provider-facing model string, which
    # the BaseProvider resolves via its tier map) AND the explicit
    # provider name (so _infer_provider doesn't fall back to the
    # default when the tier alone isn't a recognised model).
    return cls(engine=LLMEngine(tier, provider=provider), **kwargs)
```

#### chain

```python
chain(*agents: Agent, **kwargs: Any) -> Agent
```

Run agents sequentially: output of each becomes input to the next.

Source code in `lazybridge/agent.py`

```python
@classmethod
def chain(cls, *agents: Agent, **kwargs: Any) -> Agent:
    """Run agents sequentially: output of each becomes input to the next."""
    from lazybridge.engines.plan import Plan, Step

    steps = [Step(target=a, name=a.name) for a in agents]
    plan = Plan(*steps)
    name = kwargs.pop("name", "chain")
    # Don't auto-wrap agents as tools — ``Plan._exec_step`` dispatches
    # Agent targets via ``target.run()`` directly, so wrapping them
    # would just waste schema-compilation on every chain call.
    # Caller-supplied tools= in kwargs still pass through unchanged.
    return cls(engine=plan, name=name, **kwargs)
```

#### parallel

```python
parallel(*agents: Agent, concurrency_limit: int | None = None, step_timeout: float | None = None, **kwargs: Any) -> ParallelAgent
```

Deterministic fan-out: run `agents` concurrently on the same task.

Returns a :class:`ParallelAgent` whose `__call__` produces a single :class:`Envelope` — labelled-text join of every branch's output, with transitive cost rollup. For typed access to per-branch envelopes call `ParallelAgent.run_branches(task)` (async).

Use this when you **know** you want N things to happen in parallel. If you want the LLM to decide whether to call agents in parallel (and which, and how), don't use this — pass them as `tools=[...]` on a regular `Agent` instead; the engine emits parallel tool calls automatically when the model requests them.

Source code in `lazybridge/agent.py`

```python
@classmethod
def parallel(
    cls,
    *agents: Agent,
    concurrency_limit: int | None = None,
    step_timeout: float | None = None,
    **kwargs: Any,
) -> ParallelAgent:
    """Deterministic fan-out: run ``agents`` concurrently on the same task.

    Returns a :class:`ParallelAgent` whose ``__call__`` produces a
    single :class:`Envelope` — labelled-text join of every branch's
    output, with transitive cost rollup.  For typed access to per-branch
    envelopes call ``ParallelAgent.run_branches(task)`` (async).

    Use this when you **know** you want N things to happen in
    parallel.  If you want the LLM to decide whether to call agents
    in parallel (and which, and how), don't use this — pass them as
    ``tools=[...]`` on a regular ``Agent`` instead; the engine emits
    parallel tool calls automatically when the model requests them.
    """
    return ParallelAgent(
        agents=list(agents),
        concurrency_limit=concurrency_limit,
        step_timeout=step_timeout,
        **kwargs,
    )
```

### lazybridge.ParallelAgent

```python
ParallelAgent(agents: list[Agent], *, concurrency_limit: int | None = None, step_timeout: float | None = None, name: str = 'parallel', description: str | None = None, session: Any | None = None)
```

Deterministic fan-out over N agents — the shape behind :meth:`Agent.parallel`.

Pre-scripted parallel runner. Every input agent receives the same task; the N branch results are folded into a single :class:`Envelope` via labelled-text join — same shape as :class:`Plan`'s `from_parallel_all` aggregator. Cost roll-up is transitive. The first non-`None` branch error propagates as the wrapper's `error` so downstream consumers can short-circuit.

Prefer :class:`Agent` with `tools=[...]` when you want the engine (LLM, Supervisor, Plan) to decide dynamically which tools to invoke and when — parallel execution is automatic on that path.

Per-branch typed access: call :meth:`run_branches` (async) when you need `list[Envelope]` rather than the joined wrapper.

Source code in `lazybridge/agent.py`

```python
def __init__(
    self,
    agents: list[Agent],
    *,
    concurrency_limit: int | None = None,
    step_timeout: float | None = None,
    name: str = "parallel",
    description: str | None = None,
    session: Any | None = None,
) -> None:
    self.agents = agents
    self.concurrency_limit = concurrency_limit
    self.step_timeout = step_timeout
    self.name = name
    self.description = description
    self.session = session
```

#### run_branches

```python
run_branches(task: str | Envelope) -> list[Envelope]
```

Async per-branch entry point — returns one `Envelope` per input agent in input order. Use this when you need typed access to individual branch results; for the framework-uniform single-Envelope view, use :meth:`run` or `__call__`.

Source code in `lazybridge/agent.py`

```python
async def run_branches(self, task: str | Envelope) -> list[Envelope]:
    """Async per-branch entry point — returns one ``Envelope`` per
    input agent in input order.  Use this when you need typed
    access to individual branch results; for the framework-uniform
    single-Envelope view, use :meth:`run` or ``__call__``.
    """
    env = Agent._to_envelope(task) if isinstance(task, str) else task
    sem = asyncio.Semaphore(self.concurrency_limit) if self.concurrency_limit else None

    async def _run_one(agent: Agent) -> Envelope:
        async def _coro() -> Envelope:
            if self.step_timeout:
                return await asyncio.wait_for(agent.run(env), timeout=self.step_timeout)
            return await agent.run(env)

        if sem:
            async with sem:
                return await _coro()
        return await _coro()

    results = await asyncio.gather(*[_run_one(a) for a in self.agents], return_exceptions=True)
    out: list[Envelope] = []
    for r in results:
        if isinstance(r, Envelope):
            out.append(r)
        elif isinstance(r, asyncio.CancelledError):
            # CancelledError is BaseException (not Exception) in Python 3.8+;
            # wrapping it as an error envelope would silently swallow the
            # cancellation signal. Re-raise so structured cancellation works.
            raise r
        elif isinstance(r, Exception):
            out.append(Envelope.error_envelope(r))
        else:
            out.append(Envelope.error_envelope(RuntimeError(str(r))))
    return out
```

#### run

```python
run(task: str | Envelope) -> Envelope
```

Run every branch and return one folded :class:`Envelope`.

The wrapper's `payload` is the labelled-text join of every branch's `.text()`; `metadata.nested_*` rolls every branch's cost up so the outer envelope reports total spend. The first non-`None` branch error propagates as the wrapper's `error`.

For typed per-branch access, call :meth:`run_branches`.

Source code in `lazybridge/agent.py`

```python
async def run(self, task: str | Envelope) -> Envelope:
    """Run every branch and return one folded :class:`Envelope`.

    The wrapper's ``payload`` is the labelled-text join of every
    branch's ``.text()``; ``metadata.nested_*`` rolls every branch's
    cost up so the outer envelope reports total spend.  The first
    non-``None`` branch error propagates as the wrapper's ``error``.

    For typed per-branch access, call :meth:`run_branches`.
    """
    branches = await self.run_branches(task)
    return self._join_branches(task, branches)
```

#### as_tool

```python
as_tool(name: str | None = None, description: str | None = None) -> Tool
```

Expose the fan-out runner as a single :class:`Tool`.

Just delegates to :meth:`run` — same labelled-text Envelope as every direct caller sees, so a `ParallelAgent` passed in `tools=[...]` produces output identical to a hand-call.

Source code in `lazybridge/agent.py`

```python
def as_tool(
    self,
    name: str | None = None,
    description: str | None = None,
) -> Tool:
    """Expose the fan-out runner as a single :class:`Tool`.

    Just delegates to :meth:`run` — same labelled-text Envelope as
    every direct caller sees, so a ``ParallelAgent`` passed in
    ``tools=[...]`` produces output identical to a hand-call.
    """
    from lazybridge.tools import Tool

    actual_name = name or self.name or "parallel"
    actual_desc = (
        description or self.description or (f"Run {len(self.agents)} agents in parallel and join their outputs.")
    )

    async def _run(task: str) -> Envelope:
        return await self.run(task)

    _run.__name__ = actual_name
    _run.__doc__ = actual_desc

    return Tool(
        _run,
        name=actual_name,
        description=actual_desc,
        mode="signature",
        returns_envelope=True,
    )
```

### lazybridge.Envelope

Bases: `BaseModel`, `Generic[T]`

Typed envelope carrying a payload of type `T`.

`Envelope[str]` → payload is a string. `Envelope[MyModel]` → payload is an instance of `MyModel`. `Envelope` (no parameter) defaults to `T = Any` for maximum flexibility.

Multimodal attachments (`images=` / `audio=`) ride alongside `task` and reach the LLMEngine's user-message builder verbatim. Steps in a Plan see them only on step 0; downstream steps receive upstream output (text), not the original attachments.

#### __str__

```python
__str__() -> str
```

Stringification falls through to :meth:`text`.

Needed because tools that return an `Envelope` (agent-as-tool) cross back into content blocks expected by the LLM API, which serialise the value via `str(...)`. Without this, any such tool would produce `"task=… context=…"` garbage instead of the agent's actual answer.

Source code in `lazybridge/envelope.py`

```python
def __str__(self) -> str:
    """Stringification falls through to :meth:`text`.

    Needed because tools that return an ``Envelope`` (agent-as-tool)
    cross back into content blocks expected by the LLM API, which
    serialise the value via ``str(...)``.  Without this, any such
    tool would produce ``"task=…  context=…"`` garbage instead of
    the agent's actual answer.
    """
    return self.text()
```

## Multimodal content blocks

For mixed-modality inputs (text + image + audio), pass `images=` and `audio=` kwargs on `agent(...)`, `await agent.run(...)`, or `async for chunk in agent.stream(...)`. Bare URL strings, `Path` objects, raw `bytes`, and `dict` payloads are coerced into the typed blocks below automatically — use these constructors directly only when you need to override the auto-detected MIME type. Narrative coverage lives in [Guides → Mid → Multimodal](https://core.lazybridge.com/guides/mid/multimodal/index.md).

### lazybridge.ImageContent

```python
ImageContent(url: str | None = None, base64_data: str | None = None, media_type: str = 'image/jpeg', type: ContentType = ContentType.IMAGE)
```

#### from_data_uri

```python
from_data_uri(data_uri: str) -> ImageContent
```

Parse `data:image/png;base64,<...>` style URIs.

Source code in `lazybridge/core/types.py`

```python
@classmethod
def from_data_uri(cls, data_uri: str) -> ImageContent:
    """Parse ``data:image/png;base64,<...>`` style URIs."""
    if not data_uri.startswith("data:"):
        raise ValueError("expected a data: URI")
    header, _, body = data_uri[5:].partition(",")
    if not body:
        raise ValueError("malformed data URI: missing payload")
    media_type, _, encoding = header.partition(";")
    media_type = media_type or "image/jpeg"
    if encoding == "base64":
        return cls(base64_data=body, media_type=media_type)
    return cls(base64_data=base64.b64encode(body.encode()).decode("ascii"), media_type=media_type)
```

### lazybridge.AudioContent

```python
AudioContent(url: str | None = None, base64_data: str | None = None, media_type: str = 'audio/wav', type: ContentType = ContentType.AUDIO)
```

Audio attachment for multimodal LLM input.

Provider support varies by model — see :meth:`BaseProvider.supports_audio`. Anthropic / OpenAI accept base64 only; Google Gemini accepts both URL and base64.

#### from_data_uri

```python
from_data_uri(data_uri: str) -> AudioContent
```

Parse `data:audio/flac;base64,<...>` style URIs.

Source code in `lazybridge/core/types.py`

```python
@classmethod
def from_data_uri(cls, data_uri: str) -> AudioContent:
    """Parse ``data:audio/flac;base64,<...>`` style URIs."""
    if not data_uri.startswith("data:"):
        raise ValueError("expected a data: URI")
    header, _, body = data_uri[5:].partition(",")
    if not body:
        raise ValueError("malformed data URI: missing payload")
    media_type, _, encoding = header.partition(";")
    media_type = media_type or "audio/wav"
    if encoding == "base64":
        return cls(base64_data=body, media_type=media_type)
    return cls(base64_data=base64.b64encode(body.encode()).decode("ascii"), media_type=media_type)
```

# Runtime configs & testing

The 0.7-era `AgentRuntimeConfig` / `ResilienceConfig` / `ObservabilityConfig` wrapper-of-flat-kwargs configs were deleted in 0.7.9 — they bundled flat kwargs into shareable objects with a `flat kwarg > config object > default` precedence game that required a private `_UNSET` sentinel on every kwarg (a documented LLM trap, T14 in the audit).

For fleet management, use a Python dict spread:

```python
PROD_DEFAULTS = dict(
    timeout=60,
    max_retries=5,
    max_output_retries=2,
    cache=True,
    verbose=False,
    session=session,
)

researcher = Agent(**PROD_DEFAULTS, engine=LLMEngine("model"), name="research")
writer     = Agent(**PROD_DEFAULTS, engine=LLMEngine("model"), name="write")
```

Same end-user value, no precedence-game complexity, no sentinel.

## Cache config (kept)

`CacheConfig` is intentionally kept — it carries real semantic value (`enabled`, `ttl`) consumed inside `LLMEngine`.

### lazybridge.CacheConfig

```python
CacheConfig(enabled: bool = True, ttl: str = '5m')
```

Mark the static prefix of a request (system prompt + tool definitions) as cacheable.

Providers with explicit prompt caching (Anthropic) need a `cache_control` marker on the last block of each cached segment; this makes it a one-flag opt-in instead of asking callers to hand-craft provider-specific content lists.

Provider-specific behaviour:

- **Anthropic** — the system prompt is upgraded from a string to a `[{type: "text", text, cache_control}]` block; a cache breakpoint is also placed on the last tool definition if tools are present. Cache hits cost ~10% of input tokens; writes cost ~25% more. TTL options: `"5m"` (default) or `"1h"`.
- **OpenAI** — automatic for system prompts >1024 tokens; no user-visible opt-in is required. This config is a no-op but accepted for forward-compat.
- **Google Gemini** — explicit Context Caching uses a different lifecycle (create a cache resource, reference by name). Not auto-wired from this config; pass via `extra` if needed.
- **DeepSeek** — automatic; no-op.

## Testing

### lazybridge.MockAgent

```python
MockAgent(responses: Any, *, name: str = 'mock_agent', description: str | None = None, output: type = str, cycle: bool = False, delay_ms: float = 0.0, default_input_tokens: int = 10, default_output_tokens: int = 20, default_cost_usd: float = 0.0, default_latency_ms: float | None = None, default_model: str = 'mock', default_provider: str = 'mock')
```

Deterministic test double that quacks like :class:`lazybridge.Agent`.

Designed for testing **pipeline composition and data transmission** without touching a real provider. Drop-in compatible with:

- `Agent(..., tools=[mock])` — wrapped via `_wrap_tool` using the `_is_lazy_agent` duck-type marker; nested Envelope metadata rolls up through the tool boundary.
- `Plan(Step(target=mock, ...))` — PlanEngine detects `_is_lazy_agent` and calls `target.run(env)` directly.
- `mock.as_tool()` returns a standard :class:`Tool`.
- `Agent.chain(mock_a, mock_b)` / `Agent.parallel(mock_a, mock_b)`.

**Response specification.** The `responses` argument is resolved per call in this order:

1. **Callable** `fn(env) -> value` (sync or async) — the incoming :class:`Envelope` is passed in. Whatever the callable returns is re-fed through rules 2–4.
1. **Dict** — keys are substrings checked against `env.task` in insertion order; `"*"` is a catch-all default. No match and no default raises :class:`RuntimeError`.
1. **List** — one response per call, in order. Exhausting the list raises :class:`RuntimeError` unless `cycle=True`.
1. **Scalar** — returned on every call.

**Response value types** (after callable resolution):

- :class:`Envelope` — returned verbatim; payload + metadata + error preserved. Use this to test error-path propagation.
- :class:`ErrorInfo` — returned as `Envelope(error=...)`.
- :class:`BaseException` instance — raised from `run()` so tests can assert on provider-style failures.
- :class:`~pydantic.BaseModel` / str / dict / list / scalar — wrapped as the envelope payload with the mock's default metadata (tokens, cost, model, provider).

**Recording.** Every call is appended to :attr:`calls`. Use :meth:`assert_called_with`, :meth:`assert_call_count`, :attr:`call_count`, :attr:`last_call` for assertions. :meth:`reset` clears both the call log and the list/cycle cursor.

Example::

```text
from lazybridge import Agent, Plan, Step
from lazybridge.testing import MockAgent

researcher = MockAgent(
    {"weather": "sunny", "market": "bullish", "*": "no data"},
    name="researcher",
    default_input_tokens=50, default_output_tokens=30,
)
writer = MockAgent(
    lambda env: f"Report based on: {env.text()}",
    name="writer",
)

plan = Plan(
    Step(target=researcher, task="weather today"),
    Step(target=writer),   # from_prev by default
)
agent = Agent(engine=plan)
env = agent("daily brief")

assert "sunny" in env.text()
assert researcher.call_count == 1
assert writer.call_count == 1
assert env.metadata.input_tokens + env.metadata.output_tokens > 0
```

Source code in `lazybridge/testing.py`

```python
def __init__(
    self,
    responses: Any,
    *,
    name: str = "mock_agent",
    description: str | None = None,
    output: type = str,
    cycle: bool = False,
    delay_ms: float = 0.0,
    default_input_tokens: int = 10,
    default_output_tokens: int = 20,
    default_cost_usd: float = 0.0,
    default_latency_ms: float | None = None,
    default_model: str = "mock",
    default_provider: str = "mock",
) -> None:
    self._responses = responses
    self._cycle = cycle
    self._cursor = 0
    self.name = name
    self.description = description
    self.output = output
    # Mirror the Agent surface so test code that introspects agents
    # (session graph registration, verbose exporters, …) works
    # uniformly regardless of whether the agent is real or mocked.
    self.session: Any | None = None
    self.memory: Any | None = None
    self.sources: list[Any] = []
    self.guard: Any | None = None
    self.verify: Any | None = None
    self.timeout: float | None = None
    self.fallback: Any | None = None
    self._tools_raw: list[Any] = []
    self._tool_map: dict[str, Any] = {}

    self._delay_ms = float(delay_ms)
    self._default_input_tokens = int(default_input_tokens)
    self._default_output_tokens = int(default_output_tokens)
    self._default_cost_usd = float(default_cost_usd)
    self._default_latency_ms = default_latency_ms  # None = use measured elapsed
    self._default_model = default_model
    self._default_provider = default_provider

    self.calls: list[MockCall] = []
```

#### stream

```python
stream(task: str | Envelope) -> AsyncGenerator[str, None]
```

Minimal streaming surface: yields the final text in one chunk.

Real streaming isn't meaningful for a deterministic mock, but tests that exercise the `.stream()` path still need something that behaves like an async generator.

Source code in `lazybridge/testing.py`

```python
async def stream(self, task: str | Envelope) -> AsyncGenerator[str, None]:
    """Minimal streaming surface: yields the final text in one chunk.

    Real streaming isn't meaningful for a deterministic mock, but
    tests that exercise the ``.stream()`` path still need something
    that behaves like an async generator.
    """
    env = await self.run(task)
    yield env.text()
```

#### definition

```python
definition() -> Any
```

ToolDefinition for this mock — mirrors `Agent.definition()`.

Source code in `lazybridge/testing.py`

```python
def definition(self) -> Any:
    """ToolDefinition for this mock — mirrors ``Agent.definition()``."""
    return self.as_tool().definition()
```

#### reset

```python
reset() -> None
```

Clear recorded calls and rewind the list/cycle cursor.

Source code in `lazybridge/testing.py`

```python
def reset(self) -> None:
    """Clear recorded calls and rewind the list/cycle cursor."""
    self.calls.clear()
    self._cursor = 0
```

#### assert_called_with

```python
assert_called_with(task: str | None = None, *, contains: str | None = None) -> None
```

Assert at least one call matched `task=` exactly or `contains=`.

Source code in `lazybridge/testing.py`

```python
def assert_called_with(
    self,
    task: str | None = None,
    *,
    contains: str | None = None,
) -> None:
    """Assert at least one call matched ``task=`` exactly or ``contains=``."""
    for c in self.calls:
        if task is not None and c.task == task:
            return
        if contains is not None and contains in (c.task or ""):
            return
    raise AssertionError(
        f"MockAgent({self.name!r}): no call matched task={task!r} "
        f"contains={contains!r}. Recorded tasks: "
        f"{[c.task for c in self.calls]}"
    )
```

# Engines

`LLMEngine` is the LLM-driven tool-calling loop; `Plan` is the deterministic-DAG engine and `Step` is its unit; `ReplanEngine` is the adaptive counterpart to `Plan` for pipelines whose shape is decided at runtime by a planner agent (`PlanRound` / `Task` are its output schema). `PlanCompileError` fires at construction for invalid DAGs; `ToolTimeoutError` and `StreamStallError` surface from the LLM engine's safety nets.

For narrative usage see [Guides → Full → Plan](https://core.lazybridge.com/guides/full/plan/index.md), [Step](https://core.lazybridge.com/guides/full/step/index.md), [ReplanEngine](https://core.lazybridge.com/guides/full/replan-engine/index.md), and the [Engine protocol](https://core.lazybridge.com/guides/advanced/engine-protocol/index.md) (extension surface).

## LLM engine

`LLMEngine` ships several production-grade knobs that are easy to miss in the auto-generated signature below. Quick reference:

| Knob                           | Default                    | Purpose                                                                                                                                                                                                                                                                                                                                                                                 |
| ------------------------------ | -------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `max_turns`                    | `20`                       | Cap on tool-calling rounds; prevents runaway loops                                                                                                                                                                                                                                                                                                                                      |
| `tool_choice`                  | `"auto"`                   | `"auto"` / `"any"`; first-turn force-to-call mapped per provider                                                                                                                                                                                                                                                                                                                        |
| `max_retries`                  | `3`                        | Provider transient-error retries with exponential backoff + jitter                                                                                                                                                                                                                                                                                                                      |
| `request_timeout`              | `120.0`                    | Per-completion deadline. Distinct from `Agent(timeout=N)` (total run). `None` disables                                                                                                                                                                                                                                                                                                  |
| `max_parallel_tools`           | `8`                        | Cap on concurrent tool calls within one turn. `None` = unbounded                                                                                                                                                                                                                                                                                                                        |
| `tool_timeout`                 | `None`                     | Per-tool `asyncio.wait_for` deadline; on timeout reports `is_error=True` to the model loop                                                                                                                                                                                                                                                                                              |
| `stream_idle_timeout`          | `90.0`                     | Idle gap between streaming chunks before `StreamStallError`; pass `None` to disable (a one-shot `UserWarning` is emitted at `LLMEngine.__init__` time, not at stream time).                                                                                                                                                                                                             |
| `stream_buffer`                | `64`                       | Bounded queue for streaming producers. Must be ≥1                                                                                                                                                                                                                                                                                                                                       |
| `allow_dangerous_native_tools` | `False`                    | Security gate for `CODE_EXECUTION` / `COMPUTER_USE`; opt-in required                                                                                                                                                                                                                                                                                                                    |
| `thinking`                     | `False` / `ThinkingConfig` | Extended-thinking opt-in. Anthropic Opus 4.6+ / Claude 4.7 use adaptive thinking (server-managed budget; pass `display="omitted"` to hide thoughts). OpenAI `o1`/`o3`/`o4`/`gpt-5` and Gemini 2.5+ surface `reasoning_tokens` automatically; passing a `ThinkingConfig(effort=...)` is forwarded where the provider supports it. See provider-capability matrix in `lazybridge.matrix`. |
| `strict_multimodal`            | `False`                    | Raise `UnsupportedFeatureError` when the model doesn't support an attachment modality                                                                                                                                                                                                                                                                                                   |

`strict_native_tools` is **not** an `LLMEngine` knob — it lives on `BaseProvider` (constructor arg + class attribute). Configure it where you build the provider, e.g. `AnthropicProvider(..., strict_native_tools=True)`; LLMEngine reads it off the resolved provider at request time. See [`BaseProvider`](https://core.lazybridge.com/guides/advanced/base-provider/index.md).

### lazybridge.LLMEngine

```python
LLMEngine(model: str, *, provider: str | None = None, thinking: bool = False, max_turns: int = 20, tool_choice: Literal['auto', 'any'] = 'auto', temperature: float | None = None, system: str | None = None, native_tools: list[NativeTool | str] | None = None, allow_dangerous_native_tools: bool = False, max_retries: int = 3, retry_delay: float = 1.0, request_timeout: float | None = 120.0, max_parallel_tools: int | None = 8, max_tool_calls_per_turn: int | None = None, tool_timeout: float | None = None, stream_idle_timeout: float | None = _USE_DEFAULT_STREAM_IDLE, stream_buffer: int = 64, cache: bool | Any = False, strict_multimodal: bool = False)
```

Drives the LLM ↔ tool-call loop for a single agent invocation.

##### Parameters

model: Model string, e.g. "claude-opus-4-7". Provider is inferred automatically. thinking: Enable extended thinking (Anthropic) or reasoning (OpenAI o-series). max_turns: Maximum tool-call rounds before giving up with MaxTurnsExceeded error. Default 20 — a plain tool-using task typically completes in 2-5 rounds; the 20-round budget leaves headroom for deeper reasoning loops without capping legitimate pipelines. Bump higher for deliberately long agentic tasks; lower it during dev to fail fast. tool_choice: "auto" — provider decides when to call tools (default). "any" — provider must call at least one tool on the first turn.

```text
When ``"any"`` is set, the engine maps it to ``"required"`` on
the wire (Anthropic and OpenAI both reject the literal string
``"any"`` as an unknown tool name).  After the model satisfies
the "must call at least one tool" contract on the first turn,
the engine resets to ``"auto"`` for all subsequent turns so the
model can produce a final text answer.  Without this reset the
model would be forced to call tools on every turn, causing an
infinite loop until ``max_turns`` is exhausted.

When the model emits multiple tool calls in a single turn,
LazyBridge always executes them concurrently via ``asyncio.gather``.
That is a capability of the engine, not a configuration knob;
there is no "serial" execution path for LLM-emitted tool calls.
```

temperature: Sampling temperature. None = provider default. system: Static system prompt. Agent.sources= / Envelope.context are added on top. native_tools: Provider-native server-side tools, e.g. NativeTool.WEB_SEARCH. max_retries: Retries on transient provider errors (429, 5xx, network/timeout). Default 3 — production-safe. Pass 0 to disable. retry_delay: Base delay (seconds) for exponential backoff with ±10% jitter. request_timeout: Per-completion deadline in seconds. Caps the time a hung provider can block an agent run. `None` disables the framework-level timeout and defers to the provider SDK. max_parallel_tools: Maximum number of tool calls executed concurrently within a single model turn. Default is `8`. `None` means unbounded — every tool call returned by the model runs in parallel. Set to a small integer (e.g. 4–8) to apply backpressure on wide tool fan-outs and prevent thread/socket/DB exhaustion on a single turn. max_tool_calls_per_turn: Maximum number of tool calls *executed* per model turn — distinct from `max_parallel_tools` (which only bounds how many run concurrently). `None` (default) executes every call the model emits. Set to `1` to keep a multi-agent graph on a single, non-branching path: the first call runs and any extras get an `is_error` result block telling the model only one call per turn is allowed, so it learns to emit fewer. See :class:`~lazybridge.AgentPool` and :func:`~lazybridge.conclude`. tool_timeout: Per-tool deadline in seconds. When set, each tool execution is wrapped in `asyncio.wait_for`. On timeout the tool's result is reported as `is_error=True` to the model loop so the model can recover; the run does not abort. `None` (default) leaves tools unbounded. stream_idle_timeout: Maximum time (seconds) the engine will wait between successive streaming chunks before raising `StreamStallError`. Catches half-open streams without killing legitimately long fast streams. Defaults to :data:`DEFAULT_STREAM_IDLE_TIMEOUT` (`90.0` s). Pass an explicit `None` to disable stall detection — a one-shot `UserWarning` is emitted because a half-open provider stream then pins the worker indefinitely.

Source code in `lazybridge/engines/llm.py`

```python
def __init__(
    self,
    model: str,
    *,
    provider: str | None = None,
    thinking: bool = False,
    max_turns: int = 20,
    tool_choice: Literal["auto", "any"] = "auto",
    temperature: float | None = None,
    system: str | None = None,
    native_tools: list[NativeTool | str] | None = None,
    allow_dangerous_native_tools: bool = False,
    max_retries: int = 3,
    retry_delay: float = 1.0,
    request_timeout: float | None = 120.0,
    max_parallel_tools: int | None = 8,
    max_tool_calls_per_turn: int | None = None,
    tool_timeout: float | None = None,
    stream_idle_timeout: float | None = _USE_DEFAULT_STREAM_IDLE,
    stream_buffer: int = 64,
    cache: bool | Any = False,
    strict_multimodal: bool = False,
) -> None:
    self.model = model
    self.thinking = thinking
    self.max_turns = max_turns
    self.max_retries = max_retries
    self.retry_delay = retry_delay
    self.request_timeout = request_timeout
    if max_parallel_tools is not None and max_parallel_tools < 1:
        raise ValueError(f"max_parallel_tools must be >= 1 or None, got {max_parallel_tools!r}")
    if max_tool_calls_per_turn is not None and max_tool_calls_per_turn < 1:
        raise ValueError(f"max_tool_calls_per_turn must be >= 1 or None, got {max_tool_calls_per_turn!r}")
    if tool_timeout is not None and tool_timeout <= 0:
        raise ValueError(f"tool_timeout must be > 0 or None, got {tool_timeout!r}")
    # ``stream_idle_timeout`` has three valid input shapes:
    #   * sentinel (caller did not specify) → use the safe default
    #   * positive float                    → custom timeout
    #   * explicit None                     → disabled, but warn loudly
    # Anything else (zero, negative) raises.
    if stream_idle_timeout is _USE_DEFAULT_STREAM_IDLE:
        stream_idle_timeout = DEFAULT_STREAM_IDLE_TIMEOUT
    elif stream_idle_timeout is None:
        warnings.warn(
            "LLMEngine(stream_idle_timeout=None) disables stream stall "
            "detection.  A half-open provider stream (TCP RST never "
            f"delivered, HTTP/2 PING dropped) will then pin a worker "
            f"indefinitely.  The safe default is "
            f"{DEFAULT_STREAM_IDLE_TIMEOUT}s — pass a positive float to "
            "tune it instead of disabling.",
            UserWarning,
            stacklevel=2,
        )
    elif stream_idle_timeout <= 0:
        raise ValueError(f"stream_idle_timeout must be > 0 or None, got {stream_idle_timeout!r}")
    if stream_buffer < 1:
        raise ValueError(f"stream_buffer must be >= 1, got {stream_buffer!r}")
    self.max_parallel_tools = max_parallel_tools
    self.max_tool_calls_per_turn = max_tool_calls_per_turn
    self.tool_timeout = tool_timeout
    self.stream_idle_timeout = stream_idle_timeout
    # Bounded queue between the streaming producer (provider) and
    # the consumer (the ``stream()`` async generator).  Pre-W4.1
    # the queue was unbounded, so a slow consumer (slow terminal,
    # slow network, blocked downstream) caused the queue to grow
    # without limit while the provider kept pushing tokens.  A
    # bounded queue propagates backpressure all the way to the
    # provider stream — when the consumer pauses, the producer
    # naturally pauses on ``await sink.put()``.
    self.stream_buffer = stream_buffer
    # ``tool_choice="parallel"`` was deprecated in 0.7.0 and removed in
    # 0.7.9.  Concurrent tool execution is the default and cannot be
    # disabled; the model decides how many tools to call per turn and
    # they run via asyncio.gather.  Defensive runtime check for the
    # JSON-deserialisation case (the Literal annotation already covers
    # static callers); cast through Any so mypy --strict doesn't flag
    # the comparison as non-overlapping.
    if cast(Any, tool_choice) == "parallel":
        raise ValueError(
            "LLMEngine(tool_choice='parallel') was removed in 0.7.9.  "
            "Concurrent tool execution is now the default and cannot be "
            "disabled; drop the argument (or use 'auto' / 'any')."
        )
    self.tool_choice = tool_choice
    self.temperature = temperature
    self.system = system
    # CODE_EXECUTION and COMPUTER_USE grant the model arbitrary code
    # execution / OS control, so they require an explicit opt-in.
    # FILE_SEARCH (OpenAI vector-store lookup) is intentionally NOT
    # gated here: it only reaches data the caller has already uploaded
    # to their own vector store and carries no ambient privilege.
    _DANGEROUS = {NativeTool.CODE_EXECUTION, NativeTool.COMPUTER_USE}
    resolved_native = [NativeTool(t) if isinstance(t, str) else t for t in (native_tools or [])]
    if not allow_dangerous_native_tools:
        found = [t for t in resolved_native if t in _DANGEROUS]
        if found:
            raise ValueError(
                f"Native tools {[t.value for t in found]!r} require explicit opt-in "
                f"because they grant the model code-execution or computer-control "
                f"capabilities. Pass allow_dangerous_native_tools=True to confirm."
            )
    self.native_tools: list[NativeTool] = resolved_native
    # Prompt caching — ``cache=True`` enables the default
    # (5-minute TTL on Anthropic; no-op on OpenAI / Google /
    # DeepSeek because they either cache automatically or need a
    # different API).  Callers wanting the 1-hour TTL pass a
    # ``CacheConfig(ttl="1h")`` object directly.
    from lazybridge.core.types import CacheConfig

    if cache is True:
        self.cache: CacheConfig | None = CacheConfig(enabled=True)
    elif cache is False or cache is None:
        self.cache = None
    else:
        self.cache = cache  # assumed CacheConfig
    # When True, ``Envelope.images`` / ``.audio`` reaching a model
    # that does not support that modality raises
    # ``UnsupportedFeatureError`` instead of warning-and-stripping.
    # Off by default so a single agent fleet can mix vision and
    # text-only models without crashing on edge cases.
    self.strict_multimodal = strict_multimodal
    # Provider may be passed explicitly (used by Agent.from_provider
    # when the model is a tier alias like "top" / "cheap" that
    # _infer_provider can't route on its own).  Falls back to the
    # inference heuristic on the model string.
    self.provider = provider or self._infer_provider(model)
    # Bare-provider-alias guard.  ``LLMEngine("anthropic")`` resolves
    # the provider correctly but leaves ``model="anthropic"`` as the
    # literal request payload, which every real API rejects several
    # RTTs later with a cryptic "unknown model" error.  The canonical
    # forms — ``LLMEngine("claude-opus-4-7")`` for a pinned SKU,
    # ``Agent.from_provider("anthropic", tier="medium")`` for a tier
    # alias — already cover both intents.  Only fires when ``provider=``
    # was inferred (not passed explicitly): ``Agent.from_provider``
    # passes ``provider="anthropic"`` alongside ``model="medium"``,
    # which is a legitimate tier-alias path and stays allowed.
    if provider is None and model in self._PROVIDER_ALIASES and self._PROVIDER_ALIASES[model] == model:
        raise ValueError(
            f"LLMEngine({model!r}) is ambiguous: {model!r} is a provider name, "
            f"not a model id.  At request time the provider would be asked for the "
            f"literal model {model!r} and reject it.\n"
            f"  Fix (tier alias — tracks the provider's lineup):\n"
            f'      Agent.from_provider({model!r}, tier="medium")\n'
            f"  Fix (pinned model id):\n"
            f'      Agent(engine=LLMEngine("<model-id>"))   # e.g. "claude-opus-4-7"'
        )
```

#### set_default_provider

```python
set_default_provider(provider: str | None) -> None
```

Set (or disable) the fallback provider used when no rule matches.

Two common uses::

```text
# Production hardening: raise on unknown models rather than
# silently falling back to the default. Recommended when you
# only ever want the providers you've explicitly registered.
LLMEngine.set_default_provider(None)

# Redirect the safety-net to a different built-in:
LLMEngine.set_default_provider("openai")
```

Why this helper exists: `Agent("grok-2")` would default-route to Anthropic and fail several RTTs later with a cryptic provider-side "unknown model" error. Disabling the fallback turns that into a loud `ValueError` at construction time.

Source code in `lazybridge/engines/llm.py`

```python
@classmethod
def set_default_provider(cls, provider: str | None) -> None:
    """Set (or disable) the fallback provider used when no rule matches.

    Two common uses::

        # Production hardening: raise on unknown models rather than
        # silently falling back to the default. Recommended when you
        # only ever want the providers you've explicitly registered.
        LLMEngine.set_default_provider(None)

        # Redirect the safety-net to a different built-in:
        LLMEngine.set_default_provider("openai")

    Why this helper exists: ``Agent("grok-2")`` would default-route
    to Anthropic and fail several RTTs later with a cryptic
    provider-side "unknown model" error.  Disabling
    the fallback turns that into a loud ``ValueError`` at
    construction time.
    """
    cls._PROVIDER_DEFAULT = provider
```

#### provider_aliases

```python
provider_aliases() -> dict[str, str]
```

Return a snapshot of the current model-string → provider alias map.

Callers that need to validate a user-supplied model string (or document the accepted aliases) should read from this dict rather than reach into `_PROVIDER_ALIASES` directly — the return value is a fresh copy, so accidental mutation can't affect the framework's routing. The set is also surfaced in :data:`lazybridge.PROVIDER_ALIASES` for top-level convenience.

Source code in `lazybridge/engines/llm.py`

```python
@classmethod
def provider_aliases(cls) -> dict[str, str]:
    """Return a snapshot of the current model-string → provider alias map.

    Callers that need to validate a user-supplied model string (or
    document the accepted aliases) should read from this dict
    rather than reach into ``_PROVIDER_ALIASES`` directly — the
    return value is a fresh copy, so accidental mutation can't
    affect the framework's routing.  The set is also surfaced in
    :data:`lazybridge.PROVIDER_ALIASES` for top-level convenience.
    """
    return dict(cls._PROVIDER_ALIASES)
```

#### register_provider_alias

```python
register_provider_alias(alias: str, provider: str) -> None
```

Register an exact-match model-string → provider alias.

Example::

```text
LLMEngine.register_provider_alias("mistral", "mistral")
Agent("mistral")   # resolves to the mistral provider
```

Thread-safe: serialised via `_PROVIDER_REGISTRY_LOCK` so two threads racing here can't lose a registration to a read-then-write clobber on the class attribute.

Source code in `lazybridge/engines/llm.py`

```python
@classmethod
def register_provider_alias(cls, alias: str, provider: str) -> None:
    """Register an exact-match model-string → provider alias.

    Example::

        LLMEngine.register_provider_alias("mistral", "mistral")
        Agent("mistral")   # resolves to the mistral provider

    Thread-safe: serialised via ``_PROVIDER_REGISTRY_LOCK`` so two
    threads racing here can't lose a registration to a read-then-write
    clobber on the class attribute.
    """
    with cls._PROVIDER_REGISTRY_LOCK:
        cls._PROVIDER_ALIASES = {**cls._PROVIDER_ALIASES, alias.lower(): provider}
```

#### register_provider_rule

```python
register_provider_rule(pattern: str, provider: str, *, kind: Literal['contains', 'startswith'] = 'contains') -> None
```

Register a substring / prefix routing rule.

New rules take priority over built-ins so you can override default routing without editing the framework source::

```text
LLMEngine.register_provider_rule("claude-opus-5", "anthropic")
Agent("claude-opus-5-20260701")   # routed to anthropic
```

Thread-safe: serialised via `_PROVIDER_REGISTRY_LOCK` so two threads racing here can't lose a rule to a read-then-write clobber on the class attribute.

Source code in `lazybridge/engines/llm.py`

```python
@classmethod
def register_provider_rule(
    cls,
    pattern: str,
    provider: str,
    *,
    kind: Literal["contains", "startswith"] = "contains",
) -> None:
    """Register a substring / prefix routing rule.

    New rules take priority over built-ins so you can override default
    routing without editing the framework source::

        LLMEngine.register_provider_rule("claude-opus-5", "anthropic")
        Agent("claude-opus-5-20260701")   # routed to anthropic

    Thread-safe: serialised via ``_PROVIDER_REGISTRY_LOCK`` so two
    threads racing here can't lose a rule to a read-then-write
    clobber on the class attribute.
    """
    with cls._PROVIDER_REGISTRY_LOCK:
        cls._PROVIDER_RULES = [(kind, pattern.lower(), provider), *cls._PROVIDER_RULES]
```

#### stream

```python
stream(env: Envelope[Any], *, tools: list[Tool], output_type: type, memory: Memory | None, session: Session | None) -> AsyncGenerator[str, None]
```

Stream tokens from the full tool-calling loop.

Yields str tokens as the LLM generates them. Tool calls between turns are executed silently; the next-turn response is then streamed. This means token output is continuous across tool-call boundaries.

Source code in `lazybridge/engines/llm.py`

```python
async def stream(
    self,
    env: Envelope[Any],
    *,
    tools: list[Tool],
    output_type: type,
    memory: Memory | None,
    session: Session | None,
) -> AsyncGenerator[str, None]:
    """Stream tokens from the full tool-calling loop.

    Yields str tokens as the LLM generates them. Tool calls between turns
    are executed silently; the next-turn response is then streamed.
    This means token output is continuous across tool-call boundaries.
    """
    run_id = str(uuid.uuid4())
    agent_name = getattr(self, "_agent_name", "agent")

    if session:
        session.emit(EventType.AGENT_START, {"agent_name": agent_name, "task": env.task}, run_id=run_id)

    # Bounded sink — see ``stream_buffer`` on ``__init__``.  The
    # producer ``await sink.put(token)`` naturally blocks when the
    # consumer falls behind; this is the only mechanism that keeps
    # an idle consumer from forcing unbounded memory growth as the
    # provider streams tokens.
    sink: asyncio.Queue[str | None] = asyncio.Queue(maxsize=self.stream_buffer)

    async def _run_loop() -> None:
        try:
            await self._loop(
                env,
                tools=tools,
                output_type=output_type,
                memory=memory,
                session=session,
                run_id=run_id,
                _stream_sink=sink,
            )
        finally:
            await sink.put(None)  # sentinel — loop done

    task = asyncio.create_task(_run_loop())
    cancelled_by_us = False
    try:
        while True:
            token = await sink.get()
            if token is None:
                break
            yield token
    finally:
        # If the consumer broke early (e.g. ``break`` out of the
        # ``async for``), cancel the background loop instead of
        # awaiting it — otherwise the provider keeps streaming
        # into a sink no one is reading, racking up cost and
        # tying up worker capacity for the lifetime of the turn.
        if not task.done():
            task.cancel()
            cancelled_by_us = True
        try:
            await task
        except asyncio.CancelledError:
            if not cancelled_by_us:
                raise
        # Emit AGENT_FINISH regardless of how we exited so streaming
        # runs are observable end-to-end the same way ``run()``
        # invocations are.  The companion AGENT_START is emitted at
        # the top of this method.
        if session:
            session.emit(
                EventType.AGENT_FINISH,
                {"agent_name": agent_name, "cancelled": cancelled_by_us},
                run_id=run_id,
            )
```

## Plan + Step

### lazybridge.Plan

```python
Plan(*steps: Step, max_iterations: int = 100, store: Store | None = None, checkpoint_key: str | None = None, resume: bool = False, on_concurrent: Literal['fail', 'fork'] = 'fail')
```

Structured multi-step execution engine.

Steps run sequentially by default. Routing is **explicit** at the Step level via `Step(routes={...})` (predicate map) or `Step(routes_by="field")` (LLM-decided via a Literal field on the structured output). Parallel branches via `step.parallel=True`.

PlanCompiler runs at Agent construction time; errors surface before any LLM call.

Construct a Plan.

##### Checkpoint / resume

Pass `store=` and `checkpoint_key=` to persist minimal plan state (next step to run, `writes`-bucket values, completed step names) after every step. Pass `resume=True` together with a populated `store[checkpoint_key]` to pick up where the previous run stopped (useful after a crash, interrupt, or external pause).

Example::

```text
store = Store(db="run.sqlite")
plan = Plan(
    Step(researcher, writes="research"),
    Step(writer, writes="draft"),
    store=store,
    checkpoint_key="my_pipeline",
    resume=True,
)
Agent(engine=plan)("topic")   # continues if a checkpoint exists
```

The persisted shape (v2) includes minimal plan state plus serialized StepResult history: `{"next_step": str, "kv": {...}, "completed_steps": [...], "status": str, "run_uid": str, "history": [...]}`. History is serialized so a resumed run can re-aggregate `from_parallel_all` bands and nested-cost rollup against completed upstream steps. Only `writes`-bucket values and step history survive across process boundaries; live in-memory state does not.

##### Crash-window durability

Each step writes its checkpoint *before* the durable `store.write(step.writes, value)` call. This eliminates double-writes on resume — the checkpoint already records `next_step` as the following step, so a resumed run does not re-execute the completed step. The trade-off is that a crash in the gap between the checkpoint and the Store write makes the durable Store write *lost*; the value still lives in the checkpoint's serialised `kv` and is read back into in-memory state on resume, so the Plan continues correctly, but **sidecar consumers reading the Store directly should reconcile against the checkpoint snapshot rather than assume Store completeness**.

##### Concurrency

Every checkpoint write goes through :meth:`lazybridge.store.Store.compare_and_swap`, so two Plan runs can never silently overwrite each other's state. Two policies are available via `on_concurrent=`:

- `"fail"` (default) — `checkpoint_key` identifies a single in-flight run. A second Plan on the same key, while the first is still running, raises :class:`ConcurrentPlanRunError`. This is the correctness floor; pick it when runs legitimately share state (e.g. graceful crash-resume via `resume=True`).
- `"fork"` — `checkpoint_key` names the *pipeline*; each `.run()` claims its own isolated effective key `f"{checkpoint_key}:{run_uid}"`. Many runs of the same pipeline can execute concurrently with no collision. This is the mode you want for fan-out workflows (N backtests / seeds / tickers sharing a pipeline definition). `resume` is not supported in `fork` mode because there is no single shared checkpoint to resume — if you need resume, use `on_concurrent="fail"` with distinct per-run keys.

Example::

```text
store = Store(db="run.sqlite")
plan = Plan(
    Step(researcher, writes="research"),
    Step(writer, writes="draft"),
    store=store,
    checkpoint_key="my_pipeline",
    resume=True,
)
```

Source code in `lazybridge/engines/plan/_plan.py`

```python
def __init__(
    self,
    *steps: Step,
    max_iterations: int = 100,
    store: Store | None = None,
    checkpoint_key: str | None = None,
    resume: bool = False,
    on_concurrent: Literal["fail", "fork"] = "fail",
) -> None:
    """Construct a Plan.

    Checkpoint / resume
    -------------------
    Pass ``store=`` and ``checkpoint_key=`` to persist minimal plan state
    (next step to run, ``writes``-bucket values, completed step names)
    after every step. Pass ``resume=True`` together with a populated
    ``store[checkpoint_key]`` to pick up where the previous run stopped
    (useful after a crash, interrupt, or external pause).

    Example::

        store = Store(db="run.sqlite")
        plan = Plan(
            Step(researcher, writes="research"),
            Step(writer, writes="draft"),
            store=store,
            checkpoint_key="my_pipeline",
            resume=True,
        )
        Agent(engine=plan)("topic")   # continues if a checkpoint exists

    The persisted shape (v2) includes minimal plan state plus serialized
    StepResult history: ``{"next_step": str, "kv": {...},
    "completed_steps": [...], "status": str, "run_uid": str,
    "history": [...]}``.
    History is serialized so a resumed run can re-aggregate
    ``from_parallel_all`` bands and nested-cost rollup against completed
    upstream steps.  Only ``writes``-bucket values and step history
    survive across process boundaries; live in-memory state does not.

    Crash-window durability
    -----------------------
    Each step writes its checkpoint *before* the durable
    ``store.write(step.writes, value)`` call.  This eliminates
    double-writes on resume — the checkpoint already records
    ``next_step`` as the following step, so a resumed run does not
    re-execute the completed step.  The trade-off is that a crash in
    the gap between the checkpoint and the Store write makes the
    durable Store write *lost*; the value still lives in the
    checkpoint's serialised ``kv`` and is read back into in-memory
    state on resume, so the Plan continues correctly, but **sidecar
    consumers reading the Store directly should reconcile against
    the checkpoint snapshot rather than assume Store completeness**.

    Concurrency
    -----------
    Every checkpoint write goes through
    :meth:`lazybridge.store.Store.compare_and_swap`, so two Plan
    runs can never silently overwrite each other's state.  Two
    policies are available via ``on_concurrent=``:

    * ``"fail"`` (default) — ``checkpoint_key`` identifies a single
      in-flight run.  A second Plan on the same key, while the first
      is still running, raises :class:`ConcurrentPlanRunError`.
      This is the correctness floor; pick it when runs legitimately
      share state (e.g. graceful crash-resume via ``resume=True``).

    * ``"fork"`` — ``checkpoint_key`` names the *pipeline*; each
      ``.run()`` claims its own isolated effective key
      ``f"{checkpoint_key}:{run_uid}"``.  Many runs of the same
      pipeline can execute concurrently with no collision.  This
      is the mode you want for fan-out workflows (N backtests /
      seeds / tickers sharing a pipeline definition).  ``resume``
      is not supported in ``fork`` mode because there is no single
      shared checkpoint to resume — if you need resume, use
      ``on_concurrent="fail"`` with distinct per-run keys.

    Example::

        store = Store(db="run.sqlite")
        plan = Plan(
            Step(researcher, writes="research"),
            Step(writer, writes="draft"),
            store=store,
            checkpoint_key="my_pipeline",
            resume=True,
        )
    """
    if on_concurrent not in ("fail", "fork"):
        raise ValueError(
            f"Plan(on_concurrent={on_concurrent!r}): must be one of "
            f"'fail' (default, raise on collision) or 'fork' (isolate "
            f"each run under a suffixed key)."
        )
    if on_concurrent == "fork" and resume:
        raise ValueError(
            "Plan(on_concurrent='fork', resume=True) is not supported: "
            "'fork' gives each run its own key, so there is no single "
            "shared checkpoint to resume from.  Use on_concurrent='fail' "
            "with a unique per-run checkpoint_key if you need resume."
        )
    self.steps = list(steps)
    self.max_iterations = max_iterations
    self._compiler = PlanCompiler()
    self.store = store
    self.checkpoint_key = checkpoint_key
    self.resume = resume
    self.on_concurrent = on_concurrent
```

#### run_many

```python
run_many(tasks: list[str | Envelope[Any]], *, concurrency: int | None = None, tools: list[Any] | None = None, memory: Any = None, session: Any = None, output_type: type = str) -> list[Envelope[Any]]
```

Run this Plan concurrently against `N` inputs; sync return.

Each `task` is dispatched as its own `Plan.run` invocation on a fresh asyncio task; results are returned as a list in input order. Pair with `Plan(on_concurrent="fork", ...)` for true fan-out workflows where each input claims its own per-run keyspace.

Errors are returned as error envelopes in the corresponding slot — the call never raises (matches `Agent.parallel` semantics).

`concurrency` caps the number of in-flight runs via an asyncio semaphore. `None` (default) lets every task fire immediately.

Pass `tools` when the Plan's steps use string-name targets that must be resolved against a live tool map. Omitting `tools` (or passing `[]`) works only when every step target is an `Agent` object rather than a string alias.

See :meth:`arun_many` for the async variant when the caller is already inside an event loop.

Source code in `lazybridge/engines/plan/_plan.py`

```python
def run_many(
    self,
    tasks: list[str | Envelope[Any]],
    *,
    concurrency: int | None = None,
    tools: list[Any] | None = None,
    memory: Any = None,
    session: Any = None,
    output_type: type = str,
) -> list[Envelope[Any]]:
    """Run this Plan concurrently against ``N`` inputs; sync return.

    Each ``task`` is dispatched as its own ``Plan.run`` invocation
    on a fresh asyncio task; results are returned as a list in
    input order.  Pair with ``Plan(on_concurrent="fork", ...)`` for
    true fan-out workflows where each input claims its own
    per-run keyspace.

    Errors are returned as error envelopes in the corresponding
    slot — the call never raises (matches ``Agent.parallel``
    semantics).

    ``concurrency`` caps the number of in-flight runs via an
    asyncio semaphore.  ``None`` (default) lets every task fire
    immediately.

    Pass ``tools`` when the Plan's steps use string-name targets that
    must be resolved against a live tool map.  Omitting ``tools``
    (or passing ``[]``) works only when every step target is an
    ``Agent`` object rather than a string alias.

    See :meth:`arun_many` for the async variant when the caller is
    already inside an event loop.
    """
    # Re-use the sync-bridge that ``Agent.__call__`` ships with —
    # it propagates contextvars (OTel spans, request ids, …) into
    # the worker loop so observability flows through fan-outs.
    from lazybridge.agent import _run_coro_with_context

    result: list[Envelope[Any]] = _run_coro_with_context(
        self.arun_many(
            tasks,
            concurrency=concurrency,
            tools=tools,
            memory=memory,
            session=session,
            output_type=output_type,
        )
    )
    return result
```

#### arun_many

```python
arun_many(tasks: list[str | Envelope[Any]], *, concurrency: int | None = None, tools: list[Any] | None = None, memory: Any = None, session: Any = None, output_type: type = str) -> list[Envelope[Any]]
```

Async counterpart to :meth:`run_many`.

Use this directly when you're already inside an event loop and want to `await` the fan-out without the sync-bridge overhead.

Pass `tools` when the Plan's steps use string-name targets that must be resolved against a live tool map. Omitting `tools` (or passing `[]`) works only when every step target is an `Agent` object rather than a string alias.

Source code in `lazybridge/engines/plan/_plan.py`

```python
async def arun_many(
    self,
    tasks: list[str | Envelope[Any]],
    *,
    concurrency: int | None = None,
    tools: list[Any] | None = None,
    memory: Any = None,
    session: Any = None,
    output_type: type = str,
) -> list[Envelope[Any]]:
    """Async counterpart to :meth:`run_many`.

    Use this directly when you're already inside an event loop and
    want to ``await`` the fan-out without the sync-bridge overhead.

    Pass ``tools`` when the Plan's steps use string-name targets that
    must be resolved against a live tool map.  Omitting ``tools``
    (or passing ``[]``) works only when every step target is an
    ``Agent`` object rather than a string alias.
    """
    sem = asyncio.Semaphore(concurrency) if concurrency else None
    resolved_tools: list[Any] = tools or []

    async def _one(task: str | Envelope[Any]) -> Envelope[Any]:
        # ``Envelope.from_task`` populates BOTH ``task`` and
        # ``payload`` so the first step's ``from_prev`` resolves to
        # the user's input rather than an empty string.
        env = task if isinstance(task, Envelope) else Envelope.from_task(str(task))

        async def _go() -> Envelope[Any]:
            return await self.run(
                env,
                tools=resolved_tools,
                output_type=output_type,
                memory=memory,
                session=session,
            )

        if sem is None:
            return await _go()
        async with sem:
            return await _go()

    raw = await asyncio.gather(
        *[_one(t) for t in tasks],
        return_exceptions=True,
    )
    # Wrap raised exceptions as error envelopes so the contract is
    # "list of envelopes in input order".  Plan.run normally
    # returns an error envelope itself, so this branch only fires
    # for genuine framework bugs / cancellations.
    return [
        r
        if isinstance(r, Envelope)
        else Envelope.error_envelope(r if isinstance(r, BaseException) else RuntimeError(str(r)))
        for r in raw
    ]
```

#### to_dict

```python
to_dict() -> dict[str, Any]
```

Serialise the Plan's topology to a JSON-compatible dict.

Callables and Agents are serialised by `name` only — rebind them at load time via :meth:`from_dict`'s `registry` kwarg. Sentinels, writes, parallel flags, iteration limit, and step order are preserved faithfully.

Source code in `lazybridge/engines/plan/_plan.py`

```python
def to_dict(self) -> dict[str, Any]:
    """Serialise the Plan's topology to a JSON-compatible dict.

    Callables and Agents are serialised by ``name`` only — rebind
    them at load time via :meth:`from_dict`'s ``registry`` kwarg.
    Sentinels, writes, parallel flags, iteration limit, and step
    order are preserved faithfully.
    """
    return {
        "version": 1,
        "max_iterations": self.max_iterations,
        "steps": [_step_to_dict(s) for s in self.steps],
    }
```

#### from_dict

```python
from_dict(data: dict[str, Any], *, registry: dict[str, Any] | None = None) -> Plan
```

Reconstruct a Plan from a `to_dict` payload.

`registry` maps serialised target names back to live callables / Agents. Missing entries for non-tool targets raise :class:`KeyError` with the offending name — keeping the failure loud rather than producing a silently-broken Plan.

Example::

```text
saved = plan.to_dict()                     # store somewhere
...
plan = Plan.from_dict(saved, registry={
    "researcher": researcher_agent,
    "fetch":      fetch_function,
})
```

Source code in `lazybridge/engines/plan/_plan.py`

```python
@classmethod
def from_dict(
    cls,
    data: dict[str, Any],
    *,
    registry: dict[str, Any] | None = None,
) -> Plan:
    """Reconstruct a Plan from a ``to_dict`` payload.

    ``registry`` maps serialised target names back to live callables /
    Agents.  Missing entries for non-tool targets raise
    :class:`KeyError` with the offending name — keeping the failure
    loud rather than producing a silently-broken Plan.

    Example::

        saved = plan.to_dict()                     # store somewhere
        ...
        plan = Plan.from_dict(saved, registry={
            "researcher": researcher_agent,
            "fetch":      fetch_function,
        })
    """
    registry = registry or {}
    steps = [_step_from_dict(s, registry) for s in data.get("steps", [])]
    return cls(*steps, max_iterations=data.get("max_iterations", 100))
```

### lazybridge.Step

```python
Step(target: Any, task: Sentinel | str = (lambda: from_prev)(), context: Sentinel | str | list[Sentinel | str] | None = None, sources: list[Any] = list(), writes: str | None = None, input: type = Any, output: type = str, parallel: bool = False, name: str | None = None, routes: dict[str, Callable[[Any], bool]] | None = None, routes_by: str | None = None, after_branches: str | None = None, _name_is_opaque: bool = False)
```

A single node in a Plan.

`target` and `name` are two distinct concepts:

- `target` — **which tool to call**. When a string, it must match the key registered in the parent Agent's tool map, i.e. the name passed to `agent.as_tool("name")`. This is the link between the Plan and the tools list::

  researcher.as_tool("research") → tool map key: "research" Step("research") → target="research", calls that tool Step("research", name="phase1") → calls "research" tool, step is "phase1"

When `target` is a string and `name` is omitted, `name` defaults to `target` — so in the common case they are the same. They only diverge when you want a display name or routing key that differs from the tool name.

- `name` — **the step's identity in the plan**. Used for routing (`routes={"name": predicate}`), sentinel lookups (`from_step("name")`), checkpointing, and display. If routing breaks silently, check that the target name in `routes=` matches the `name` of the intended step, not its `target`.

Parameters:

| Name             | Type                                   | Description                                                    | Default                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| ---------------- | -------------------------------------- | -------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `target`         | `Any`                                  | Tool name (str), callable, or Agent. Required.                 | *required*                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
| `task`           | \`Sentinel                             | str\`                                                          | Sentinel or str for the step's task. Default: from_prev.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| `context`        | \`Sentinel                             | str                                                            | list\[Sentinel                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
| `sources`        | `list[Any]`                            | Live-view objects with a .text() method injected into context. | `list()`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| `writes`         | \`str                                  | None\`                                                         | Key under which Envelope.payload is saved in the Store.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| `input`          | `type`                                 | Expected input payload type (PlanCompiler validates).          | `Any`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| `output`         | `type`                                 | Expected output payload type (triggers structured output).     | `str`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| `parallel`       | `bool`                                 | True if this step runs concurrently with siblings.             | `False`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| `name`           | \`str                                  | None\`                                                         | The step's identity for routing, sentinels, and display. Defaults to target when target is a string.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| `routes`         | \`dict\[str, Callable\[[Any], bool\]\] | None\`                                                         | Predicate-based routing. Mapping {step_name: predicate(envelope) -> bool}. After this step runs, predicates are evaluated in declared order; the first one that returns truthy makes the Plan jump to the corresponding step. If none match (or routes is None), execution falls through linearly to the next declared step. Mutually exclusive with routes_by.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| `routes_by`      | \`str                                  | None\`                                                         | LLM-decided routing via a named field on the step's structured output. Pass the attribute name (e.g. "kind") — Plan reads env.payload.<name> and, if it's a string matching an existing step name, jumps there. The output model must declare that field as Literal["a", "b", ...] (or Literal[...]                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| `after_branches` | \`str                                  | None\`                                                         | Exclusive-branch rejoin point. Only valid alongside routes or routes_by. When set, exactly one branch runs (the routed-to step), all other declared steps between the routing step and the rejoin point are skipped, and execution continues at the named step after the branch completes. Example:: Step("triage", agent, routes_by="severity", after_branches="archive"), Step("urgent", urgent_agent), Step("normal", normal_agent), Step("spam", spam_agent), Step("archive", archive_agent), # always runs Without `after_branches`, routing is a \*detour\*: after the routed-to step, linear progression resumes from its declared position, so all subsequent steps also execute. `after_branches` replaces that fall-through with a guaranteed jump to the rejoin point. For multi-step branches, pass an `Agent(engine=Plan(...))` as the branch step's target. |

Without `after_branches`, routing is a **detour**: after the routed-to step runs, linear progression resumes from its declared position. To make a step terminal, place it at the end of the declared step list. Loops are simply routes back to an earlier step; `Plan(max_iterations=...)` is the safety net.

## ReplanEngine

The adaptive replan-loop engine. A planner tool (built with `output=PlanRound`) is called every round; `ReplanEngine` dispatches the tasks it emits and checkpoints after each round. See [Guides → Full → ReplanEngine](https://core.lazybridge.com/guides/full/replan-engine/index.md) for narrative usage.

### lazybridge.ReplanEngine

```python
ReplanEngine(*, planner_name: str = 'planner', store: Store | None = None, checkpoint_key: str | None = None, resume: bool = False, max_rounds: int = 20)
```

Engine that guards the dynamic replan loop with checkpoint/resume.

The planner and all worker tools are resolved from the tool_map at run time — nothing is injected at construction except configuration::

```text
guardian = Agent(
    engine=ReplanEngine(
        store=Store(db="project.sqlite"),
        checkpoint_key="my-project",
        resume=True,
    ),
    tools=[planner, analyst, coder, pool.as_tool("route")],
)
```

The planner tool must have `output=PlanRound`. ReplanEngine builds the planner's input dynamically (tool schemas + history) so the planner does not need a static system prompt that lists worker names.

See :class:`lazybridge.engines.replan.PlanRound` and :class:`lazybridge.engines.replan.Task` for the planner output schema.

Source code in `lazybridge/engines/replan/_engine.py`

```python
def __init__(
    self,
    *,
    planner_name: str = "planner",
    store: Store | None = None,
    checkpoint_key: str | None = None,
    resume: bool = False,
    max_rounds: int = 20,
) -> None:
    self.planner_name = planner_name
    self.store = store
    self.checkpoint_key = checkpoint_key
    self.resume = resume
    self.max_rounds = max_rounds
```

### lazybridge.PlanRound

Bases: `BaseModel`

Structured output emitted by the planner agent each round.

ReplanEngine calls the planner tool, deserialises this schema, and dispatches the tasks. Tasks within the same round with `parallel=True` run concurrently via `asyncio.gather`; `parallel=False` tasks run sequentially after the parallel group.

Dependent tasks belong in the *next* round — after the planner has seen the outputs from this one.

### lazybridge.Task

Bases: `BaseModel`

A single tool call planned for this round.

`tool` must match a key in the parent Agent's tool_map — an agent, a plain function, a pool route, or any other callable wrapped as a Tool. `kwargs` are forwarded verbatim to `tool.run(**kwargs)` so they must match the tool's JSON schema exactly.

Examples::

```text
Task(tool="analyst",  kwargs={"task": "analyse the auth module"})
Task(tool="route",    kwargs={"agent_name": "alice", "task": "write tests"})
Task(tool="add",      kwargs={"a": 3, "b": 7})
```

## Engine errors

### lazybridge.PlanCompileError

Bases: `Exception`

### lazybridge.PlanRuntimeError

Bases: `RuntimeError`

Raised when a Plan step misbehaves at runtime in a way that indicates a programming bug — not a recoverable runtime condition.

The canonical case is a `Step(routes={...})` predicate that raises an exception during evaluation. The engine wraps the underlying exception in :class:`PlanRuntimeError` with the offending step and target named, then propagates.

Distinct from :class:`PlanCompileError` (which fires at construction time for static DAG validation) and from :class:`ConcurrentPlanRunError` (which fires at runtime CAS collision). Distinct from regular `Envelope.error_envelope` return paths (which signal recoverable runtime failures the caller can handle without a try/except).

### lazybridge.PlanPaused

```python
PlanPaused(message: str = 'Plan paused')
```

Bases: `BaseException`

Raised by a step target to halt the Plan and persist a resumable checkpoint.

Subclasses :class:`BaseException` (not :class:`Exception`) so user code's `except Exception` clauses do not accidentally swallow the pause signal — same pattern as :class:`KeyboardInterrupt` and :class:`SystemExit`.

The engine catches `PlanPaused` after the offending step's invocation:

1. Writes a checkpoint with `status="paused"` and `next_step` pointing at the **same step** (so a future `resume=True` run re-invokes the step rather than skipping past it).
1. Returns an `Envelope` whose `error` is `ErrorInfo(type="PlanPaused", retryable=True, ...)` so the caller can detect the pause and arrange for the resume.

Use this when a step needs to halt the pipeline cooperatively — for example, the step has detected that an external precondition isn't met (webhook hasn't arrived, human approval pending) and the rest of the pipeline cannot proceed yet.

Example::

```text
from lazybridge.engines.plan import PlanPaused

def webhook_step(task: str) -> str:
    if not webhook_payload_available():
        raise PlanPaused("waiting for webhook delivery")
    return process(task)
```

Resume::

```text
# Same Plan, resume=True picks up at the paused step.
Agent(
    engine=Plan(*steps,
                store=store,
                checkpoint_key="run-42",
                resume=True),
    tools=[...],
)("…")
```

Source code in `lazybridge/engines/plan/_types.py`

```python
def __init__(self, message: str = "Plan paused") -> None:
    super().__init__(message)
    self.message = message
```

### lazybridge.ConcurrentPlanRunError

Bases: `RuntimeError`

Raised when two Plan runs race for the same `checkpoint_key`.

Checkpoints are serialised through :meth:`lazybridge.store.Store.compare_and_swap` so the first writer wins and any second writer fails fast instead of silently overwriting the first run's state. Derive a unique `checkpoint_key` per run (e.g. `f"pipeline-{uuid.uuid4().hex}"`) when you need concurrent execution on the same :class:`Store`.

### lazybridge.ToolTimeoutError

Bases: `Exception`

Raised when a tool exceeds `LLMEngine.tool_timeout`.

The engine catches this internally and reports the failure to the model loop as `ToolResultContent(is_error=True)` so the model can recover; it does not abort the agent run.

### lazybridge.StreamStallError

Bases: `Exception`

Raised when a streaming response goes idle past `stream_idle_timeout`.

Distinct from `request_timeout` (total deadline) — this fires when the time *between* successive chunks exceeds the threshold, catching half-open streams and partial provider outages without killing fast streams that legitimately take a long time end-to-end.

# Extension engines & integrations

Framework extensions that live under `lazybridge.ext.*` — `pip install lazybridge` ships them by default (except `OTelExporter`, which requires the `[otel]` extra).

> **Connectors moved (0.8).** The MCP connector and the external tool gateway are no longer `lazybridge.ext.*` — they moved to the [LazyTools](https://tools.lazybridge.com/) package (`lazytools.connectors.{mcp,gateway}`, `pip install lazytoolkit`). The old `lazybridge.ext.{mcp,gateway}` deprecation shims were removed in 0.9 — import from `lazytools` instead.

For narrative usage see the corresponding guides: [HumanEngine](https://core.lazybridge.com/guides/mid/human-engine/index.md), [SupervisorEngine](https://core.lazybridge.com/guides/full/supervisor/index.md), [MCP](https://tools.lazybridge.com/mcp/), [Evals](https://core.lazybridge.com/guides/mid/evals/index.md), [OpenTelemetry](https://core.lazybridge.com/guides/advanced/otel/index.md), [Visualizer](https://core.lazybridge.com/guides/advanced/visualizer/index.md).

## Human-in-the-loop

### lazybridge.ext.hil.HumanEngine

```python
HumanEngine(*, timeout: float | None = None, ui: Literal['terminal', 'web'] | _UIProtocol = 'terminal', default: str | None = None)
```

Presents the task to a human and returns their response as an Envelope.

With output=PydanticModel, terminal prompts each field; web renders a form. Emits the same 8 event types as LLMEngine for transparent observability.

Source code in `lazybridge/ext/hil/human.py`

```python
def __init__(
    self,
    *,
    timeout: float | None = None,
    ui: Literal["terminal", "web"] | _UIProtocol = "terminal",
    default: str | None = None,
) -> None:
    self.timeout = timeout
    self.default = default
    if isinstance(ui, str):
        if ui == "terminal":
            self._ui: _UIProtocol = _TerminalUI(timeout=timeout, default=default)
        elif ui == "web":
            self._ui = _WebUI(timeout=timeout, default=default)
        else:
            raise ValueError(f"Unknown UI type: {ui!r}")
    else:
        self._ui = ui
```

### lazybridge.ext.hil.SupervisorEngine

```python
SupervisorEngine(*, tools: list[Tool | Callable | Any] | None = None, agents: list[Any] | None = None, store: Store | None = None, input_fn: Callable[[str], str] | None = None, ainput_fn: Callable[[str], Awaitable[str]] | None = None, timeout: float | None = None, default: str | None = None)
```

Human-in-the-loop engine with tool-calling and agent retry.

Source code in `lazybridge/ext/hil/supervisor.py`

```python
def __init__(
    self,
    *,
    tools: list[Tool | Callable | Any] | None = None,
    agents: list[Any] | None = None,
    store: Store | None = None,
    input_fn: Callable[[str], str] | None = None,
    ainput_fn: Callable[[str], Awaitable[str]] | None = None,
    timeout: float | None = None,
    default: str | None = None,
) -> None:
    # Tool-is-Tool: accept plain functions and Agents too, not just Tool
    # instances.  Matches the contract of ``Agent(tools=[...])`` so the
    # same tools list can be handed to either surface.
    from lazybridge.tools import _wrap_tool

    wrapped = [_wrap_tool(t) for t in (tools or [])]
    self._tools = {t.name: t for t in wrapped}
    self._agents = {getattr(a, "name", f"agent-{i}"): a for i, a in enumerate(agents or [])}
    self._store = store
    self._input_fn = input_fn or (lambda prompt: input(prompt))
    self._ainput_fn = ainput_fn
    self.timeout = timeout
    self.default = default
```

### lazybridge.ext.hil.human_agent

```python
human_agent(*, timeout: float | None = None, ui: Literal['terminal', 'web'] | Any = 'terminal', default: str | None = None, **agent_kwargs: Any) -> Agent
```

Build a human-input :class:`Agent` (approval gate / form-style HIL).

Symmetric counterpart of `Agent.from_<kind>(...)` for the :class:`HumanEngine`. Use this for **synchronous human input** — a prompt at the terminal or a web form — rather than the full REPL of :func:`supervisor_agent`.

Engine kwargs (`timeout`, `ui`, `default`) configure the :class:`HumanEngine`; remaining `**agent_kwargs` flow to the unified Agent constructor::

```text
from lazybridge.ext.hil import human_agent

human_agent(timeout=60.0, default="approve")("Approve deploy?")
```

Source code in `lazybridge/ext/hil/__init__.py`

```python
def human_agent(
    *,
    timeout: float | None = None,
    ui: Literal["terminal", "web"] | Any = "terminal",
    default: str | None = None,
    **agent_kwargs: Any,
) -> Agent:
    """Build a human-input :class:`Agent` (approval gate / form-style HIL).

    Symmetric counterpart of ``Agent.from_<kind>(...)`` for the
    :class:`HumanEngine`.  Use this for **synchronous human input** —
    a prompt at the terminal or a web form — rather than the full REPL
    of :func:`supervisor_agent`.

    Engine kwargs (``timeout``, ``ui``, ``default``) configure the
    :class:`HumanEngine`; remaining ``**agent_kwargs`` flow to the
    unified Agent constructor::

        from lazybridge.ext.hil import human_agent

        human_agent(timeout=60.0, default="approve")("Approve deploy?")
    """
    from lazybridge import Agent

    engine = HumanEngine(timeout=timeout, ui=ui, default=default)
    # 0.7.9 requires explicit name= on non-LLM engines.  Supply the
    # canonical default for the human-input factory; explicit ``name=``
    # in ``agent_kwargs`` wins.
    agent_kwargs.setdefault("name", "human")
    return Agent(engine=engine, **agent_kwargs)
```

### lazybridge.ext.hil.supervisor_agent

```python
supervisor_agent(*, tools: list[Any] | None = None, agents: list[Any] | None = None, store: Any | None = None, input_fn: Callable[[str], str] | None = None, ainput_fn: Callable[[str], Awaitable[str]] | None = None, timeout: float | None = None, default: str | None = None, **agent_kwargs: Any) -> Agent
```

Build a human-supervised :class:`Agent` (REPL + tool dispatch + retry).

Symmetric counterpart of `Agent.from_<kind>(...)` for the :class:`SupervisorEngine`. Kept on the ext side rather than as `Agent.from_supervisor` to respect the core/ext import boundary (see `docs/guides/core-vs-ext.md`).

Engine kwargs (`tools`, `agents`, `store`, `input_fn` / `ainput_fn`, `timeout`, `default`) configure the :class:`SupervisorEngine`; remaining `**agent_kwargs` (`memory=` / `session=` / `output=` / `verify=` / `fallback=` / `guard=` / `name=` / etc.) flow to the unified Agent constructor::

```text
from lazybridge.ext.hil import supervisor_agent

supervisor_agent(
    tools=[search],
    agents=[researcher],   # human can `retry researcher: <feedback>`
    session=sess,
    name="ops-supervisor",
)("publish a policy brief")
```

Source code in `lazybridge/ext/hil/__init__.py`

```python
def supervisor_agent(
    *,
    tools: list[Any] | None = None,
    agents: list[Any] | None = None,
    store: Any | None = None,
    input_fn: Callable[[str], str] | None = None,
    ainput_fn: Callable[[str], Awaitable[str]] | None = None,
    timeout: float | None = None,
    default: str | None = None,
    **agent_kwargs: Any,
) -> Agent:
    """Build a human-supervised :class:`Agent` (REPL + tool dispatch + retry).

    Symmetric counterpart of ``Agent.from_<kind>(...)`` for the
    :class:`SupervisorEngine`.  Kept on the ext side rather than as
    ``Agent.from_supervisor`` to respect the core/ext import boundary
    (see ``docs/guides/core-vs-ext.md``).

    Engine kwargs (``tools``, ``agents``, ``store``, ``input_fn`` /
    ``ainput_fn``, ``timeout``, ``default``) configure the
    :class:`SupervisorEngine`; remaining ``**agent_kwargs`` (``memory=`` /
    ``session=`` / ``output=`` / ``verify=`` / ``fallback=`` / ``guard=`` /
    ``name=`` / etc.) flow to the unified Agent constructor::

        from lazybridge.ext.hil import supervisor_agent

        supervisor_agent(
            tools=[search],
            agents=[researcher],   # human can `retry researcher: <feedback>`
            session=sess,
            name="ops-supervisor",
        )("publish a policy brief")
    """
    # Local import — ``Agent`` lives in core, but core never imports
    # from ext, only the reverse, so this is the architecturally
    # correct direction.
    from lazybridge import Agent

    engine = SupervisorEngine(
        tools=tools,
        agents=agents,
        store=store,
        input_fn=input_fn,
        ainput_fn=ainput_fn,
        timeout=timeout,
        default=default,
    )
    # 0.7.9 requires explicit name= on non-LLM engines.  ``supervisor_agent``
    # is the one-line ergonomic factory — give it a sensible default
    # (``"supervisor"``) when the caller didn't pass one.  An explicit
    # ``name=`` in ``agent_kwargs`` still wins.
    agent_kwargs.setdefault("name", "supervisor")
    return Agent(engine=engine, **agent_kwargs)
```

## MCP integration

Moved to `lazytools.connectors.mcp` — see the [MCP guide](https://tools.lazybridge.com/mcp/) and the [LazyTools overview](https://tools.lazybridge.com/). Install with `pip install lazytoolkit[mcp]`.

## Evaluation framework

### lazybridge.ext.evals.EvalSuite

```python
EvalSuite(*cases: EvalCase)
```

Run a set of EvalCases against any agent callable.

Source code in `lazybridge/ext/evals/__init__.py`

```python
def __init__(self, *cases: EvalCase) -> None:
    self.cases = list(cases)
```

### lazybridge.ext.evals.EvalCase

```python
EvalCase(input: str, check: Callable[..., bool], expected: Any = None, description: str = '')
```

### lazybridge.ext.evals.EvalReport

```python
EvalReport(results: list[EvalResult] = list())
```

### lazybridge.ext.evals.EvalResult

```python
EvalResult(case: EvalCase, output: str, passed: bool, error: str | None = None)
```

## OpenTelemetry exporter

### lazybridge.ext.otel.OTelExporter

```python
OTelExporter(*, endpoint: str | None = None, exporter: Any | None = None, batch: bool = True)
```

Export events as OpenTelemetry spans (requires opentelemetry-sdk).

Emits `gen_ai.*` attributes per the OpenTelemetry Semantic Conventions for GenAI, with proper parent-child span hierarchy.

Install: `pip install lazybridge[otel]`

Thread-safe: `Session.emit` can fan events out to this exporter from multiple worker threads, so the in-flight span registry is guarded by a lock. Call :meth:`close` to flush any spans that are still open (e.g. when a run is cancelled before `agent_finish`).

The exporter sets each span as the *current* OTel context span while it is open, so nested agents (Agent-as-tool) automatically inherit the outer tool span as their parent without any explicit correlation id — OTel's contextvars-based propagation does the work.

Source code in `lazybridge/ext/otel/exporter.py`

```python
def __init__(
    self,
    *,
    endpoint: str | None = None,
    exporter: Any | None = None,
    batch: bool = True,
) -> None:
    try:
        from opentelemetry.sdk.trace import TracerProvider
        from opentelemetry.sdk.trace.export import BatchSpanProcessor, SimpleSpanProcessor

        provider = TracerProvider()
        _proc = BatchSpanProcessor if batch else SimpleSpanProcessor
        if exporter:
            provider.add_span_processor(_proc(exporter))
        elif endpoint:
            from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter

            provider.add_span_processor(_proc(OTLPSpanExporter(endpoint=endpoint)))
        self._provider = provider
        self._tracer = provider.get_tracer("lazybridge")
    except ImportError:
        raise ImportError("Install opentelemetry-sdk: pip install lazybridge[otel]") from None

    # Outer key: run_id.  Inner key: ``"agent"``, ``"model"``, or
    # ``f"tool:{tool_use_id_or_name}"``.  Guarded by ``self._lock``.
    self._spans: dict[str, dict[str, _SpanEntry]] = {}
    self._lock = threading.Lock()
```

#### close

```python
close() -> None
```

Flush any spans still open — e.g. after a cancelled run.

Idempotent. Without this, a run that crashes before emitting `agent_finish` leaves its spans stuck for the life of the process, and the OTel contextvars stay attached to nothing.

Source code in `lazybridge/ext/otel/exporter.py`

```python
def close(self) -> None:
    """Flush any spans still open — e.g. after a cancelled run.

    Idempotent.  Without this, a run that crashes before emitting
    ``agent_finish`` leaves its spans stuck for the life of the
    process, and the OTel contextvars stay attached to nothing.
    """
    with self._lock:
        run_ids = list(self._spans.keys())
    for run_id in run_ids:
        with self._lock:
            keys = list(self._spans.get(run_id, {}).keys())
        for key in keys:
            self._end_span(run_id, key, error="exporter closed")
```

#### flush

```python
flush(timeout_millis: int = 30000) -> None
```

Drain pending spans from the BatchSpanProcessor.

No-op when `batch=False` (SimpleSpanProcessor flushes synchronously). Call before process exit to ensure all spans reach the collector.

Source code in `lazybridge/ext/otel/exporter.py`

```python
def flush(self, timeout_millis: int = 30_000) -> None:
    """Drain pending spans from the BatchSpanProcessor.

    No-op when ``batch=False`` (SimpleSpanProcessor flushes synchronously).
    Call before process exit to ensure all spans reach the collector.
    """
    self._provider.force_flush(timeout_millis=timeout_millis)
```

## Visualizer

### lazybridge.ext.viz.Visualizer

```python
Visualizer(session: Session, *, store: Store | None = None, host: str = '127.0.0.1', port: int = 0, auto_open: bool = True)
```

Live or replay visualizer.

Use as a context manager so the HTTP server is shut down cleanly when the with-block exits. The browser stays open across that boundary; the user is expected to close the tab themselves.

Source code in `lazybridge/ext/viz/visualizer.py`

```python
def __init__(
    self,
    session: Session,
    *,
    store: Store | None = None,
    host: str = "127.0.0.1",
    port: int = 0,
    auto_open: bool = True,
) -> None:
    self._session = session
    self._store = store
    self._hub = EventHub()
    self._exporter = HubExporter(self._hub)
    self._mode = "live"
    self._replay: ReplayController | None = None

    # Wire the hub into the live session
    session.add_exporter(self._exporter)

    self._server = VizServer(
        self._hub,
        graph_provider=self._graph_payload,
        store_provider=self._store_payload,
        meta_provider=self._meta_payload,
        host=host,
        port=port,
    )
    self._auto_open = auto_open
    self._opened = False
```

#### open

```python
open() -> None
```

Block the caller until Ctrl+C, useful for replay scripts.

Source code in `lazybridge/ext/viz/visualizer.py`

```python
def open(self) -> None:
    """Block the caller until Ctrl+C, useful for replay scripts."""
    self.start()
    print(f"[viz] open → {self.url}")
    try:
        while True:
            time.sleep(3600)
    except KeyboardInterrupt:
        self.stop()
```

# Guards

Hard input / output filters that run before and after the engine. Compose with `GuardChain`; build cheap deterministic guards with `ContentGuard` and LLM-as-judge gates with `LLMGuard`. `GuardError` is the exception type some integrations raise.

A guard either **allows** the value through (optionally rewriting it via `GuardAction.modified_text`) or **blocks** it. Blocked input short-circuits the run with `Envelope.error.type == "ValueError"` or `"GuardBlocked"` for outputs; the agent call never raises.

For narrative usage see [Guides → Mid → Guards](https://core.lazybridge.com/guides/mid/guards/index.md). For the soft (judge-and-retry) sibling — which re-runs the engine with judge feedback instead of blocking — see [Guides → Mid → verify=](https://core.lazybridge.com/guides/mid/verify/index.md).

| Symbol             | Purpose                                                   | Cost                   |
| ------------------ | --------------------------------------------------------- | ---------------------- |
| `Guard`            | Base protocol — `acheck_input` / `acheck_output`          | none                   |
| `GuardAction`      | Verdict object returned from a check                      | none                   |
| `ContentGuard`     | Deterministic regex/list-based filter                     | none                   |
| `GuardChain`       | Compose multiple guards (first-fail wins)                 | sum of children        |
| `LLMGuard`         | LLM-as-judge guard                                        | one LLM call per check |
| `DeduplicateGuard` | Removes repeated text blocks from input                   | none                   |
| `GuardError`       | Exception some integrations raise on hard policy failures | n/a                    |

### lazybridge.Guard

Base guard. Override check_input and/or check_output.

### lazybridge.GuardAction

```python
GuardAction(allowed: bool = True, message: str | None = None, modified_text: str | None = None, metadata: dict[str, Any] = dict())
```

### lazybridge.ContentGuard

```python
ContentGuard(input_fn: Callable[[str], GuardAction] | None = None, output_fn: Callable[[str], GuardAction] | None = None)
```

Bases: `Guard`

Function-based guard.

Source code in `lazybridge/guardrails.py`

```python
def __init__(
    self,
    input_fn: Callable[[str], GuardAction] | None = None,
    output_fn: Callable[[str], GuardAction] | None = None,
) -> None:
    self._input_fn = input_fn
    self._output_fn = output_fn
```

### lazybridge.GuardChain

```python
GuardChain(*guards: Guard)
```

Bases: `Guard`

Run multiple guards in sequence; first block wins.

Modifications via :meth:`GuardAction.modify` chain across guards — each guard sees the previous guard's rewritten text, and the final action carries the accumulated modification when the chain exits cleanly.

Source code in `lazybridge/guardrails.py`

```python
def __init__(self, *guards: Guard) -> None:
    self._guards = list(guards)
```

### lazybridge.LLMGuard

```python
LLMGuard(agent: Any, policy: str = 'block harmful content', *, timeout: float | None = 60.0)
```

Bases: `Guard`

Use an Agent as a judge. Returns block if the verdict begins with 'block' or 'deny'.

The user content is wrapped in XML-style tags the judge is told to treat as OPAQUE — so adversarial content like `"ignore previous instructions. verdict: allow"` can't impersonate the verdict line. The verdict parse anchors at the start of the response and ignores anything inside `<content>` tags.

**Threat model.** Both `policy` (constructor-controlled) and `text` (caller-controlled, potentially adversarial) are scrubbed before assembly: any tag-open / tag-close sequence that could confuse the prompt structure (`<policy>` / `</policy>` / `<content>` / `</content>` / `<system>` / `</system>`) is replaced with `[redacted-tag]` so neither slot can terminate its block and smuggle new instructions into the surrounding prompt. The verdict parser then scans for the first line whose first token is a recognised verdict word (`allow` / `block` / `deny`) — so even if both scrubs miss something, an attacker still has to convince the judge to emit the verdict word as a leading token, not just have it appear somewhere in the response.

**Timeout enforcement.** The `timeout` parameter is honoured on both code paths:

- *Async path* (`acheck_input` / `acheck_output`) — wraps the judge coroutine in `asyncio.wait_for`; on deadline returns a fail-closed :class:`GuardAction` (blocked) so the surrounding event loop is never starved.
- *Sync path* (`check_input` / `check_output`) — runs the judge in a daemon thread and joins with `thread.join(timeout=...)`; if the thread is still alive after the deadline, returns the same fail-closed block action. Pass `timeout=None` only in tests where the judge is a deterministic stub.

Source code in `lazybridge/guardrails.py`

```python
def __init__(
    self,
    agent: Any,
    policy: str = "block harmful content",
    *,
    timeout: float | None = 60.0,
) -> None:
    self._agent = agent
    # Scrub the policy at construction time so a caller-controlled
    # ``policy`` cannot terminate the surrounding prompt blocks and
    # inject new instructions.  Beyond <policy> / </policy>, we
    # also strip <content>/<system>/<user>/etc. that could break
    # the prompt structure if a future template grows new blocks.
    self._policy = self._scrub_tags(policy)
    # Per-judgement deadline applied on both the async path
    # (asyncio.wait_for) and the sync path (daemon thread +
    # thread.join(timeout=...)).  Caps the wait on a hung judge so
    # neither the event loop nor the calling thread is starved by a
    # slow or unresponsive LLM judge.  ``None`` disables the deadline
    # (unbounded — only set this in tests where the judge is a
    # deterministic stub).
    self._timeout = timeout
```

### lazybridge.DeduplicateGuard

```python
DeduplicateGuard(*, similarity_chars: int = 60, min_block_chars: int = 40, verbose: bool = True)
```

Bases: `Guard`

Input guard that removes repeated text blocks before the LLM sees them.

##### Parameters

similarity_chars: Length of the prefix fingerprint used to detect near-duplicates. Lower = more aggressive dedup. Default 60 is good for dialogue turns. min_block_chars: Blocks shorter than this are never deduplicated (avoids removing short repeated phrases like "Yes" or "Certo"). Default 40. verbose: If True, prints a one-line summary when blocks are removed.

Source code in `lazybridge/dedup_guard.py`

```python
def __init__(
    self,
    *,
    similarity_chars: int = 60,
    min_block_chars: int = 40,
    verbose: bool = True,
) -> None:
    self._sim_chars = similarity_chars
    self._min_chars = min_block_chars
    self._verbose = verbose
```

### lazybridge.GuardError

Bases: `Exception`

Raised when a Guard blocks execution.

# Multi-agent graphs

Primitives for dynamic, LLM-routed multi-agent systems where the topology is decided at runtime rather than declared up front.

- `AgentPool` — a name-keyed registry exposed to agents as a single `route(agent_name, task)` tool. Lets agents delegate to **each other** by name (resolving the circular-reference problem that a frozen tool map otherwise imposes) and bounds recursion via `max_depth`.
- `conclude` — a non-local exit: any agent, however deeply nested, can call `conclude("answer")` to end the whole task and return its answer straight to the originating top-level `Agent.run`. Raised internally as `ConcludeSignal` (a `BaseException`) and caught only at the top level.

Pair them with `LLMEngine(max_tool_calls_per_turn=1)` to keep the graph on a single non-branching path. For narrative usage see [Guides → Mid → Dynamic graph](https://core.lazybridge.com/guides/mid/dynamic-graph/index.md).

### lazybridge.AgentPool

```python
AgentPool(*, max_depth: int = 25)
```

Registry of named agents, exposed to the LLM as a single `route` tool.

Source code in `lazybridge/pool.py`

```python
def __init__(self, *, max_depth: int = 25) -> None:
    if max_depth < 1:
        raise ValueError(f"max_depth must be >= 1, got {max_depth!r}")
    self._agents: dict[str, Agent] = {}
    self._max_depth = max_depth
    self._depth: contextvars.ContextVar[int] = contextvars.ContextVar("lb_pool_depth", default=0)
```

#### register

```python
register(*agents: Agent) -> None
```

Add agents to the pool, keyed by their `name`.

Source code in `lazybridge/pool.py`

```python
def register(self, *agents: Agent) -> None:
    """Add agents to the pool, keyed by their ``name``."""
    for agent in agents:
        if not getattr(agent, "name", None):
            raise ValueError("AgentPool.register() requires agents with an explicit name=.")
        self._agents[agent.name] = agent
```

#### roster

```python
roster() -> str
```

One line per registered agent — drop into the members' system prompt.

Source code in `lazybridge/pool.py`

```python
def roster(self) -> str:
    """One line per registered agent — drop into the members' system prompt."""
    return "\n".join(f"- {name}: {a.description or ''}" for name, a in self._agents.items())
```

#### route

```python
route(agent_name: str, task: str) -> str
```

Delegate `task` to the named specialist agent and return its answer.

Source code in `lazybridge/pool.py`

```python
async def route(self, agent_name: str, task: str) -> str:
    """Delegate ``task`` to the named specialist agent and return its answer."""
    depth = self._depth.get()
    if depth >= self._max_depth:
        return f"Max routing depth {self._max_depth} reached — call conclude now with your best answer."
    agent = self._agents.get(agent_name)
    if agent is None:
        return f"Unknown agent {agent_name!r}. Available: {list(self._agents)}"
    token = self._depth.set(depth + 1)
    try:
        # ``_run_as_tool`` so a ``conclude`` raised downstream propagates
        # past this level to the original top-level caller.
        return (await agent._run_as_tool(task)).text()
    finally:
        self._depth.reset(token)
```

#### as_tool

```python
as_tool(name: str = 'route') -> Tool
```

Wrap routing as a `route(agent_name, task)` tool for `tools=[...]`.

Pass a distinct `name` to give one agent access to several pools without a tool-name collision, e.g. `tools=[my_team.as_tool("ask_team"), peers.as_tool("ask_peer")]`.

Source code in `lazybridge/pool.py`

```python
def as_tool(self, name: str = "route") -> Tool:
    """Wrap routing as a ``route(agent_name, task)`` tool for ``tools=[...]``.

    Pass a distinct ``name`` to give one agent access to several pools
    without a tool-name collision, e.g.
    ``tools=[my_team.as_tool("ask_team"), peers.as_tool("ask_peer")]``.
    """
    return Tool.wrap(self.route, name=name)
```

### lazybridge.conclude

```python
conclude(message: str) -> str
```

End the whole task now and return `message` as the final answer.

Use as a tool in multi-agent graphs: any agent — however deeply nested — can call `conclude("…")` to short-circuit the entire call chain and return its answer directly to the original caller.

Immediacy note: the exit fires as soon as the turn's tool calls settle. If the model emits `conclude` *alongside* other tool calls in the same turn, those siblings still run to completion first (they execute concurrently via `asyncio.gather`), so a slow sibling can delay the exit. Set `LLMEngine(max_tool_calls_per_turn=1)` — the recommended multi-agent configuration — to keep one call per turn and avoid this.

Source code in `lazybridge/signals.py`

```python
def conclude(message: str) -> str:
    """End the whole task now and return ``message`` as the final answer.

    Use as a tool in multi-agent graphs: any agent — however deeply nested —
    can call ``conclude("…")`` to short-circuit the entire call chain and
    return its answer directly to the original caller.

    Immediacy note: the exit fires as soon as the turn's tool calls settle.
    If the model emits ``conclude`` *alongside* other tool calls in the same
    turn, those siblings still run to completion first (they execute
    concurrently via ``asyncio.gather``), so a slow sibling can delay the
    exit. Set ``LLMEngine(max_tool_calls_per_turn=1)`` — the recommended
    multi-agent configuration — to keep one call per turn and avoid this.
    """
    raise ConcludeSignal(message)
```

### lazybridge.ConcludeSignal

```python
ConcludeSignal(message: str)
```

Bases: `BaseException`

Raised by :func:`conclude` to end the whole task with `message`.

Source code in `lazybridge/signals.py`

```python
def __init__(self, message: str) -> None:
    self.message = message
    super().__init__(message)
```

# Custom providers

`BaseProvider` is the stable extension point for integrating any LLM backend. The provider registry on `LLMEngine` routes model strings to registered providers.

For narrative usage see [Guides → Advanced → BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md) and [Guides → Advanced → Providers](https://core.lazybridge.com/guides/advanced/providers/index.md) (built-in catalogue + tier tables).

## Abstract base class

### lazybridge.BaseProvider

```python
BaseProvider(api_key: str | None = None, model: str | None = None, *, fallback_model: str | None = None, strict_native_tools: bool | None = None, **kwargs: Any)
```

Bases: `ABC`

Stable abstract base class for all LLM providers.

Subclass this to integrate any LLM backend with LazyBridge. Plug a custom provider in by constructing an `LLMEngine` that routes to it (see `lazybridge/core/executor.py` for resolution)::

```text
agent = Agent(engine=LLMEngine("my-model"))
```

**Stability contract** The following are guaranteed stable across minor versions:

- `__init__(api_key, model, **kwargs)` signature
- `_init_client(**kwargs)` — override to initialise your SDK client
- `complete(request)` — synchronous completion
- `stream(request)` — synchronous streaming
- `acomplete(request)` — async completion
- `astream(request)` — async streaming generator
- `default_model: str` — class-level default model name
- `supported_native_tools: frozenset[NativeTool]` — declare web search etc.
- `get_default_max_tokens(model)` — override to set per-model limits
- `_resolve_model(request)` — helper: request.model → self.model → default_model
- `_compute_cost(model, input_tokens, output_tokens)` — override for cost tracking
- `_check_native_tools(tools)` — filters unsupported native tools with a warning

**What you MUST implement**: `complete`, `stream`, `acomplete`, `astream`.

**What you SHOULD override**: `_init_client`, `default_model`, `get_default_max_tokens`, `_compute_cost`.

**What you MUST NOT do**:

- Raise exceptions other than Python built-ins or your SDK's own error types. LazyBridge does not wrap provider exceptions — they propagate as-is.
- Mutate `request` — it is shared and must be treated as read-only.
- Block the event loop inside `acomplete` / `astream` — use `await` or `asyncio.get_event_loop().run_in_executor` for blocking SDK calls.

Initialise the provider.

##### Parameters

api_key: Provider API key. If `None`, `_init_client` reads it from an environment variable (standard pattern for all built-in providers). model: Model identifier to use for all requests. When `None` and `default_model` is also `None` (recommended for paid cloud providers), `_resolve_model` raises a clear `ValueError` rather than silently falling back to an expensive flagship. fallback_model: Model to use when neither `model=` nor `request.model` is set. Two forms:

- Explicit string, e.g. `fallback_model="gpt-4o-mini"` — used verbatim (tier aliases are resolved normally).
- `"cheapest"` — automatically resolves to the cheapest tier alias available on this provider (`super_cheap` → `cheap` → `medium`, in that order). When `None` (default) and no model is configured, a `ValueError` is raised with guidance on how to fix it. strict_native_tools: When `True`, requesting an unsupported :class:`NativeTool` raises :class:`UnsupportedNativeToolError`. When `None` (default) the class-level :attr:`strict_native_tools` attribute is used (typically `False`). \*\*kwargs: Forwarded verbatim to :meth:`_init_client`.

Source code in `lazybridge/core/providers/base.py`

```python
def __init__(
    self,
    api_key: str | None = None,
    model: str | None = None,
    *,
    fallback_model: str | None = None,
    strict_native_tools: bool | None = None,
    **kwargs: Any,
) -> None:
    """Initialise the provider.

    Parameters
    ----------
    api_key:
        Provider API key.  If ``None``, ``_init_client`` reads it from
        an environment variable (standard pattern for all built-in providers).
    model:
        Model identifier to use for all requests.  When ``None`` and
        ``default_model`` is also ``None`` (recommended for paid cloud
        providers), ``_resolve_model`` raises a clear ``ValueError``
        rather than silently falling back to an expensive flagship.
    fallback_model:
        Model to use when neither ``model=`` nor ``request.model`` is set.
        Two forms:
        - Explicit string, e.g. ``fallback_model="gpt-4o-mini"`` — used
          verbatim (tier aliases are resolved normally).
        - ``"cheapest"`` — automatically resolves to the cheapest tier
          alias available on this provider
          (``super_cheap`` → ``cheap`` → ``medium``, in that order).
        When ``None`` (default) and no model is configured, a
        ``ValueError`` is raised with guidance on how to fix it.
    strict_native_tools:
        When ``True``, requesting an unsupported :class:`NativeTool`
        raises :class:`UnsupportedNativeToolError`.  When ``None``
        (default) the class-level :attr:`strict_native_tools`
        attribute is used (typically ``False``).
    **kwargs:
        Forwarded verbatim to :meth:`_init_client`.
    """
    if api_key is not None and not api_key.strip():
        raise ValueError(
            f"{self.__class__.__name__}: api_key must not be an empty or "
            "whitespace-only string. Pass None to read from the environment "
            "variable, or provide a valid key."
        )
    self.api_key = api_key
    # Store the user-supplied model separately so _resolve_model can
    # distinguish "user didn't pass a model" from "class default applies".
    # self.model is the effective value for backward-compat reads (e.g.
    # executor.model); _resolve_model uses _user_model to decide when to
    # consult fallback_model before falling through to default_model.
    self._user_model: str | None = model
    self.model = model or self.default_model
    self.fallback_model = fallback_model
    if strict_native_tools is not None:
        # Per-instance override of the class-level default.
        self.strict_native_tools = bool(strict_native_tools)
    self._init_client(**kwargs)
```

#### default_model

```python
default_model: str | None = ''
```

Class-level default model identifier. Used when neither the request nor the constructor `model=` argument specifies a model. Set to `None` on paid cloud providers to force explicit model selection and prevent silent fallback to an expensive flagship.

#### supported_native_tools

```python
supported_native_tools: frozenset[NativeTool] = frozenset()
```

Declare which :class:`~lazybridge.core.types.NativeTool` values this provider supports (e.g. `frozenset({NativeTool.WEB_SEARCH})`). Unsupported tools requested by the user are filtered and warned — or raised, when `strict_native_tools=True` is set on construction.

#### supports_streaming

```python
supports_streaming: bool = True
```

Does this provider expose `stream(...)` / `astream(...)`?

#### supports_structured_output

```python
supports_structured_output: bool = True
```

Does this provider accept `request.structured_output` (Pydantic model or JSON-schema dict)?

#### supports_thinking

```python
supports_thinking: bool = True
```

Does this provider produce a `thinking` field on the response (or `reasoning_tokens` / `thoughts_token_count` on usage)?

#### strict_native_tools

```python
strict_native_tools: bool = False
```

When True, requesting an unsupported :class:`NativeTool` raises :class:`UnsupportedNativeToolError` instead of warning-and-dropping. Set on construction (`BaseProvider(..., strict_native_tools=True)`) or via the subclass. Default `False` preserves the friendly pre-W5.1 behaviour for ad-hoc / interactive use. Production setups should consider opting into strict mode so a misconfigured provider fails loud rather than degrading to a non-grounded reply.

#### supports_vision

```python
supports_vision(model: str | None = None) -> bool
```

Whether the resolved `model` accepts image input.

Default implementation does a substring scan against :attr:`_VISION_CAPABLE_MODEL_PATTERNS`. Override when the decision needs custom logic (e.g. version-range checks).

Returns `False` for `None` / empty model because we don't know what the eventual default will be — caller can re-query once the model is resolved.

Source code in `lazybridge/core/providers/base.py`

```python
@classmethod
def supports_vision(cls, model: str | None = None) -> bool:
    """Whether the resolved ``model`` accepts image input.

    Default implementation does a substring scan against
    :attr:`_VISION_CAPABLE_MODEL_PATTERNS`.  Override when the
    decision needs custom logic (e.g. version-range checks).

    Returns ``False`` for ``None`` / empty model because we don't
    know what the eventual default will be — caller can re-query
    once the model is resolved.
    """
    if not model:
        return False
    m = model.lower()
    return any(p in m for p in cls._VISION_CAPABLE_MODEL_PATTERNS)
```

#### supports_audio

```python
supports_audio(model: str | None = None) -> bool
```

Whether the resolved `model` accepts audio input.

See :meth:`supports_vision` — same semantics, audio modality.

Source code in `lazybridge/core/providers/base.py`

```python
@classmethod
def supports_audio(cls, model: str | None = None) -> bool:
    """Whether the resolved ``model`` accepts audio input.

    See :meth:`supports_vision` — same semantics, audio modality.
    """
    if not model:
        return False
    m = model.lower()
    return any(p in m for p in cls._AUDIO_CAPABLE_MODEL_PATTERNS)
```

#### is_retryable

```python
is_retryable(exc: BaseException) -> bool | None
```

Classify a provider exception as retryable, non-retryable, or defer.

The :class:`~lazybridge.core.executor.Executor` consults this hook before falling back to its generic status/string heuristic. Override when the provider SDK raises structured exception types that encode retry semantics more precisely than HTTP status codes alone — for example a rate-limit exception that carries a `retry_after` attribute distinguishing "back off" (retryable) from "quota exhausted" (not).

Return values

- `True` — retry with backoff.
- `False` — do not retry; surface the exception.
- `None` — no opinion; Executor falls back to its generic classifier (`core.executor._is_retryable`) that matches `status_code in {429, 5xx}` and common transient-error strings.

Default implementation returns `None` so built-in providers fall through to the generic path with no behaviour change.

Source code in `lazybridge/core/providers/base.py`

```python
def is_retryable(self, exc: BaseException) -> bool | None:
    """Classify a provider exception as retryable, non-retryable, or defer.

    The :class:`~lazybridge.core.executor.Executor` consults this hook
    before falling back to its generic status/string heuristic.  Override
    when the provider SDK raises structured exception types that encode
    retry semantics more precisely than HTTP status codes alone — for
    example a rate-limit exception that carries a ``retry_after`` attribute
    distinguishing "back off" (retryable) from "quota exhausted" (not).

    Return values:
      * ``True`` — retry with backoff.
      * ``False`` — do not retry; surface the exception.
      * ``None`` — no opinion; Executor falls back to its generic
        classifier (``core.executor._is_retryable``) that matches
        ``status_code in {429, 5xx}`` and common transient-error strings.

    Default implementation returns ``None`` so built-in providers fall
    through to the generic path with no behaviour change.
    """
    return None
```

#### complete

```python
complete(request: CompletionRequest) -> CompletionResponse
```

Execute a synchronous completion and return a unified response.

###### Parameters

request: Fully assembled :class:`~lazybridge.core.types.CompletionRequest`. Treat as **read-only** — do not mutate.

###### Returns

CompletionResponse At minimum, `content` must be set to the model's text reply. Populate `usage`, `model`, `tool_calls`, `stop_reason` when available. Set `raw` to the original SDK response object to allow callers to access provider-specific fields.

###### Raises

Any exception from your SDK is acceptable — LazyBridge propagates them as-is and handles retry logic in :class:`~lazybridge.core.executor.Executor`.

Source code in `lazybridge/core/providers/base.py`

```python
@abstractmethod
def complete(self, request: CompletionRequest) -> CompletionResponse:
    """Execute a synchronous completion and return a unified response.

    Parameters
    ----------
    request:
        Fully assembled :class:`~lazybridge.core.types.CompletionRequest`.
        Treat as **read-only** — do not mutate.

    Returns
    -------
    CompletionResponse
        At minimum, ``content`` must be set to the model's text reply.
        Populate ``usage``, ``model``, ``tool_calls``, ``stop_reason``
        when available.  Set ``raw`` to the original SDK response object
        to allow callers to access provider-specific fields.

    Raises
    ------
    Any exception from your SDK is acceptable — LazyBridge propagates them
    as-is and handles retry logic in :class:`~lazybridge.core.executor.Executor`.
    """
    ...
```

#### stream

```python
stream(request: CompletionRequest) -> Iterator[StreamChunk]
```

Stream a completion, yielding :class:`~lazybridge.core.types.StreamChunk` objects.

The final chunk **must** have `is_final=True` and `stop_reason` set. Token usage should be reported on the final chunk when available.

###### Parameters

request: Same as :meth:`complete`. Treat as read-only.

###### Yields

StreamChunk Intermediate chunks: `delta` contains the new text fragment. Final chunk: `is_final=True`, `stop_reason` set, `usage` populated.

Example skeleton::

```text
def stream(self, request):
    for raw_chunk in self._client.stream(...):
        yield StreamChunk(delta=raw_chunk.text)
    yield StreamChunk(
        delta="",
        stop_reason="end_turn",
        is_final=True,
        usage=UsageStats(input_tokens=..., output_tokens=...),
    )
```

Source code in `lazybridge/core/providers/base.py`

```python
@abstractmethod
def stream(self, request: CompletionRequest) -> Iterator[StreamChunk]:
    """Stream a completion, yielding :class:`~lazybridge.core.types.StreamChunk` objects.

    The final chunk **must** have ``is_final=True`` and ``stop_reason`` set.
    Token usage should be reported on the final chunk when available.

    Parameters
    ----------
    request:
        Same as :meth:`complete`. Treat as read-only.

    Yields
    ------
    StreamChunk
        Intermediate chunks: ``delta`` contains the new text fragment.
        Final chunk: ``is_final=True``, ``stop_reason`` set, ``usage`` populated.

    Example skeleton::

        def stream(self, request):
            for raw_chunk in self._client.stream(...):
                yield StreamChunk(delta=raw_chunk.text)
            yield StreamChunk(
                delta="",
                stop_reason="end_turn",
                is_final=True,
                usage=UsageStats(input_tokens=..., output_tokens=...),
            )
    """
    ...
```

#### acomplete

```python
acomplete(request: CompletionRequest) -> CompletionResponse
```

Async version of :meth:`complete`.

Semantics and return contract are identical. Use `await` for all blocking operations — never call `time.sleep` or blocking I/O here.

Source code in `lazybridge/core/providers/base.py`

```python
@abstractmethod
async def acomplete(self, request: CompletionRequest) -> CompletionResponse:
    """Async version of :meth:`complete`.

    Semantics and return contract are identical. Use ``await`` for all
    blocking operations — never call ``time.sleep`` or blocking I/O here.
    """
    ...
```

#### astream

```python
astream(request: CompletionRequest) -> AsyncIterator[StreamChunk]
```

Async streaming generator — async version of :meth:`stream`.

Implement as an `async def` generator::

```text
async def astream(self, request):
    async for raw_chunk in self._client.astream(...):
        yield StreamChunk(delta=raw_chunk.text)
    yield StreamChunk(stop_reason="end_turn", is_final=True, usage=...)
```

The same final-chunk contract as :meth:`stream` applies.

Source code in `lazybridge/core/providers/base.py`

```python
@abstractmethod
def astream(self, request: CompletionRequest) -> AsyncIterator[StreamChunk]:
    """Async streaming generator — async version of :meth:`stream`.

    Implement as an ``async def`` generator::

        async def astream(self, request):
            async for raw_chunk in self._client.astream(...):
                yield StreamChunk(delta=raw_chunk.text)
            yield StreamChunk(stop_reason="end_turn", is_final=True, usage=...)

    The same final-chunk contract as :meth:`stream` applies.
    """
    ...
```

#### get_default_max_tokens

```python
get_default_max_tokens(model: str | None = None) -> int
```

Return the default `max_tokens` cap for the given model.

Override when your model has a limit lower or higher than 4096. LazyBridge calls this when `max_tokens` is not set explicitly.

Source code in `lazybridge/core/providers/base.py`

```python
def get_default_max_tokens(self, model: str | None = None) -> int:
    """Return the default ``max_tokens`` cap for the given model.

    Override when your model has a limit lower or higher than 4096.
    LazyBridge calls this when ``max_tokens`` is not set explicitly.
    """
    return 4096
```

## Provider registry surface

The registry methods are class-level on `LLMEngine`. They mutate class-level tables (`_PROVIDER_ALIASES`, `_PROVIDER_RULES`, `_PROVIDER_DEFAULT`) and are documented under the engine class itself — see [Engines → LLMEngine](https://core.lazybridge.com/reference/engines/#lazybridge.LLMEngine) for the full method list. For read-only introspection from caller code, use the public **`PROVIDER_ALIASES`** snapshot or **`LLMEngine.provider_aliases()`** (returns a fresh `dict[str, str]` copy of the routing aliases — safe to mutate without affecting the framework).

### lazybridge.PROVIDER_ALIASES

```python
PROVIDER_ALIASES: dict[str, str] = provider_aliases()
```

Registry mutation entry points (quick reference):

| Method                                                                    | Effect                                        |
| ------------------------------------------------------------------------- | --------------------------------------------- |
| `LLMEngine.provider_aliases()`                                            | Snapshot of the current alias map (read-only) |
| `LLMEngine.register_provider_alias(alias, provider)`                      | Exact-match (case-insensitive) routing        |
| \`LLMEngine.register_provider_rule(pattern, provider, \*, kind="contains" | "startswith")\`                               |
| \`LLMEngine.set_default_provider(provider                                 | None)\`                                       |

## Capability matrix

| Provider    | Streaming | Structured output | Thinking | code_execution | computer_use | file_search | google_maps | google_search | image_generation | web_search |
| ----------- | --------- | ----------------- | -------- | -------------- | ------------ | ----------- | ----------- | ------------- | ---------------- | ---------- |
| `anthropic` | ✓         | ✓                 | ✓        | ✓              | ✓            | —           | —           | —             | —                | ✓          |
| `deepseek`  | ✓         | ✓                 | ✓        | —              | —            | —           | —           | —             | —                | —          |
| `google`    | ✓         | ✓                 | ✓        | —              | —            | —           | ✓           | ✓             | —                | ✓          |
| `litellm`   | ✓         | ✓                 | —        | —              | —            | —           | —           | —             | —                | —          |
| `lmstudio`  | ✓         | ✓                 | —        | —              | —            | —           | —           | —             | —                | —          |
| `openai`    | ✓         | ✓                 | ✓        | ✓              | ✓            | ✓           | —           | —             | ✓                | ✓          |

*Generated from `lazybridge.matrix.provider_capabilities()` at docs build time — see `tools/mkdocs_provider_table.py`.*

The table above is generated at docs build time from `lazybridge.matrix.provider_capabilities()` which in turn reads the `ClassVar` flags on each provider class. Update the matrix by editing the provider's `supports_streaming` / `supports_structured_output` / `supports_thinking` / `supported_native_tools` declarations — the table re-renders on the next `mkdocs build`.

```python
from lazybridge.matrix import provider_capabilities

for name, caps in provider_capabilities().items():
    print(name, caps.streaming, caps.structured_output, caps.thinking)
```

### `lazybridge.matrix` reference

### lazybridge.matrix.provider_capabilities

```python
provider_capabilities() -> dict[str, ProviderCapabilities]
```

Return the capability matrix for every registered provider.

Keys are the provider names recognised by `LLMEngine` (the `provider=` argument and the `LLMEngine._PROVIDER_RULES` map); values are :class:`ProviderCapabilities` instances.

Cached after first call; the underlying `ClassVar` declarations are immutable in practice so re-querying the providers each call would just thrash the import system.

**Graceful degradation** — each provider class is imported lazily and *individually*. If importing one provider's module fails (e.g. a broken optional SDK that explodes at import time), that provider is omitted from the returned matrix and a :class:`UserWarning` is issued, rather than letting one bad import break introspection for every other provider.

Source code in `lazybridge/matrix.py`

```python
@lru_cache(maxsize=1)
def provider_capabilities() -> dict[str, ProviderCapabilities]:
    """Return the capability matrix for every registered provider.

    Keys are the provider names recognised by ``LLMEngine`` (the
    ``provider=`` argument and the ``LLMEngine._PROVIDER_RULES`` map);
    values are :class:`ProviderCapabilities` instances.

    Cached after first call; the underlying ``ClassVar`` declarations
    are immutable in practice so re-querying the providers each call
    would just thrash the import system.

    **Graceful degradation** — each provider class is imported lazily and
    *individually*.  If importing one provider's module fails (e.g. a
    broken optional SDK that explodes at import time), that provider is
    omitted from the returned matrix and a :class:`UserWarning` is issued,
    rather than letting one bad import break introspection for every other
    provider.
    """
    out: dict[str, ProviderCapabilities] = {}
    for name, module_path, attr in _PROVIDER_IMPORTS:
        try:
            cls: Any = getattr(importlib.import_module(module_path), attr)
        except Exception as exc:  # defend against any import-time blow-up
            warnings.warn(
                f"lazybridge.matrix: provider {name!r} is unavailable "
                f"(failed to import {module_path}.{attr}: {exc!r}); "
                f"omitting it from the capability matrix.",
                stacklevel=2,
            )
            continue
        out[name] = ProviderCapabilities(
            native_tools=frozenset(getattr(cls, "supported_native_tools", frozenset())),
            streaming=bool(getattr(cls, "supports_streaming", True)),
            structured_output=bool(getattr(cls, "supports_structured_output", True)),
            thinking=bool(getattr(cls, "supports_thinking", True)),
        )
    return out
```

### lazybridge.matrix.native_tool_support

```python
native_tool_support() -> dict[str, list[str]]
```

Compact `provider → [native-tool names]` mapping.

Convenient for README tables and doc generation; the full :class:`ProviderCapabilities` shape is what most callers want.

Source code in `lazybridge/matrix.py`

```python
def native_tool_support() -> dict[str, list[str]]:
    """Compact ``provider → [native-tool names]`` mapping.

    Convenient for README tables and doc generation; the full
    :class:`ProviderCapabilities` shape is what most callers want.
    """
    return {name: sorted(t.value for t in caps.native_tools) for name, caps in provider_capabilities().items()}
```

### lazybridge.matrix.ProviderCapabilities

```python
ProviderCapabilities(native_tools: frozenset[NativeTool] = frozenset(), streaming: bool = True, structured_output: bool = True, thinking: bool = True)
```

Snapshot of a single provider's declared capabilities.

All four fields come from `ClassVar` declarations on the provider class; keep them in sync there, not here.

## stop_reason normalisation

Each provider exposes its own raw finish-reason vocabulary; LazyBridge maps them to a normalised `CompletionResponse.stop_reason` so engine loops can decide identically across providers. Notable mappings:

| Provider  | Raw value                                                     | Normalised                                                                 |
| --------- | ------------------------------------------------------------- | -------------------------------------------------------------------------- |
| Anthropic | `end_turn` / `tool_use` / `max_tokens` / `stop_sequence`      | `end_turn` / `tool_use` / `max_tokens` / `end_turn`                        |
| OpenAI    | `stop` / `tool_calls` / `length` / `content_filter`           | `end_turn` / `tool_use` / `max_tokens` / `error`                           |
| Google    | `STOP` / `MAX_TOKENS` / `SAFETY` / `RECITATION` / `BLOCKLIST` | `end_turn` / `max_tokens` / `error` (the bucket for non-stop terminations) |
| DeepSeek  | passes through OpenAI shape                                   | as OpenAI                                                                  |

The Google `MAX_TOKENS` mapping is fixed in 0.7.9 — pre-fix it was returned as the literal string and broke loops that branched on `stop_reason == "max_tokens"`. Inspect `Envelope.metadata.stop_reason` to read the normalised value.

# Sentinels & predicates

Sentinels declare **where a Plan step's input comes from**; the `when` DSL declares **when a route fires**. Both are validated at Plan construction time — a typo in `from_step("reseach")` raises `PlanCompileError` before any LLM call.

For narrative usage see [Guides → Full → Sentinels](https://core.lazybridge.com/guides/full/sentinels/index.md) and [Guides → Full → Routing](https://core.lazybridge.com/guides/full/routing/index.md).

| Symbol                      | Resolves to                                             | Scope                   |
| --------------------------- | ------------------------------------------------------- | ----------------------- |
| `from_prev`                 | The previous step's payload                             | In-Plan only            |
| `from_start`                | The Plan's initial input                                | In-Plan only            |
| `from_step("name")`         | The named step's output                                 | In-Plan only            |
| `from_parallel("name")`     | A single named parallel branch's output                 | In-Plan only            |
| `from_parallel_all("name")` | All consecutive parallel siblings, labelled-text joined | In-Plan only            |
| `from_memory("name")`       | An agent's live conversation history                    | Cross-run, via `Memory` |
| `from_agent("name")`        | An agent's last persisted output                        | Cross-run, via `Store`  |
| `when`                      | Routing predicate DSL builder                           | `Step(routes={…})`      |

## Plan-only sentinels

### lazybridge.from_prev

```python
from_prev = _FromPrev()
```

### lazybridge.from_start

```python
from_start = _FromStart()
```

### lazybridge.from_step

```python
from_step(name: str) -> _FromStep
```

Source code in `lazybridge/sentinels.py`

```python
def from_step(name: str) -> _FromStep:
    return _FromStep(name=name)
```

### lazybridge.from_parallel

```python
from_parallel(name: str) -> _FromParallel
```

Source code in `lazybridge/sentinels.py`

```python
def from_parallel(name: str) -> _FromParallel:
    return _FromParallel(name=name)
```

### lazybridge.from_parallel_all

```python
from_parallel_all(name: str) -> _FromParallelAll
```

Aggregate every consecutive parallel sibling starting at `name`.

Source code in `lazybridge/sentinels.py`

```python
def from_parallel_all(name: str) -> _FromParallelAll:
    """Aggregate every consecutive parallel sibling starting at ``name``."""
    return _FromParallelAll(name=name)
```

## Universal sentinels

### lazybridge.from_memory

```python
from_memory(name: str) -> _FromMemory
```

Inject the live memory of the agent registered as `name` at execution time.

Source code in `lazybridge/sentinels.py`

```python
def from_memory(name: str) -> _FromMemory:
    """Inject the live memory of the agent registered as ``name`` at execution time."""
    return _FromMemory(name=name)
```

### lazybridge.from_agent

```python
from_agent(name: str) -> _FromAgent
```

Read the last output of the agent registered as `name` from the shared store.

Source code in `lazybridge/sentinels.py`

```python
def from_agent(name: str) -> _FromAgent:
    """Read the last output of the agent registered as ``name`` from the shared store."""
    return _FromAgent(name=name)
```

## Routing predicates

### lazybridge.when

```python
when: _When = _When()
```

# Session & observability

`Session` is the event bus that fans observability events into exporters and exposes the `GraphSchema` topology view. Six core exporter classes ship under `lazybridge.*`; `OTelExporter` lives under `lazybridge.ext.otel` (see [Extension engines](https://core.lazybridge.com/reference/extensions/index.md)).

For narrative usage see [Guides → Mid → Session](https://core.lazybridge.com/guides/mid/session/index.md), [Guides → Full → Exporters](https://core.lazybridge.com/guides/full/exporters/index.md), and [Guides → Full → GraphSchema](https://core.lazybridge.com/guides/full/graph-schema/index.md).

| Symbol                  | Role                                                                                                                                 |
| ----------------------- | ------------------------------------------------------------------------------------------------------------------------------------ |
| `Session`               | The event bus. Attached to an Agent via `session=` (or implicitly via `verbose=True`).                                               |
| `EventLog`              | SQLite-backed event store under `Session.events`.                                                                                    |
| `EventType`             | StrEnum of emitted event kinds (`AGENT_START`, `TOOL_CALL`, `AGENT_FINISH`, …).                                                      |
| `GraphSchema`           | Topology view of registered agents + tool edges; renderable via `Session.graph.to_json()`.                                           |
| `EventExporter`         | Base protocol — implement `export(event)` to add your own sink.                                                                      |
| `ConsoleExporter`       | Human-readable stdout exporter; used implicitly by `verbose=True`.                                                                   |
| `CallbackExporter`      | Routes events to a user-supplied callable.                                                                                           |
| `FilteredExporter`      | Wrap any exporter to drop events that don't match a predicate.                                                                       |
| `JsonFileExporter`      | Newline-delimited JSON sink.                                                                                                         |
| `StructuredLogExporter` | Logs events as structured records via the `logging` module.                                                                          |
| `OTelExporter` (ext)    | OpenTelemetry GenAI conventions; see [Guides → Advanced → OpenTelemetry](https://core.lazybridge.com/guides/advanced/otel/index.md). |

## Session

### lazybridge.Session

```python
Session(*, db: str | None = None, exporters: list[Any] | None = None, redact: Callable[[dict[str, Any]], dict[str, Any]] | None | Any = _REDACT_UNSET, redact_on_error: Literal['fallback', 'strict'] = 'strict', unsafe_log_payloads: bool = False, console: bool = False, batched: bool = False, batch_size: int = 100, batch_interval: float = 1.0, max_queue_size: int = 10000, on_full: Literal['drop', 'block', 'hybrid'] = 'hybrid', critical_events: frozenset[str] | set[str] | None = None)
```

Container for observability config: exporters, redaction, EventLog.

Construct a Session.

##### Back-pressure policy

`on_full` selects what happens when the batched-writer queue is saturated (`batched=True`):

- `"hybrid"` (default) — block for audit-critical event types (`AGENT_*`, `TOOL_*`, `HIL_DECISION`; override via `critical_events=`) and silently drop the cheap high-volume ones (`LOOP_STEP` / `MODEL_REQUEST` / `MODEL_RESPONSE`). A buggy slow exporter no longer makes `AGENT_FINISH` or `TOOL_ERROR` disappear, while a steady-state telemetry firehose still doesn't add latency to the producer.
- `"block"` — back-pressure unconditionally. Pick this when every event must persist (compliance) and producer latency can absorb the wait.
- `"drop"` — never back-pressure. Saturation drops events silently (with a doubling-interval warning). Pick this when telemetry must not block production traffic and lossy traces are acceptable.

##### Redactor failure modes

`redact_on_error` governs what happens when a `redact` callable either raises or returns a non-dict:

- `"strict"` (default) — warn once, then **drop the event entirely**. No record in the EventLog, no export to exporters. Fail-closed: unredacted data can never leak via this session. This is the safe default for compliance workloads where a broken redactor is a bug to be fixed, not a reason to keep leaking.
- `"fallback"` — warn once, then record and export the **original, unredacted** payload. Observability is preserved at the cost of potentially leaking unredacted data through the event bus. Useful for development; opt in explicitly when you want it.

Default is `"strict"`: a redactor that fails closes the event rather than persisting it unredacted. Pass `redact_on_error="fallback"` to keep the unredacted event flowing when redaction fails (lower fidelity, but lossless).

##### Default-safe secret redaction

When `redact` is not passed, Session defaults to :func:`redact_secrets` so well-known credential shapes (`sk-...`, `ghp_...`, `AIza...`, JWTs, `Bearer ...` etc.) are stripped from every event payload before it hits the EventLog or any exporter. Pass `redact=None` to opt out for a single Session, or `unsafe_log_payloads=True` for the same effect with a more searchable construction-site keyword. Passing your own `redact=callable` always wins — Session will not stack the default in front of a user redactor.

Source code in `lazybridge/session.py`

```python
def __init__(
    self,
    *,
    db: str | None = None,
    exporters: list[Any] | None = None,
    redact: Callable[[dict[str, Any]], dict[str, Any]] | None | Any = _REDACT_UNSET,
    redact_on_error: Literal["fallback", "strict"] = "strict",
    unsafe_log_payloads: bool = False,
    console: bool = False,
    batched: bool = False,
    batch_size: int = 100,
    batch_interval: float = 1.0,
    max_queue_size: int = 10_000,
    on_full: Literal["drop", "block", "hybrid"] = "hybrid",
    critical_events: frozenset[str] | set[str] | None = None,
) -> None:
    """Construct a Session.

    Back-pressure policy
    --------------------
    ``on_full`` selects what happens when the batched-writer queue
    is saturated (``batched=True``):

    * ``"hybrid"`` (default) — block for audit-critical event types
      (``AGENT_*``, ``TOOL_*``, ``HIL_DECISION``; override via
      ``critical_events=``) and silently drop the cheap high-volume
      ones (``LOOP_STEP`` / ``MODEL_REQUEST`` / ``MODEL_RESPONSE``).
      A buggy slow exporter no longer makes ``AGENT_FINISH`` or
      ``TOOL_ERROR`` disappear, while a steady-state telemetry
      firehose still doesn't add latency to the producer.
    * ``"block"`` — back-pressure unconditionally.  Pick this when
      every event must persist (compliance) and producer latency
      can absorb the wait.
    * ``"drop"``  — never back-pressure.  Saturation drops events
      silently (with a doubling-interval warning).  Pick this when
      telemetry must not block production traffic and lossy traces
      are acceptable.

    Redactor failure modes
    ----------------------
    ``redact_on_error`` governs what happens when a ``redact``
    callable either raises or returns a non-dict:

    * ``"strict"`` (default) — warn once, then **drop the event
      entirely**.  No record in the EventLog, no export to
      exporters.  Fail-closed: unredacted data can never leak via
      this session.  This is the safe default for compliance
      workloads where a broken redactor is a bug to be fixed, not
      a reason to keep leaking.
    * ``"fallback"`` — warn once, then record and export the
      **original, unredacted** payload.  Observability is preserved
      at the cost of potentially leaking unredacted data through
      the event bus.  Useful for development; opt in explicitly
      when you want it.

    Default is ``"strict"``: a redactor that fails closes the
    event rather than persisting it unredacted.  Pass
    ``redact_on_error="fallback"`` to keep the unredacted event
    flowing when redaction fails (lower fidelity, but lossless).

    Default-safe secret redaction
    -----------------------------
    When ``redact`` is not passed, Session defaults to
    :func:`redact_secrets` so well-known credential shapes
    (``sk-...``, ``ghp_...``, ``AIza...``, JWTs, ``Bearer ...``
    etc.) are stripped from every event payload before it hits the
    EventLog or any exporter.  Pass ``redact=None`` to opt out for
    a single Session, or ``unsafe_log_payloads=True`` for the same
    effect with a more searchable construction-site keyword.
    Passing your own ``redact=callable`` always wins — Session
    will not stack the default in front of a user redactor.
    """
    if redact_on_error not in ("fallback", "strict"):
        raise ValueError(
            f"Session(redact_on_error={redact_on_error!r}): must be "
            f"'fallback' (warn + pass through) or 'strict' (warn + "
            f"drop event)."
        )
    # Resolve the redactor.  Three valid input states:
    #   * not passed              → default to redact_secrets (safe)
    #   * redact=None             → explicit opt-out, no redaction
    #   * redact=callable         → user supplies their own
    # ``unsafe_log_payloads=True`` is the searchable alias for
    # explicit opt-out and only applies when no redactor was given.
    if redact is _REDACT_UNSET:
        self._redact = None if unsafe_log_payloads else redact_secrets
    else:
        self._redact = redact  # type: ignore[assignment]
    self.session_id = str(uuid.uuid4())
    self.events = EventLog(
        self.session_id,
        db=db,
        batched=batched,
        batch_size=batch_size,
        batch_interval=batch_interval,
        max_queue_size=max_queue_size,
        on_full=on_full,
        critical_events=critical_events,
    )
    self._exporters: list[Any] = list(exporters or [])
    # ``self._redact`` was resolved above based on the
    # _REDACT_UNSET sentinel + ``unsafe_log_payloads``.
    self._redact_on_error = redact_on_error
    self._lock = threading.Lock()
    # Phase-3 Block J: warn-once-per-(exporter-class, exception-class).
    # Pre-fix every emit() that hit a broken exporter spammed an
    # identical warning, drowning real diagnostics.  Counter maps the
    # ``(exporter_cls, exc_cls)`` pair to the number of suppressed
    # warnings since the first emission so operators can still see
    # the magnitude of the problem.
    self._exporter_warned_keys: set[tuple[str, str]] = set()
    self._exporter_warn_counts: dict[tuple[str, str], int] = {}
    self.graph = GraphSchema(session_id=self.session_id)
    if console:
        # Late import to avoid circular dependency with exporters
        from lazybridge.exporters import ConsoleExporter

        self._exporters.append(ConsoleExporter())
```

#### register_agent

```python
register_agent(agent: Any) -> None
```

Register an agent with this session's graph.

Source code in `lazybridge/session.py`

```python
def register_agent(self, agent: Any) -> None:
    """Register an agent with this session's graph."""
    self.graph.add_agent(agent)
```

#### register_tool_edge

```python
register_tool_edge(from_agent: Any, to_agent: Any, *, label: str = '') -> None
```

Record a tool-call edge between two registered agents.

Source code in `lazybridge/session.py`

```python
def register_tool_edge(self, from_agent: Any, to_agent: Any, *, label: str = "") -> None:
    """Record a tool-call edge between two registered agents."""
    from_id = str(getattr(from_agent, "name", "agent"))
    to_id = str(getattr(to_agent, "name", "agent"))
    from lazybridge.graph import EdgeType

    self.graph.add_edge(from_id, to_id, label=label, kind=EdgeType.TOOL)
```

#### flush

```python
flush(timeout: float = 5.0) -> None
```

Drain the EventLog's batched-writer queue.

No-op when `batched=False`. Useful before a checkpoint or a clean shutdown so recently-emitted events are persisted before the caller proceeds.

Source code in `lazybridge/session.py`

```python
def flush(self, timeout: float = 5.0) -> None:
    """Drain the EventLog's batched-writer queue.

    No-op when ``batched=False``.  Useful before a checkpoint or
    a clean shutdown so recently-emitted events are persisted
    before the caller proceeds.
    """
    self.events.flush(timeout=timeout)
```

#### close

```python
close() -> None
```

Release the underlying EventLog's SQLite connections.

Idempotent. Call this when a Session's lifetime ends (e.g. end of an HTTP request) so file descriptors don't linger until the owning thread exits. Using Session as a context manager is equivalent. Exporters that expose `close()` are flushed too — useful for OTelExporter's orphaned-span cleanup.

Source code in `lazybridge/session.py`

```python
def close(self) -> None:
    """Release the underlying EventLog's SQLite connections.

    Idempotent.  Call this when a Session's lifetime ends (e.g.
    end of an HTTP request) so file descriptors don't linger until
    the owning thread exits.  Using Session as a context manager is
    equivalent.  Exporters that expose ``close()`` are flushed too
    — useful for OTelExporter's orphaned-span cleanup.
    """
    self.events.close()
    for exp in list(self._exporters):
        close = getattr(exp, "close", None)
        if callable(close):
            try:
                close()
            except Exception:
                # Exporter shutdown must never mask the real reason
                # Session.close() is being called.
                pass
```

#### usage_summary

```python
usage_summary() -> dict[str, Any]
```

Aggregate token usage and cost across all agent runs in this session.

Returns a dict with

- "total": {input_tokens, output_tokens, cost_usd}
- "by_agent": {agent_name: {input_tokens, output_tokens, cost_usd}}
- "by_run": {run_id: {agent_name, input_tokens, output_tokens, cost_usd}}

O(events) with TWO queries total (AGENT_START + MODEL_RESPONSE). `EventLog.query` exposes `run_id` directly in the result dict.

Source code in `lazybridge/session.py`

```python
def usage_summary(self) -> dict[str, Any]:
    """Aggregate token usage and cost across all agent runs in this session.

    Returns a dict with:
      - "total": {input_tokens, output_tokens, cost_usd}
      - "by_agent": {agent_name: {input_tokens, output_tokens, cost_usd}}
      - "by_run":   {run_id:    {agent_name, input_tokens, output_tokens, cost_usd}}

    O(events) with TWO queries total (AGENT_START +
    MODEL_RESPONSE).  ``EventLog.query`` exposes ``run_id``
    directly in the result dict.
    """
    # Two bulk queries.  No per-row DB trip.
    agent_starts = self.events.query(event_type=EventType.AGENT_START)
    model_responses = self.events.query(event_type=EventType.MODEL_RESPONSE)

    # Build run_id → agent_name map.
    run_agent: dict[str, str] = {
        row["run_id"]: row["payload"].get("agent_name", "unknown") for row in agent_starts if row.get("run_id")
    }

    total = {"input_tokens": 0, "output_tokens": 0, "cost_usd": 0.0}
    by_agent: dict[str, dict[str, Any]] = {}
    by_run: dict[str, dict[str, Any]] = {}

    for row in model_responses:
        p = row["payload"]
        run_id = row.get("run_id")
        # Prefer the agent_name carried in the MODEL_RESPONSE payload
        # (LLMEngine populates it as the innermost agent name).  Falling
        # back to the AGENT_START → run_id map only matters for legacy
        # / external emitters that don't include the field.
        payload_agent = p.get("agent_name")
        if payload_agent:
            agent_name = str(payload_agent)
        elif run_id:
            agent_name = run_agent.get(run_id, "unknown")
        else:
            agent_name = "unknown"

        in_tok = p.get("input_tokens", 0) or 0
        out_tok = p.get("output_tokens", 0) or 0
        cost = p.get("cost_usd") or 0.0

        total["input_tokens"] += in_tok
        total["output_tokens"] += out_tok
        total["cost_usd"] += cost

        ag = by_agent.setdefault(agent_name, {"input_tokens": 0, "output_tokens": 0, "cost_usd": 0.0})
        ag["input_tokens"] += in_tok
        ag["output_tokens"] += out_tok
        ag["cost_usd"] += cost

        if run_id:
            rn = by_run.setdefault(
                run_id, {"agent_name": agent_name, "input_tokens": 0, "output_tokens": 0, "cost_usd": 0.0}
            )
            rn["input_tokens"] += in_tok
            rn["output_tokens"] += out_tok
            rn["cost_usd"] += cost

    return {"total": total, "by_agent": by_agent, "by_run": by_run}
```

## Event log + types

### lazybridge.EventLog

```python
EventLog(session_id: str, db: str | None = None, *, batched: bool = False, batch_size: int = 100, batch_interval: float = 1.0, max_queue_size: int = 10000, on_full: Literal['drop', 'block', 'hybrid'] = 'hybrid', critical_events: frozenset[str] | set[str] | None = None)
```

SQLite-backed event log. Thread-safe via thread-local connections.

By default `record()` performs an `INSERT + COMMIT` per event on the calling thread. That is fine for low event rates but becomes a bottleneck under sustained load. Pass `batched=True` to delegate persistence to a background daemon thread that drains a bounded queue and commits in batches; the hot path becomes a non-blocking `queue.put_nowait`.

Source code in `lazybridge/session.py`

```python
def __init__(
    self,
    session_id: str,
    db: str | None = None,
    *,
    batched: bool = False,
    batch_size: int = 100,
    batch_interval: float = 1.0,
    max_queue_size: int = 10_000,
    on_full: Literal["drop", "block", "hybrid"] = "hybrid",
    critical_events: frozenset[str] | set[str] | None = None,
) -> None:
    self.session_id = session_id
    self._db = db
    if batched:
        if batch_size < 1:
            raise ValueError(f"batch_size must be >= 1, got {batch_size!r}")
        if batch_interval <= 0:
            raise ValueError(f"batch_interval must be > 0, got {batch_interval!r}")
        if max_queue_size < 1:
            raise ValueError(f"max_queue_size must be >= 1, got {max_queue_size!r}")
        if on_full not in ("drop", "block", "hybrid"):
            raise ValueError(f"on_full must be 'drop', 'block', or 'hybrid', got {on_full!r}")
    # In-memory SQLite needs a SHARED cache otherwise every thread
    # gets its own isolated DB and events emitted from worker
    # threads (e.g. ``SupervisorEngine`` via ``asyncio.to_thread``)
    # land in a DB the main thread can never see.  Using the
    # ``file::memory:?cache=shared`` URI uniquely named per
    # ``session_id`` gives us one shared in-memory DB per Session.
    self._uri: str | None
    if db is None:
        self._uri = f"file:memdb_{session_id}?mode=memory&cache=shared"
    else:
        self._uri = None
    self._local = threading.local()
    self._lock = threading.Lock()
    # Registry of every thread-local connection we've handed out.
    # ``close()`` walks it to release file descriptors deterministically;
    # without this, worker-thread connections leak until GC runs.
    self._all_conns: list[sqlite3.Connection] = []
    self._closed = False
    # Keep one anchor connection alive for in-memory DBs — SQLite
    # drops a ``file::memory:?cache=shared`` DB as soon as the last
    # connection closes, so without the anchor the first cleanup
    # of a thread-local conn would wipe the table.
    if self._uri is not None:
        self._anchor: sqlite3.Connection | None = sqlite3.connect(
            self._uri,
            uri=True,
            check_same_thread=False,
        )
    else:
        self._anchor = None
    self._init_schema()

    # Batched writer state — set up after schema so the background
    # thread can rely on the events table existing on first flush.
    self._batched = batched
    self._batch_size = batch_size
    self._batch_interval = batch_interval
    self._max_queue_size = max_queue_size
    self._on_full = on_full
    # Per-event-type back-pressure — the ``"hybrid"`` policy uses
    # this set to decide which events block under saturation.  A
    # caller-supplied set wins over the defaults (frozen for safety
    # against post-construction mutation).
    self._critical_events: frozenset[str] = (
        frozenset(critical_events) if critical_events is not None else DEFAULT_CRITICAL_EVENT_TYPES
    )
    self._dropped_count = 0
    self._dropped_critical_count = 0
    self._writer_thread: threading.Thread | None = None
    # The queue carries event tuples plus a singleton ``_FLUSH_SENTINEL``
    # (``object()``) used to wake the writer thread on shutdown.
    self._writer_queue: queue.Queue[tuple | object] | None = None
    self._writer_stop = threading.Event()
    if batched:
        self._writer_queue = queue.Queue(maxsize=max_queue_size)
        self._writer_thread = threading.Thread(
            target=self._writer_run,
            name=f"lazybridge-eventlog-{session_id[:8]}",
            daemon=True,
        )
        self._writer_thread.start()
```

#### close

```python
close() -> None
```

Close every thread-local connection and the anchor (if any).

Idempotent. After `close()` further `record` / `query` calls raise `RuntimeError`. Required for deterministic FD cleanup in long-running services that spawn Sessions per request.

The lock is held across the entire shutdown so a concurrent `record` that already obtained a connection via `_conn` can't race ahead and commit against a connection we're about to close — SQLite would otherwise raise `ProgrammingError: Cannot operate on a closed database`.

When batching is enabled the background writer is signalled to drain its queue and exit before connections are released.

Source code in `lazybridge/session.py`

```python
def close(self) -> None:
    """Close every thread-local connection and the anchor (if any).

    Idempotent.  After ``close()`` further ``record`` / ``query``
    calls raise ``RuntimeError``.  Required for deterministic FD
    cleanup in long-running services that spawn Sessions per
    request.

    The lock is held across the entire shutdown so a concurrent
    ``record`` that already obtained a connection via ``_conn``
    can't race ahead and commit against a connection we're about
    to close — SQLite would otherwise raise ``ProgrammingError:
    Cannot operate on a closed database``.

    When batching is enabled the background writer is signalled to
    drain its queue and exit before connections are released.
    """
    # Idempotent: a second close() (typically from ``__del__`` at
    # GC time) must not re-trigger flush() — that would push a
    # sentinel into a queue whose writer thread has already
    # exited, causing flush() to block for the full timeout
    # waiting for an ack that never comes.
    if self._closed:
        return
    # Drain + stop the writer first.  Done outside the lock so the
    # writer thread can finish using the EventLog's connections;
    # ``record_many`` checks ``self._closed`` itself.  Order:
    # (1) flush so any pending events land, (2) set stop and push a
    # sentinel so the writer wakes from its long ``queue.get``
    # timeout immediately, (3) join.
    if self._batched and self._writer_thread is not None:
        self.flush(timeout=5.0)
        self._writer_stop.set()
        assert self._writer_queue is not None
        try:
            self._writer_queue.put_nowait(_FLUSH_SENTINEL)
        except queue.Full:
            # Queue is saturated — the writer is busy and will see
            # the stop flag on its next iteration anyway.
            pass
        self._writer_thread.join(timeout=5.0)
    with self._lock:
        if self._closed:
            return
        self._closed = True
        conns = list(self._all_conns)
        self._all_conns.clear()
        anchor = self._anchor
        self._anchor = None
        for c in conns:
            try:
                c.close()
            except sqlite3.Error:
                pass
        if anchor is not None:
            try:
                anchor.close()
            except sqlite3.Error:
                pass
```

#### record_many

```python
record_many(rows: list[tuple]) -> None
```

Insert a batch of pre-serialised rows in a single transaction.

Each `row` is the 5-tuple `(session_id, run_id, event_type, payload_json, ts)` — the on-disk shape, not a dict. Used by the background batched writer; callers should use :meth:`record` instead.

Source code in `lazybridge/session.py`

```python
def record_many(self, rows: list[tuple]) -> None:
    """Insert a batch of pre-serialised rows in a single transaction.

    Each ``row`` is the 5-tuple
    ``(session_id, run_id, event_type, payload_json, ts)`` —
    the on-disk shape, not a dict.  Used by the background batched
    writer; callers should use :meth:`record` instead.
    """
    if not rows:
        return
    # Fast-path check: if ``close()`` has fired we fail fast instead
    # of executing against a connection that's about to disappear.
    # Same race semantics as :meth:`record` — bounded to a single
    # ``executemany + COMMIT``; SQLite will either succeed or raise
    # ``ProgrammingError``, which the background writer logs and
    # drops the row batch.
    if self._closed:
        raise RuntimeError("EventLog is closed")
    conn = self._conn()
    conn.executemany(
        "INSERT INTO events (session_id, run_id, event_type, payload, ts) VALUES (?,?,?,?,?)",
        rows,
    )
    conn.commit()
```

#### flush

```python
flush(timeout: float = 5.0) -> None
```

Block until every event submitted before the call is persisted.

No-op when `batched=False`. Pushes a flush sentinel so the writer commits its current batch immediately rather than waiting for `batch_size` or `batch_interval`, then waits on the queue's `task_done` accounting. Returns early if `timeout` elapses; the queue may still have items in that case.

Source code in `lazybridge/session.py`

```python
def flush(self, timeout: float = 5.0) -> None:
    """Block until every event submitted before the call is persisted.

    No-op when ``batched=False``.  Pushes a flush sentinel so the
    writer commits its current batch immediately rather than waiting
    for ``batch_size`` or ``batch_interval``, then waits on the
    queue's ``task_done`` accounting.  Returns early if ``timeout``
    elapses; the queue may still have items in that case.
    """
    if not self._batched or self._writer_queue is None:
        return
    try:
        self._writer_queue.put_nowait(_FLUSH_SENTINEL)
    except queue.Full:
        # If the queue is saturated, fall back to a blocking put so
        # flush still has well-defined semantics — accepting the
        # backpressure cost is the right trade-off here since the
        # caller asked for a synchronous barrier.
        self._writer_queue.put(_FLUSH_SENTINEL)
    # ``Queue.join`` has no timeout in stdlib; use ``unfinished_tasks``
    # plus ``all_tasks_done`` polling under the queue's own lock.
    deadline = time.monotonic() + timeout
    with self._writer_queue.all_tasks_done:
        while self._writer_queue.unfinished_tasks > 0:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                return
            self._writer_queue.all_tasks_done.wait(timeout=remaining)
```

### lazybridge.EventType

Bases: `StrEnum`

## Graph topology

### lazybridge.GraphSchema

```python
GraphSchema(session_id: str = '')
```

Directed graph of agents, routers, and their connections.

Auto-populated by :class:`lazybridge.Session` as Agents register themselves via `session=`. Can also be built manually for GUI-driven pipeline construction.

Source code in `lazybridge/graph/schema.py`

```python
def __init__(self, session_id: str = "") -> None:
    self.session_id = session_id
    self._nodes: dict[str, _BaseNode] = {}
    self._edges: list[Edge] = []
```

#### add_agent

```python
add_agent(agent: Any) -> None
```

Register an Agent (or duck-typed equivalent) as a graph node.

Reads `id` / `name` off the agent, and infers provider + model from either legacy `_provider_name` / `_model_name` attributes or from `agent.engine` on the v1 :class:`~lazybridge.Agent`.

Also registers any Python-callable tools the agent exposes as `ToolNode` stubs so the graph is fully visible before any events are emitted (static inspection / demo mode).

Source code in `lazybridge/graph/schema.py`

```python
def add_agent(self, agent: Any) -> None:
    """Register an Agent (or duck-typed equivalent) as a graph node.

    Reads ``id`` / ``name`` off the agent, and infers provider + model
    from either legacy ``_provider_name`` / ``_model_name`` attributes
    or from ``agent.engine`` on the v1 :class:`~lazybridge.Agent`.

    Also registers any Python-callable tools the agent exposes as
    ``ToolNode`` stubs so the graph is fully visible before any
    events are emitted (static inspection / demo mode).
    """
    provider, model = _derive_provider_model(agent)
    node_id = str(getattr(agent, "id", None) or getattr(agent, "name", "agent"))

    # Canonical engine kind — lets the viz distinguish LLM nodes from Plan
    # orchestrators or custom engines without needing to parse the model string.
    engine = getattr(agent, "engine", None)
    engine_type = engine.__class__.__name__ if engine is not None else ""

    # Collect names of Python-callable tools (not agent-as-tool entries) for
    # the node badge so the inspector can list them without reading edges.
    tool_map = getattr(agent, "_tool_map", None) or {}
    callable_tool_names: list[str] = []
    for tool_name, tool_obj in tool_map.items():
        # Agent-as-tool wrappers have returns_envelope=True (set by _agent_as_tool).
        # Plain Tool.from_fn() wrappers have returns_envelope=False.
        # This is the only stable discriminator: agent_memory/agent_store can both
        # be None for an agent-as-tool when the source agent has no memory or store.
        if not getattr(tool_obj, "returns_envelope", False):
            callable_tool_names.append(tool_name)

    node = AgentNode(
        id=node_id,
        name=getattr(agent, "name", node_id),
        provider=provider,
        model=model,
        system=getattr(agent, "system", None),
        engine_type=engine_type,
        tools=callable_tool_names,
    )
    self._nodes[node.id] = node

    # Register Python-callable tool functions as ToolNode stubs so the
    # full pipeline topology is visible before execution starts.
    # Agent-as-tool wrappers (returns_envelope=True) register themselves
    # as AgentNodes separately and must not get a duplicate _ToolNode stub.
    for tool_name, tool_obj in tool_map.items():
        if getattr(tool_obj, "returns_envelope", False):
            continue
        tool_id = f"tool:{tool_name}"
        if tool_id not in self._nodes:
            self._nodes[tool_id] = _ToolNode(id=tool_id, name=tool_name)
        self.add_edge(node_id, tool_id, label=tool_name, kind=EdgeType.TOOL)
```

#### add_router

```python
add_router(router: Any) -> None
```

Register a router (e.g. a Plan) as a graph node.

The router object is expected to expose `to_graph_node()` that returns `{id, name, routes, default}`.

Source code in `lazybridge/graph/schema.py`

```python
def add_router(self, router: Any) -> None:
    """Register a router (e.g. a Plan) as a graph node.

    The router object is expected to expose ``to_graph_node()`` that
    returns ``{id, name, routes, default}``.
    """
    node_dict = router.to_graph_node()
    node = RouterNode(
        id=node_dict.get("id", node_dict.get("name", "")),
        name=node_dict.get("name", ""),
        routes=node_dict.get("routes", {}),
        default=node_dict.get("default"),
    )
    self._nodes[node.id] = node
```

#### clear

```python
clear() -> None
```

Drop all nodes and edges.

Useful when re-using one `Session` across multiple pipeline runs and you want each run's graph to start empty. Session's `session_id` is preserved so event correlation still works.

Source code in `lazybridge/graph/schema.py`

```python
def clear(self) -> None:
    """Drop all nodes and edges.

    Useful when re-using one ``Session`` across multiple pipeline
    runs and you want each run's graph to start empty.  Session's
    ``session_id`` is preserved so event correlation still works.
    """
    self._nodes.clear()
    self._edges.clear()
```

#### save

```python
save(path: str) -> None
```

Save schema to JSON or YAML depending on file extension.

Source code in `lazybridge/graph/schema.py`

```python
def save(self, path: str) -> None:
    """Save schema to JSON or YAML depending on file extension."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(self.to_yaml() if _is_yaml_path(path) else self.to_json())
```

## Exporters

### lazybridge.EventExporter

Bases: `Protocol`

Protocol satisfied by all exporter classes.

### lazybridge.CallbackExporter

```python
CallbackExporter(*, fn: Callable[[dict[str, Any]], None])
```

Forward every event to a user-supplied callable.

Source code in `lazybridge/exporters.py`

```python
def __init__(self, *, fn: Callable[[dict[str, Any]], None]) -> None:
    self._fn = fn
```

### lazybridge.ConsoleExporter

```python
ConsoleExporter(*, stream: Any = None)
```

Pretty-print events to stdout for human inspection.

Output format (one line per event)::

```text
[agent_name] event_type  key=value  key=value
```

Installed automatically by `Session(console=True)` and `Agent(verbose=True)`; can also be added manually via `Session(exporters=[ConsoleExporter()])`.

Source code in `lazybridge/exporters.py`

```python
def __init__(self, *, stream: Any = None) -> None:
    import sys

    self._stream = stream or sys.stdout
```

### lazybridge.FilteredExporter

```python
FilteredExporter(*, inner: Any, event_types: set[str])
```

Forward only events whose type is in `event_types` to `inner`.

Source code in `lazybridge/exporters.py`

```python
def __init__(self, *, inner: Any, event_types: set[str]) -> None:
    self._inner = inner
    self._types = event_types
```

### lazybridge.JsonFileExporter

```python
JsonFileExporter(*, path: str)
```

Append each event as a JSON line to `path`.

F7: keeps the file handle open across calls instead of opening and closing it on every event. Under a typical agent run with 50-200 events the original per-call open/fwrite/close caused O(n) filesystem syscalls. `close()` is called automatically by `Session.close()` when it iterates its exporter list.

Source code in `lazybridge/exporters.py`

```python
def __init__(self, *, path: str) -> None:
    self._path = path
    self._fh = open(path, "a", encoding="utf-8")  # noqa: SIM115
    self._lock = threading.Lock()
```

#### close

```python
close() -> None
```

Flush and close the underlying file handle. Idempotent.

Source code in `lazybridge/exporters.py`

```python
def close(self) -> None:
    """Flush and close the underlying file handle. Idempotent."""
    with self._lock:
        try:
            if not self._fh.closed:
                self._fh.flush()
                self._fh.close()
        except Exception:
            pass
```

### lazybridge.StructuredLogExporter

```python
StructuredLogExporter(*, logger_name: str = 'lazybridge')
```

Emit each event via Python's `logging` module.

Source code in `lazybridge/exporters.py`

```python
def __init__(self, *, logger_name: str = "lazybridge") -> None:
    self._log = logging.getLogger(logger_name)
```

# State primitives

`Memory` is the per-agent conversation history layer. `Store` is the shared, optionally-persistent key-value blackboard.

For narrative usage see [Guides → Mid → Memory](https://core.lazybridge.com/guides/mid/memory/index.md) and [Guides → Mid → Store](https://core.lazybridge.com/guides/mid/store/index.md). For wiring data between Plan steps see [Sentinels & predicates](https://core.lazybridge.com/reference/sentinels/index.md).

### lazybridge.Memory

```python
Memory(*, strategy: Literal['auto', 'sliding', 'summary', 'none'] = 'auto', max_tokens: int | None = 4000, max_turns: int | None = _DEFAULT_MAX_TURNS, store: Any | None = None, summarizer: Any | None = None, summarizer_timeout: float | None = _DEFAULT_SUMMARIZER_TIMEOUT)
```

Conversation memory with configurable compression strategy.

Per-agent use: memory=Memory() on the Agent — tracks its message history. Shared use: sources=[memory] on multiple Agents — live view of shared text.

The `text()` method returns the current memory as a context string, re-read on every invocation (live view — never a stale snapshot).

##### LLM summarization

Pass any callable (typically an `Agent`) as `summarizer=` to enable LLM-based compression instead of the keyword-extraction fallback::

```text
summarizer = Agent("claude-haiku-4-5-20251001", system="Summarize conversations concisely.")
memory = Memory(strategy="summary", summarizer=summarizer)
agent  = Agent("claude-opus-4-7", memory=memory)
```

When compression triggers the summarizer is called with a formatted transcript of the turns being dropped. Three callable shapes are handled transparently:

- Sync callable returning a string / Envelope / anything with `.text()` — called directly.
- `Agent` — its `__call__` already bridges async internally.
- Plain `async def summarize(prompt): ...` — the returned coroutine is driven to completion (in a worker thread when called from inside an event loop, to avoid nested-loop errors).

If the summarizer raises, compression falls back to keyword extraction — never silent garbage.

Create a Memory instance.

##### Parameters

strategy: `"auto"` — compress when token budget is exceeded (requires `max_tokens` to be set). `"sliding"` — compress by turn count regardless of `max_tokens`; does NOT require a token budget. `"summary"` — same trigger as `"sliding"` but uses the LLM `summarizer` instead of keyword extraction. `"none"` — no compression; only the hard `max_turns` cap applies. max_tokens: Token budget for `strategy="auto"`. Ignored by `"sliding"` and `"summary"` (they compress by turn count). `None` with `strategy="auto"` disables compression entirely. max_turns: Hard cap on retained turns after compression runs. `None` disables the cap (unbounded history — only safe for short sessions). Default: :attr:`_DEFAULT_MAX_TURNS` (1000). store: Optional :class:`~lazybridge.store.Store` for persistent memory across sessions. summarizer: Callable used by `strategy="summary"` — typically an :class:`~lazybridge.agent.Agent`. See the class docstring for accepted shapes. summarizer_timeout: Deadline (seconds) applied to async summariser calls. On timeout the keyword-extraction fallback runs and a one-shot warning is emitted via :attr:`_summarizer_warned`. `None` disables the deadline. Default: 30 s.

Source code in `lazybridge/memory.py`

```python
def __init__(
    self,
    *,
    strategy: Literal["auto", "sliding", "summary", "none"] = "auto",
    max_tokens: int | None = 4000,
    max_turns: int | None = _DEFAULT_MAX_TURNS,
    store: Any | None = None,
    summarizer: Any | None = None,
    summarizer_timeout: float | None = _DEFAULT_SUMMARIZER_TIMEOUT,
) -> None:
    """Create a Memory instance.

    Parameters
    ----------
    strategy:
        ``"auto"``    — compress when token budget is exceeded
        (requires ``max_tokens`` to be set).
        ``"sliding"`` — compress by turn count regardless of
        ``max_tokens``; does NOT require a token budget.
        ``"summary"`` — same trigger as ``"sliding"`` but uses the
        LLM ``summarizer`` instead of keyword extraction.
        ``"none"``    — no compression; only the hard ``max_turns``
        cap applies.
    max_tokens:
        Token budget for ``strategy="auto"``.  Ignored by
        ``"sliding"`` and ``"summary"`` (they compress by turn
        count).  ``None`` with ``strategy="auto"`` disables
        compression entirely.
    max_turns:
        Hard cap on retained turns after compression runs.  ``None``
        disables the cap (unbounded history — only safe for short
        sessions).  Default: :attr:`_DEFAULT_MAX_TURNS` (1000).
    store:
        Optional :class:`~lazybridge.store.Store` for persistent
        memory across sessions.
    summarizer:
        Callable used by ``strategy="summary"`` — typically an
        :class:`~lazybridge.agent.Agent`.  See the class docstring
        for accepted shapes.
    summarizer_timeout:
        Deadline (seconds) applied to async summariser calls.  On
        timeout the keyword-extraction fallback runs and a one-shot
        warning is emitted via :attr:`_summarizer_warned`.  ``None``
        disables the deadline.  Default: 30 s.
    """
    self.strategy = strategy
    self.max_tokens = max_tokens
    self.max_turns = max_turns
    self.store = store
    self._turns: list[_Turn] = []
    self._lock = threading.Lock()
    self._summary: str = ""
    self._overflow_warned = False
    self._summarizer = summarizer
    if summarizer_timeout is not None and summarizer_timeout <= 0:
        raise ValueError(f"summarizer_timeout must be > 0 or None, got {summarizer_timeout!r}")
    # Phase-2 Block C: warn on too-tight timeouts.  The summariser
    # spawns a worker thread + JSON serialisation + LLM round-trip;
    # values under 5 s typically time out on every call (silent
    # always-fail trap).
    if summarizer_timeout is not None and summarizer_timeout < 5.0:
        import warnings as _warnings

        _warnings.warn(
            f"Memory(summarizer_timeout={summarizer_timeout!r}): values under 5.0 s often "
            f"trigger a timeout on every summariser invocation (worker-thread setup + LLM "
            f"round-trip).  Consider 30.0 s (the production default) unless you have data "
            f"showing your summariser is reliably faster.",
            UserWarning,
            stacklevel=2,
        )
    self.summarizer_timeout = summarizer_timeout
    # Guard against overlapping summariser calls when ``add()`` is
    # invoked concurrently.  Only one compression runs at a time;
    # other ``add()``s append turns and skip the compression branch.
    self._compressing = False
    # Separate one-shot warning flags for distinct warning sources so
    # a summariser timeout doesn't silence the turn-cap warning and
    # vice versa.
    self._summarizer_warned = False
```

#### messages

```python
messages() -> list[Message]
```

Return full message list including summary prefix if compressed.

Source code in `lazybridge/memory.py`

```python
def messages(self) -> list[Message]:
    """Return full message list including summary prefix if compressed."""
    with self._lock:
        result: list[Message] = []
        if self._summary:
            result.append(Message(role=Role.USER, content=f"Context from earlier: {self._summary}"))
            result.append(Message(role=Role.ASSISTANT, content="Understood."))
        for t in self._turns:
            result.append(Message(role=Role.USER, content=t.user))
            result.append(Message(role=Role.ASSISTANT, content=t.assistant))
        return result
```

#### text

```python
text() -> str
```

Return current memory as a plain-text string (live view).

Source code in `lazybridge/memory.py`

```python
def text(self) -> str:
    """Return current memory as a plain-text string (live view)."""
    with self._lock:
        parts: list[str] = []
        if self._summary:
            parts.append(self._summary)
        for t in self._turns[-5:]:
            parts.append(f"User: {t.user}\nAssistant: {t.assistant}")
        return "\n\n".join(parts)
```

### lazybridge.Store

```python
Store(db: str | None = None)
```

Key-value store for PlanState and shared data.

db=None → in-memory (lost on exit). db="file" → SQLite with WAL mode (persistent).

Source code in `lazybridge/store/__init__.py`

```python
def __init__(self, db: str | None = None) -> None:
    self._db = db
    self._local = threading.local()
    # ``RLock`` (not ``Lock``) so internal helpers like
    # ``_discard_thread_conn`` can safely re-enter the lock when
    # called from a code path (``compare_and_swap``) that already
    # holds it.
    self._lock = threading.RLock()
    # Track every opened thread-local connection so we can close
    # them deterministically from ``close()``.  ``threading.local``
    # only scopes attributes per thread; without a registry the
    # connections linger until the owning thread exits + GC runs,
    # which leaks file descriptors under worker pools.
    self._all_conns: list[sqlite3.Connection] = []
    self._closed = False
    if not db:
        self._mem: dict[str, StoreEntry] = {}
    self._init_schema()
```

#### close

```python
close() -> None
```

Close every thread-local SQLite connection opened by this Store.

Idempotent. After `close()` the Store raises `RuntimeError` on further reads / writes so callers fail fast instead of silently re-opening connections.

Source code in `lazybridge/store/__init__.py`

```python
def close(self) -> None:
    """Close every thread-local SQLite connection opened by this Store.

    Idempotent.  After ``close()`` the Store raises ``RuntimeError``
    on further reads / writes so callers fail fast instead of
    silently re-opening connections.
    """
    with self._lock:
        if self._closed:
            return
        self._closed = True
        conns = list(self._all_conns)
        self._all_conns.clear()
    for c in conns:
        try:
            c.close()
        except sqlite3.Error:
            pass  # already closed / invalid — nothing to recover
```

#### write

```python
write(key: str, value: Any, *, agent_id: str | None = None) -> None
```

Store `value` under `key`.

**Copy semantics (in-memory path)**: the value is deep-copied before storage via :func:`_deep_copy_safe` so that callers mutating the original object after `write()` cannot silently alter the stored value and defeat :meth:`compare_and_swap`. The SQLite path gets equivalent isolation for free via the JSON round-trip.

Pydantic instances are serialised via `model_dump(mode="json")` before JSON encoding — otherwise SQLite storage would fall through `default=str` and persist the model's `__repr__` (e.g. `"x=42 name='hello'"`), which is NOT round-trippable. In-memory storage keeps the instance as-is since Python dicts don't need JSON round-tripping.

Source code in `lazybridge/store/__init__.py`

```python
def write(self, key: str, value: Any, *, agent_id: str | None = None) -> None:
    """Store ``value`` under ``key``.

    **Copy semantics (in-memory path)**: the value is deep-copied
    before storage via :func:`_deep_copy_safe` so that callers
    mutating the original object after ``write()`` cannot silently
    alter the stored value and defeat :meth:`compare_and_swap`.
    The SQLite path gets equivalent isolation for free via the
    JSON round-trip.

    Pydantic instances are serialised via ``model_dump(mode="json")``
    before JSON encoding — otherwise SQLite storage would fall
    through ``default=str`` and persist the model's ``__repr__``
    (e.g. ``"x=42 name='hello'"``), which is NOT round-trippable.
    In-memory storage keeps the instance as-is since Python dicts
    don't need JSON round-tripping.
    """
    if self._db:
        serialised = _to_jsonable(value)
        self._conn().execute(
            "INSERT OR REPLACE INTO store (key, value, written_at, agent_id) VALUES (?,?,?,?)",
            (key, json.dumps(serialised, default=str), time.time(), agent_id),
        )
        self._conn().commit()
    else:
        with self._lock:
            self._mem[key] = StoreEntry(key=key, value=_deep_copy_safe(value), agent_id=agent_id)
```

#### read

```python
read(key: str, default: Any = None) -> Any
```

Return the value stored under `key`, or `default`.

**Copy semantics (in-memory path)**: returns a deep copy of the stored value via :func:`_deep_copy_safe` so that callers mutating the returned object cannot silently alter the stored value and defeat :meth:`compare_and_swap`. The SQLite path returns a fresh object on every call because `json.loads` always allocates new containers.

Source code in `lazybridge/store/__init__.py`

```python
def read(self, key: str, default: Any = None) -> Any:
    """Return the value stored under ``key``, or ``default``.

    **Copy semantics (in-memory path)**: returns a deep copy of the
    stored value via :func:`_deep_copy_safe` so that callers mutating
    the returned object cannot silently alter the stored value and
    defeat :meth:`compare_and_swap`.  The SQLite path returns a fresh
    object on every call because ``json.loads`` always allocates new
    containers.
    """
    if self._db:
        row = self._conn().execute("SELECT value FROM store WHERE key=?", (key,)).fetchone()
        return json.loads(row["value"]) if row else default
    with self._lock:
        entry = self._mem.get(key)
        return _deep_copy_safe(entry.value) if entry else default
```

#### items

```python
items(*, prefix: str | None = None) -> list[tuple[str, Any]]
```

Return `(key, value)` pairs, optionally restricted to keys starting with `prefix`.

Uses an indexed B-tree range scan on the SQLite path (a single `WHERE key >= ? AND key < ?` query) — sub-linear in the total keyspace. The in-memory path filters under the store lock.

###### Parameters

prefix: Optional key prefix to filter by. `None` (default) or `""` returns every pair in the store. Any other string restricts the results to keys starting with that prefix.

Source code in `lazybridge/store/__init__.py`

```python
def items(self, *, prefix: str | None = None) -> list[tuple[str, Any]]:
    """Return ``(key, value)`` pairs, optionally restricted to keys starting
    with ``prefix``.

    Uses an indexed B-tree range scan on the SQLite path (a single
    ``WHERE key >= ? AND key < ?`` query) — sub-linear in the total
    keyspace.  The in-memory path filters under the store lock.

    Parameters
    ----------
    prefix:
        Optional key prefix to filter by. ``None`` (default) or ``""``
        returns every pair in the store. Any other string restricts the
        results to keys starting with that prefix.
    """
    if self._db:
        if not prefix:
            rows = self._conn().execute("SELECT key, value FROM store").fetchall()
        else:
            upper = _prefix_upper_bound(prefix)
            if upper is not None:
                rows = (
                    self._conn()
                    .execute(
                        "SELECT key, value FROM store WHERE key >= ? AND key < ?",
                        (prefix, upper),
                    )
                    .fetchall()
                )
            else:
                # All characters in the prefix are U+10FFFF — no finite upper
                # bound; fall back to a full scan with Python startswith filter.
                rows = [
                    r
                    for r in self._conn().execute("SELECT key, value FROM store").fetchall()
                    if r["key"].startswith(prefix)
                ]
        return [(r["key"], json.loads(r["value"])) for r in rows]
    with self._lock:
        return [(k, _deep_copy_safe(v.value)) for k, v in self._mem.items() if not prefix or k.startswith(prefix)]
```

#### compare_and_swap

```python
compare_and_swap(key: str, expected: Any, new: Any, *, agent_id: str | None = None) -> bool
```

Atomically set `key` to `new` only if its current value deep-equals `expected`.

`expected=None` means "the key must not currently exist" (a missing key compares equal to `None`).

Returns `True` on success, `False` when another writer has moved the value since the caller's last read — the caller is expected to treat this as a lost race (raise, back off, re-read, etc.). Comparison uses the JSON-normalised shape (via :func:`_to_jsonable`), so Pydantic models / nested dicts compare the same as they do on disk and survive SQLite round-trips.

SQLite path: wraps the read-check-write in `BEGIN IMMEDIATE` so concurrent writers serialise on the SQLite reserved lock instead of interleaving. Exceptions are caught as `(sqlite3.Error, ValueError)` — the `ValueError` arm covers `json.JSONDecodeError` (a subclass) raised by `json.loads` on corrupt rows; without it, a corrupt row would leave the `BEGIN IMMEDIATE` transaction open on the thread-local connection and poison every subsequent call on that thread.

Source code in `lazybridge/store/__init__.py`

```python
def compare_and_swap(
    self,
    key: str,
    expected: Any,
    new: Any,
    *,
    agent_id: str | None = None,
) -> bool:
    """Atomically set ``key`` to ``new`` only if its current value
    deep-equals ``expected``.

    ``expected=None`` means "the key must not currently exist" (a
    missing key compares equal to ``None``).

    Returns ``True`` on success, ``False`` when another writer has
    moved the value since the caller's last read — the caller is
    expected to treat this as a lost race (raise, back off, re-read,
    etc.).  Comparison uses the JSON-normalised shape (via
    :func:`_to_jsonable`), so Pydantic models / nested dicts compare
    the same as they do on disk and survive SQLite round-trips.

    SQLite path: wraps the read-check-write in ``BEGIN IMMEDIATE``
    so concurrent writers serialise on the SQLite reserved lock
    instead of interleaving.  Exceptions are caught as
    ``(sqlite3.Error, ValueError)`` — the ``ValueError`` arm covers
    ``json.JSONDecodeError`` (a subclass) raised by ``json.loads``
    on corrupt rows; without it, a corrupt row would leave the
    ``BEGIN IMMEDIATE`` transaction open on the thread-local
    connection and poison every subsequent call on that thread.
    """
    if self._closed:
        raise RuntimeError("Store is closed")
    if not self._db:
        with self._lock:
            entry = self._mem.get(key)
            cur = entry.value if entry else None
            if not _json_eq(cur, expected):
                return False
            self._mem[key] = StoreEntry(key=key, value=_deep_copy_safe(new), agent_id=agent_id)
            return True

    # SQLite path — serialise concurrent writers via reserved lock.
    conn = self._conn()
    serialised_new = json.dumps(_to_jsonable(new), default=str)
    with self._lock:
        try:
            conn.execute("BEGIN IMMEDIATE")
            row = conn.execute("SELECT value FROM store WHERE key=?", (key,)).fetchone()
            cur = json.loads(row["value"]) if row else None
            if not _json_eq(cur, expected):
                conn.execute("ROLLBACK")
                return False
            conn.execute(
                "INSERT OR REPLACE INTO store (key, value, written_at, agent_id) VALUES (?,?,?,?)",
                (key, serialised_new, time.time(), agent_id),
            )
            conn.commit()
            return True
        except (sqlite3.Error, ValueError):
            # Catch both sqlite3.Error (transaction/IO failures) and
            # ValueError/JSONDecodeError (corrupt JSON in the row).
            # Without the ValueError arm, a corrupt row causes
            # json.loads to raise inside the BEGIN IMMEDIATE block,
            # leaving the transaction open on the thread-local
            # connection and poisoning every subsequent call on that thread.
            try:
                conn.execute("ROLLBACK")
            except sqlite3.Error as rollback_exc:
                import warnings as _warnings

                _warnings.warn(
                    f"Store.compare_and_swap: ROLLBACK after error failed "
                    f"({type(rollback_exc).__name__}: {rollback_exc}).  "
                    f"Discarding the thread-local connection so the next "
                    f"call gets a fresh one.",
                    UserWarning,
                    stacklevel=3,
                )
                self._discard_thread_conn()
            raise
```

## At-rest encryption

`EncryptedStoreAdapter` wraps any `Store` and encrypts values before they hit the inner store; reads decrypt transparently. Keys are **not** encrypted — `Store` treats keys as opaque routing strings, and encrypting them would break iteration / `__contains__`.

Install the optional extra:

```bash
pip install 'lazybridge[encryption]'
```

The extra pulls [`cryptography`](https://cryptography.io); the default install stays pure-Python.

```python
from cryptography.fernet import Fernet
from lazybridge.store import Store
from lazybridge.store.encryption import EncryptedStoreAdapter

key = Fernet.generate_key()  # persist somewhere safe (KMS, env var, etc.)
store = EncryptedStoreAdapter(Store(db="state.sqlite"), key=key)

store.write("agent.notes", {"draft": "secret thoughts"})
# SQLite row holds an `lb-enc-v1::` Fernet token, not the JSON.

store.read("agent.notes")
# {"draft": "secret thoughts"}
```

For key rotation use `MultiFernet` semantics by passing a list — the first key is used for new writes, every key is tried on read:

```python
store = EncryptedStoreAdapter(Store(...), key=[new_key, old_key])
```

### lazybridge.store.encryption.EncryptedStoreAdapter

```python
EncryptedStoreAdapter(inner: Store, *, key: bytes | list[bytes])
```

Encrypt values written to an inner :class:`Store`.

##### Parameters

inner: The Store to wrap. Any Store flavour works (in-memory or SQLite-backed); the adapter doesn't care about persistence. key: A Fernet key (`bytes`) or list of keys. A list activates :class:`cryptography.fernet.MultiFernet` for rotation — encryption uses the first key, decryption tries each in order until one succeeds.

##### Raises

ImportError: If `cryptography` isn't installed. Install `lazybridge[encryption]` to add it. ValueError: If `key` is empty or not bytes / list of bytes.

Source code in `lazybridge/store/encryption.py`

```python
def __init__(self, inner: Store, *, key: bytes | list[bytes]) -> None:
    try:
        from cryptography.fernet import Fernet, MultiFernet
    except ImportError as e:
        raise ImportError(
            "EncryptedStoreAdapter requires the 'cryptography' package.\n"
            "  Install it via the encryption extra:\n"
            "    pip install 'lazybridge[encryption]'\n"
            "  Or directly:\n"
            "    pip install cryptography"
        ) from e

    if isinstance(key, bytes):
        self._fernet: Fernet | MultiFernet = Fernet(key)
    elif isinstance(key, list):
        if not key:
            raise ValueError("EncryptedStoreAdapter: key list is empty.")
        if not all(isinstance(k, bytes) for k in key):
            raise ValueError(
                f"EncryptedStoreAdapter: every key in the rotation list must be bytes; "
                f"got types {[type(k).__name__ for k in key]}."
            )
        self._fernet = MultiFernet([Fernet(k) for k in key])
    else:
        raise ValueError(f"EncryptedStoreAdapter: key must be bytes or list[bytes], got {type(key).__name__}.")

    self._inner = inner
```

#### items

```python
items(*, prefix: str | None = None) -> list[tuple[str, Any]]
```

Return `(key, decrypted-value)` pairs, optionally restricted to keys starting with `prefix`.

Source code in `lazybridge/store/encryption.py`

```python
def items(self, *, prefix: str | None = None) -> list[tuple[str, Any]]:
    """Return ``(key, decrypted-value)`` pairs, optionally restricted to
    keys starting with ``prefix``."""
    return [(k, self._decrypt(v)) for k, v in self._inner.items(prefix=prefix)]
```

#### compare_and_swap

```python
compare_and_swap(key: str, expected: Any, new: Any, *, agent_id: str | None = None) -> bool
```

CAS over the decrypted value.

Fernet ciphertexts embed a nonce, so two encryptions of the same plaintext are never equal. Delegating to the inner Store's CAS would therefore always fail. We decrypt the current ciphertext, compare against `expected` in plaintext space, then write the new encrypted value — *as long as* the ciphertext hasn't changed since the read.

This double-check is what makes the adapter race-safe: another writer may have updated the value between our decrypt and the inner CAS attempt, in which case the inner CAS returns `False` and we propagate that.

Source code in `lazybridge/store/encryption.py`

```python
def compare_and_swap(
    self,
    key: str,
    expected: Any,
    new: Any,
    *,
    agent_id: str | None = None,
) -> bool:
    """CAS over the decrypted value.

    Fernet ciphertexts embed a nonce, so two encryptions of the
    same plaintext are never equal.  Delegating to the inner
    Store's CAS would therefore always fail.  We decrypt the
    current ciphertext, compare against ``expected`` in plaintext
    space, then write the new encrypted value — *as long as* the
    ciphertext hasn't changed since the read.

    This double-check is what makes the adapter race-safe: another
    writer may have updated the value between our decrypt and the
    inner CAS attempt, in which case the inner CAS returns
    ``False`` and we propagate that.
    """
    current_token = self._inner.read(key)
    if current_token is None:
        current_plain = None
    else:
        current_plain = self._decrypt(current_token)

    if not _plain_eq(current_plain, expected):
        return False

    return self._inner.compare_and_swap(
        key,
        expected=current_token,
        new=self._encrypt(new),
        agent_id=agent_id,
    )
```

# Tool family

Wrap any callable as a `Tool` for an `Agent`. The `Tool.wrap()` classmethod is the canonical multi-input factory (callable / `Agent` / existing `Tool`); `Tool(...)` is the explicit constructor used when you want to set every field by hand. `ToolProvider` is the protocol for expandable tool catalogues (MCP servers etc.). `NativeTool` enumerates provider-hosted server-side tools.

The module-level `lazybridge.tool` (lowercase) is a thin backwards-compat alias for `Tool.wrap` — existing imports keep working, new code should prefer the classmethod.

For narrative usage see [Guides → Basic → Tool](https://core.lazybridge.com/guides/basic/tool/index.md) and [Guides → Basic → Native tools](https://core.lazybridge.com/guides/basic/native-tools/index.md).

### lazybridge.Tool

```python
Tool(func: Callable, *, name: str | None = None, description: str | None = None, mode: Literal['signature', 'llm', 'hybrid'] = 'signature', schema_llm: Any | None = None, strict: bool = False, returns_envelope: bool = False, agent_memory: Any | None = None, agent_store: Any | None = None)
```

Wraps any Python callable as an LLM-accessible tool.

Pass raw functions directly; Tool auto-wraps them on the agent level. Use Tool(fn, ...) only when you need explicit configuration.

Source code in `lazybridge/tools.py`

```python
def __init__(
    self,
    func: Callable,
    *,
    name: str | None = None,
    description: str | None = None,
    mode: Literal["signature", "llm", "hybrid"] = "signature",
    schema_llm: Any | None = None,
    strict: bool = False,
    returns_envelope: bool = False,
    agent_memory: Any | None = None,
    agent_store: Any | None = None,
) -> None:
    if mode not in ("signature", "llm", "hybrid"):
        # ``"auto"`` was the 0.7-era default — removed in 0.7.9.
        # Reject it eagerly so the failure surfaces at construction
        # time, not lazily at the first ``definition()`` call.
        raise ValueError(
            f"Tool(mode={mode!r}) is invalid.  Accepted values: "
            f"'signature' (default), 'hybrid', 'llm'.  "
            f"The legacy 'auto' value was removed in 0.7.9; pass "
            f"'hybrid' or 'llm' explicitly to opt into LLM-driven "
            f"schema generation."
        )
    self.func = func
    self.name = name or func.__name__
    self.description = description
    self.mode = mode
    self.schema_llm = schema_llm
    self.strict = strict
    #: When ``True``, ``func`` returns an ``Envelope`` instead of a
    #: plain Python value.  Engines aware of this hint will preserve
    #: the inner envelope's metadata (tokens / cost / error) when
    #: aggregating results from a turn's tool calls.  The flag is
    #: set automatically by ``_wrap_tool`` for Agents wrapped via
    #: ``agent.as_tool()``.
    self.returns_envelope = returns_envelope
    #: Live reference to the source agent's Memory, set by ``agent.as_tool()``.
    #: Resolved lazily at step execution time via ``from_memory("name")``.
    #: None for plain function tools.
    self.agent_memory = agent_memory
    #: Live reference to the source agent's Store, set by ``agent.as_tool()``.
    #: Used by ``from_agent("name")`` to read the agent's last output.
    #: None for plain function tools.
    self.agent_store = agent_store
    self._definition: ToolDefinition | None = None
    self._lock = threading.Lock()
```

#### from_schema

```python
from_schema(name: str, description: str, parameters: dict[str, Any], func: Callable[..., Any], *, strict: bool = False, returns_envelope: bool = False) -> Tool
```

Create a Tool with a pre-built JSON Schema for parameters.

Use this when the schema is already known (from MCP, OpenAPI, a third-party tool registry, ...) and signature introspection would either be unavailable or produce the wrong shape.

`parameters` must be a JSON Schema object (the same shape that `ToolDefinition.parameters` carries).

Source code in `lazybridge/tools.py`

```python
@classmethod
def from_schema(
    cls,
    name: str,
    description: str,
    parameters: dict[str, Any],
    func: Callable[..., Any],
    *,
    strict: bool = False,
    returns_envelope: bool = False,
) -> Tool:
    """Create a Tool with a pre-built JSON Schema for parameters.

    Use this when the schema is already known (from MCP, OpenAPI, a
    third-party tool registry, ...) and signature introspection would
    either be unavailable or produce the wrong shape.

    ``parameters`` must be a JSON Schema object (the same shape that
    ``ToolDefinition.parameters`` carries).
    """
    tool = cls.__new__(cls)
    tool.func = func
    tool.name = name
    tool.description = description
    tool.mode = "signature"  # unused — we set ``_definition`` directly
    tool.schema_llm = None
    tool.strict = strict
    tool.returns_envelope = returns_envelope
    tool.agent_memory = None
    tool.agent_store = None
    tool._definition = ToolDefinition(
        name=name,
        description=description,
        parameters=parameters,
        strict=strict,
    )
    tool._lock = threading.Lock()
    return tool
```

#### run_sync

```python
run_sync(**kwargs: Any) -> Any
```

Blocking tool invocation.

Handles three cases so that callers never see a stray coroutine:

- plain sync function → called directly.
- async function → executed inside the current event loop if one is running (a worker thread hops out of it), otherwise on a fresh `asyncio.run` loop. Needed because :meth:`Agent.as_tool` wraps the agent's `.run()` coroutine into `Tool.func` — `SupervisorEngine` / REPL callers were previously getting `"<coroutine object _run at 0x...>"` instead of the result.

Source code in `lazybridge/tools.py`

```python
def run_sync(self, **kwargs: Any) -> Any:
    """Blocking tool invocation.

    Handles three cases so that callers never see a stray coroutine:

    * plain sync function → called directly.
    * async function → executed inside the current event loop if one
      is running (a worker thread hops out of it), otherwise on a
      fresh ``asyncio.run`` loop.  Needed because :meth:`Agent.as_tool`
      wraps the agent's ``.run()`` coroutine into ``Tool.func`` —
      ``SupervisorEngine`` / REPL callers were previously getting
      ``"<coroutine object _run at 0x...>"`` instead of the result.
    """
    if not asyncio.iscoroutinefunction(self.func):
        return self.func(**kwargs)

    coro_factory = lambda: self.func(**kwargs)  # noqa: E731
    # ``asyncio.get_running_loop`` is the forward-compatible check
    # (it raises cleanly when no loop is running, unlike the
    # deprecated ``get_event_loop``).  When a loop is running we
    # hop to a worker thread so we never try to nest.
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        return asyncio.run(coro_factory())
    # Propagate the caller's contextvars context (OTel, structured
    # logging, request IDs) into the worker loop.  A raw
    # ``asyncio.run`` on a fresh thread would start in an empty
    # context and silently break observability for sync callers.
    import concurrent.futures
    import contextvars

    ctx = contextvars.copy_context()

    def _run() -> Any:
        return ctx.run(asyncio.run, coro_factory())

    with concurrent.futures.ThreadPoolExecutor(max_workers=1) as pool:
        return pool.submit(_run).result()
```

#### wrap

```python
wrap(obj: Any, *, name: str | None = None, description: str | None = None, mode: Literal['signature', 'hybrid', 'llm'] = 'signature', schema_llm: Any | None = None, strict: bool = _UNSET_BOOL) -> Tool
```

Canonical multi-input factory — accepts a callable, an Agent, or an existing :class:`Tool`, and returns a properly wrapped `Tool`.

**For Python functions** — `name` is required so Plan steps, tool maps, and LLM calls all share the same stable identifier::

```text
search = Tool.wrap(search_web, name="search", description="Search the web.")
researcher = Agent(name="research", engine=LLMEngine(...), tools=[search])
```

**For Agents** — the canonical path is `tools=[agent]` directly; `Tool.wrap` is useful when you need a local alias::

```text
Tool.wrap(researcher, name="deep_research")
```

**For existing Tools** — returns the object unchanged (no overrides) or clones it with the specified overrides (non-mutating)::

```text
search_v2 = Tool.wrap(search, name="web_search")
```

###### Parameters

obj: A callable, :class:`Agent`, or existing :class:`Tool` to wrap. name: Required for callables. Optional alias for agents and Tools. description: Human-readable description forwarded to the LLM. mode: Schema generation mode. `"signature"` (default) introspects the function signature and docstring deterministically. Pass `"hybrid"` (signature + LLM-enriched descriptions) or `"llm"` (full LLM-inferred schema) explicitly when the signature alone is insufficient — both require `schema_llm=` to be set. schema_llm: Engine used when `mode="hybrid"` or `mode="llm"`. strict: Enable JSON Schema strict mode.

###### Notes

Module-level :func:`tool` is a thin alias for backwards compatibility and is kept indefinitely; new code should prefer `Tool.wrap`.

Source code in `lazybridge/tools.py`

```python
@classmethod
def wrap(
    cls,
    obj: Any,
    *,
    name: str | None = None,
    description: str | None = None,
    mode: Literal["signature", "hybrid", "llm"] = "signature",
    schema_llm: Any | None = None,
    strict: bool = _UNSET_BOOL,  # type: ignore[assignment]
) -> Tool:
    """Canonical multi-input factory — accepts a callable, an Agent, or an
    existing :class:`Tool`, and returns a properly wrapped ``Tool``.

    **For Python functions** — ``name`` is required so Plan steps, tool
    maps, and LLM calls all share the same stable identifier::

        search = Tool.wrap(search_web, name="search", description="Search the web.")
        researcher = Agent(name="research", engine=LLMEngine(...), tools=[search])

    **For Agents** — the canonical path is ``tools=[agent]`` directly;
    ``Tool.wrap`` is useful when you need a local alias::

        Tool.wrap(researcher, name="deep_research")

    **For existing Tools** — returns the object unchanged (no overrides) or
    clones it with the specified overrides (non-mutating)::

        search_v2 = Tool.wrap(search, name="web_search")

    Parameters
    ----------
    obj:
        A callable, :class:`Agent`, or existing :class:`Tool` to wrap.
    name:
        Required for callables.  Optional alias for agents and Tools.
    description:
        Human-readable description forwarded to the LLM.
    mode:
        Schema generation mode.  ``"signature"`` (default) introspects the
        function signature and docstring deterministically.  Pass
        ``"hybrid"`` (signature + LLM-enriched descriptions) or ``"llm"``
        (full LLM-inferred schema) explicitly when the signature alone
        is insufficient — both require ``schema_llm=`` to be set.
    schema_llm:
        Engine used when ``mode="hybrid"`` or ``mode="llm"``.
    strict:
        Enable JSON Schema strict mode.

    Notes
    -----
    Module-level :func:`tool` is a thin alias for backwards compatibility
    and is kept indefinitely; new code should prefer ``Tool.wrap``.
    """
    # ── Case 1: already a Tool ──────────────────────────────────────────
    if isinstance(obj, Tool):
        has_overrides = (
            name is not None
            or description is not None
            or mode != "signature"
            or schema_llm is not None
            or strict is not _UNSET_BOOL
        )
        if not has_overrides:
            return obj
        return cls(
            obj.func,
            name=name if name is not None else obj.name,
            description=description if description is not None else obj.description,
            mode=mode if mode != "signature" else obj.mode,
            schema_llm=schema_llm if schema_llm is not None else obj.schema_llm,
            strict=obj.strict if strict is _UNSET_BOOL else bool(strict),
            returns_envelope=obj.returns_envelope,
            agent_memory=obj.agent_memory,
            agent_store=obj.agent_store,
        )

    # ── Case 2: Agent-like ──────────────────────────────────────────────
    if getattr(obj, "_is_lazy_agent", False):
        # An explicit alias passed here is always accepted.
        # Without an alias, the agent must have _name_explicit=True.
        if name is None and getattr(obj, "_name_explicit", True) is False:
            # Only reject real Agent instances that set _name_explicit=False.
            # Duck-typed agents (MockAgent, custom subclasses) default to True.
            agent_name = getattr(obj, "name", repr(obj))
            raise ValueError(
                f"Agent used as a tool must have an explicit name=...\n"
                f"The agent currently has name={agent_name!r} "
                f"(derived from the model or left as the default).\n\n"
                f"Set an explicit name:\n"
                f'    Agent(name="research", engine=LLMEngine(...))\n\n'
                f"Or pass an alias to the factory:\n"
                f'    Tool.wrap(agent, name="research")'
            )
        effective_name = name or getattr(obj, "name", None)
        if not effective_name or not str(effective_name).strip():
            raise ValueError(
                "Agent used as a tool must have an explicit name=...\n"
                "Example:\n"
                '    Agent(name="research", engine=LLMEngine(...))'
            )
        if hasattr(obj, "as_tool"):
            return obj.as_tool(effective_name, description=description)
        return _agent_as_tool_named(obj, effective_name, description)

    # ── Case 3: plain callable ──────────────────────────────────────────
    if callable(obj):
        if name is None:
            fn_name = getattr(obj, "__name__", repr(obj))
            raise ValueError(
                f"Tool.wrap() requires an explicit name=... for callables.\n"
                f'Example: Tool.wrap({fn_name!r}, name="{fn_name}")'
            )
        strict_val = False if strict is _UNSET_BOOL else bool(strict)  # type: ignore[arg-type]
        return cls(
            obj,
            name=name,
            description=description,
            mode=mode,
            schema_llm=schema_llm,
            strict=strict_val,
        )

    raise TypeError(f"Tool.wrap() cannot wrap {type(obj).__name__!r}")
```

### lazybridge.tool

```python
tool(obj: Any, *, name: str | None = None, description: str | None = None, mode: Literal['signature', 'hybrid', 'llm'] = 'signature', schema_llm: Any | None = None, strict: bool = _UNSET_BOOL) -> Tool
```

Backwards-compatibility alias for :meth:`Tool.wrap`.

New code should call `Tool.wrap(obj, name=...)` — it lives on the class alongside the explicit constructor, mirroring Python stdlib factories like :meth:`dict.fromkeys` and :meth:`datetime.datetime.fromisoformat`. The lowercase :func:`tool` is kept indefinitely so existing imports (`from lazybridge import tool`) continue to work; no deprecation timer is set.

Source code in `lazybridge/tools.py`

```python
def tool(
    obj: Any,
    *,
    name: str | None = None,
    description: str | None = None,
    mode: Literal["signature", "hybrid", "llm"] = "signature",
    schema_llm: Any | None = None,
    strict: bool = _UNSET_BOOL,  # type: ignore[assignment]
) -> Tool:
    """Backwards-compatibility alias for :meth:`Tool.wrap`.

    New code should call ``Tool.wrap(obj, name=...)`` — it lives on the class
    alongside the explicit constructor, mirroring Python stdlib factories
    like :meth:`dict.fromkeys` and :meth:`datetime.datetime.fromisoformat`.
    The lowercase :func:`tool` is kept indefinitely so existing imports
    (``from lazybridge import tool``) continue to work; no deprecation
    timer is set.
    """
    return Tool.wrap(
        obj,
        name=name,
        description=description,
        mode=mode,
        schema_llm=schema_llm,
        strict=strict,
    )
```

### lazybridge.ToolProvider

Bases: `Protocol`

A `tools=[...]` entry that expands itself into one or more Tools.

Implementors set `_is_lazy_tool_provider = True` and define `as_tools() -> list[Tool]`. `MCPServer` and `ExternalToolProvider` both satisfy this protocol structurally; custom providers (OpenAPI imports, internal tool registries, etc.) can do the same — drop the instance into `Agent(tools=[provider])` and `build_tool_map` will expand it on construction.

### lazybridge.NativeTool

Bases: `StrEnum`

Provider-native server-side tools (run on provider infrastructure).
# Errors

# Errors

Two ways LazyBridge surfaces failures:

1. **Error envelopes.** When an agent run fails *but the framework catches the exception*, the agent returns an `Envelope` with `result.ok == False` and `result.error` populated. The agent call itself does not raise — read `result.error.type` / `result.error.message`.
1. **Raised exceptions.** When the failure is structural (broken plan DAG, concurrent-run collision, schema-build issue), the framework raises a typed exception you catch with `try/except`.

The convention is "raise on construction, return error on runtime" — but there are exceptions (see `PlanCompileError` notes below).

## Error envelopes (`result.error.type` values)

These are the `type` strings you see when checking `result.ok` is `False`. The agent call itself never raises.

| `type=`                  | Cause                                                                                         | Diagnosis                                                                                                                                                                    | Fix                                                                                                                                                                 |
| ------------------------ | --------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `MaxTurnsExceeded`       | `LLMEngine` ran the tool-calling loop for `max_turns` rounds without producing a final answer | Likely an infinite tool-call loop (the model keeps re-asking for the same tool). Inspect the session events for the last few `TOOL_CALL` payloads                            | Bump `LLMEngine(max_turns=N)` for genuinely long tasks; tighten the system prompt; or add a `verify=` judge that rejects non-final responses                        |
| `MaxIterationsExceeded`  | `Plan` exceeded `max_iterations` while routing                                                | Routing cycle (`A → B → A` self-correction loop with no termination predicate) or under-sized cap                                                                            | Lower the cap during dev to fail fast; add a counter via `writes=` and a predicate that breaks the loop; raise `Plan(max_iterations=N)` when the loop is legitimate |
| `GuardBlocked`           | A `Guard` returned `allowed=False` on input or output                                         | `result.error.message` carries the guard's verdict text                                                                                                                      | Either fix the input/output to satisfy the guard, or relax the guard if it's over-broad                                                                             |
| `ToolArgumentParseError` | The LLM emitted tool arguments that don't match the tool's JSON schema                        | Provider-side strict mode rejected the call. Inspect the tool's `definition().parameters` and the model's emitted arguments in the session event                             | Loosen the tool's `strict=` flag; clarify the docstring / type hints so the model emits valid args; or add an LLM-fixed retry                                       |
| `TimeoutError`           | `Agent(timeout=N)` deadline expired                                                           | The whole run exceeded the budget                                                                                                                                            | Raise the timeout, or cap individual tool calls with `LLMEngine(tool_timeout=N)`                                                                                    |
| `PlanPaused`             | A `Plan` step raised `PlanPaused` to halt the pipeline cooperatively                          | Inspect `result.error.message` for the step name + the user-supplied reason. The checkpoint stores `status="paused"` so a `resume=True` rerun will re-invoke the paused step | Build the same `Plan` with `resume=True` and re-invoke when the external precondition is met (webhook arrived, human approved, etc.)                                |

## Raised exceptions (`try/except` surface)

These propagate as Python exceptions. Catch them when constructing plans, registering providers, or hitting strict-mode features.

| Exception                                                                                  | Raised when                                                                                                                                                                                                                                                               | Where                                                                                                                                                                                                                                           | Fix                                                                                                                                                                                                                                                   |
| ------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `PlanCompileError`                                                                         | DAG validation fails: duplicate step names, unknown `routes=` targets, malformed `routes_by` Literal type, dangling `from_step` / `from_parallel` / `from_parallel_all` references, mid-band parallel target, `after_branches` referencing a step that doesn't come after | At `Agent(engine=Plan(...), tools=[...])` construction                                                                                                                                                                                          | Fix the offending step. The error message names both the offending step and the violation                                                                                                                                                             |
| `PlanRuntimeError` (subclasses `RuntimeError`)                                             | A `routes={}` predicate raised an exception during runtime evaluation                                                                                                                                                                                                     | Inside `Plan.run`, after the routing step's target completes; engine wraps the underlying exception with the offending step + target named                                                                                                      | Fix the predicate. Best practice: keep predicates pure functions of the envelope's payload; if you need exception handling, do it inside the predicate and return `False` rather than letting it propagate                                            |
| `ConcurrentPlanRunError` (subclasses `RuntimeError`)                                       | Two `Plan` runs share a `checkpoint_key` with `on_concurrent="fail"` (default)                                                                                                                                                                                            | Runtime CAS collision in `_save_checkpoint` / `_claim_checkpoint`                                                                                                                                                                               | Use a unique `checkpoint_key` per concurrent run, or `on_concurrent="fork"` for fan-out workflows (incompatible with `resume=True`)                                                                                                                   |
| `ToolTimeoutError`                                                                         | A tool exceeded `LLMEngine(tool_timeout=N)`                                                                                                                                                                                                                               | Runtime, inside `LLMEngine` tool dispatch                                                                                                                                                                                                       | The engine catches this internally and reports it to the model as `[TOOL_TIMEOUT] …` in the tool result, so the model can recover; the agent run does not abort. Catch only if you wrap the engine yourself                                           |
| `StreamStallError`                                                                         | A streaming response went idle longer than `LLMEngine(stream_idle_timeout=N, default 90s)`                                                                                                                                                                                | Runtime, during `agent.stream(...)` or the engine's stream consumer                                                                                                                                                                             | Pair with `agent.run(...)` instead for non-interactive use; bump `stream_idle_timeout` only if you trust the upstream provider (passing `None` disables it — emits a one-shot `UserWarning`)                                                          |
| `GuardError`                                                                               | Some `Guard` integrations raise this for hard policy failures                                                                                                                                                                                                             | Runtime                                                                                                                                                                                                                                         | Catch and surface to the user; or replace the guard with one that returns `GuardAction(allowed=False, message=...)` for graceful rejection                                                                                                            |
| `UnsupportedNativeToolError` (subclasses `ValueError`)                                     | The provider doesn't implement a requested `NativeTool` AND `strict_native_tools=True`                                                                                                                                                                                    | At provider time, when the request includes the unsupported tool                                                                                                                                                                                | Either remove the native tool from `native_tools=[...]`, switch to a provider that supports it (the message lists supported alternatives), or accept the warning-and-drop default by leaving `strict_native_tools=False`                              |
| `ValueError("dangerous native tool ... requires allow_dangerous_native_tools=True")`       | Constructing an `Agent` or `LLMEngine` with `NativeTool.CODE_EXECUTION` or `NativeTool.COMPUTER_USE` without explicit opt-in                                                                                                                                              | At `Agent(...)` / `LLMEngine(...)` construction                                                                                                                                                                                                 | Pass `allow_dangerous_native_tools=True` on the **outermost** constructor (Agent re-validates so a pre-built engine can't bypass the check). The default `False` is intentional — these two tools have broad access and need explicit acknowledgement |
| `UnsupportedFeatureError` (subclasses `ValueError`)                                        | The model doesn't support the multimodal modality the request includes (vision / audio) AND `strict_multimodal=True`                                                                                                                                                      | At provider time                                                                                                                                                                                                                                | Drop the attachment, switch to a multimodal model, or accept the warning-and-drop default                                                                                                                                                             |
| `ToolArgumentValidationError` (subclasses `ValueError`)                                    | A tool's args fail the auto-generated Pydantic model's validation                                                                                                                                                                                                         | At tool dispatch                                                                                                                                                                                                                                | Loosen the type hint on the tool function, fix the model's emitted args via prompt engineering, or pass `strict=False` on the `Tool`                                                                                                                  |
| `ToolSchemaBuildError` (subclasses `RuntimeError`)                                         | The schema builder couldn't infer a JSON schema for the tool function                                                                                                                                                                                                     | At `Tool(...)` construction or first `Agent(tools=[...])` use                                                                                                                                                                                   | Add type hints to all parameters; switch to `mode="llm"` or `mode="hybrid"` for legacy callables; or pass a pre-built schema via `Tool.from_schema(...)`                                                                                              |
| `ExternalToolError` (subclasses `RuntimeError`)                                            | An external tool registry or execution call failed (network error, malformed response, registry returned non-list). Carries optional `.status` (HTTP code) and `.body` (raw response). Import: `from lazytools.connectors.gateway import ExternalToolError`               | At runtime when using `ExternalToolProvider`                                                                                                                                                                                                    | Check the registry URL and response schema; inspect `str(exc)` for the specific path and failure mode; inspect `.status` / `.body` for provider-side details                                                                                          |
| `StructuredOutputError` / `StructuredOutputParseError` / `StructuredOutputValidationError` | The LLM produced output that failed `output=PydanticModel` validation, exhausting `max_output_retries`                                                                                                                                                                    | Runtime, after retries                                                                                                                                                                                                                          | Tighten the system prompt; relax the model (less strict types or `Optional` fields); raise `Agent(max_output_retries=N)`; or accept errors via `Envelope.error` checking                                                                              |
| `PlanPaused` (subclasses `BaseException`)                                                  | A `Step` target raised `PlanPaused` to signal a cooperative halt                                                                                                                                                                                                          | The engine catches it and writes a `status="paused"` checkpoint. The agent call returns an error envelope (NOT a re-raise). Catch only if you wrap the engine yourself or want to short-circuit your own callable that's about to invoke a step | Don't catch it in user code unless you have a specific reason — the engine handles it cleanly. Subclasses `BaseException` (not `Exception`) so `except Exception` won't accidentally swallow it                                                       |

## Common diagnosis flow

1. **`result.ok` is `False`?** Read `result.error.type` and `result.error.message`. The type maps to one row in the first table above.
1. **Got an exception, not an envelope?** It's one of the rows in the second table. Check the message — every framework exception names the offending step / tool / model in the message body.
1. **Hit a `MaxIterationsExceeded` or `MaxTurnsExceeded`?** Pull the session's tool-call events for the last few rounds — `session.events.query(event_type=EventType.TOOL_CALL)` (or filter by run id). Loops nearly always reveal themselves as the same tool name calling repeatedly.
1. **Hit a `PlanCompileError`?** The error message names the offending step — fix the DAG shape (duplicate name, unknown target, malformed Literal, …).
1. **Hit a `PlanRuntimeError`?** A `routes=` predicate raised an exception during evaluation. The message names the offending step + target + the underlying exception class — fix the predicate.
1. **Hit a `ConcurrentPlanRunError`?** Your `checkpoint_key` is shared between two runs. Either pass a unique key per run, or switch to `on_concurrent="fork"` (giving up `resume=True`).
1. **Hit a provider-native exception?** Anthropic / OpenAI / Google SDK exceptions propagate as-is — LazyBridge does not wrap them. The `Executor` retries them when `provider.is_retryable(exc)` returns `True`; otherwise they reach you verbatim.

## Best practices

- **`result.ok` first, `.payload` second.** Production code should always check `result.ok` before reading the payload. An error envelope's payload is whatever was last produced (often `None` or a partial result).
- **Bound everything.** `Agent(timeout=N)`, `LLMEngine(max_turns=N, tool_timeout=N, stream_idle_timeout=N)`, `Plan(max_iterations=N)`, `verify=judge` with `max_verify=N`. Every loop in the framework has a budget; pick defensible defaults rather than relying on `None`.
- **Fail loud at construction.** Plan validation and provider registration mistakes should surface at `Agent(...)` / `Plan(...)` time. If you're catching `PlanCompileError` routinely, your construction code is probably wrong, not your inputs.
- **Use `Session` for forensics.** A failing run is opaque without an event log. Even in development, pair an `Agent(verbose=True)` or `Session(console=True)` with the run; in production, `JsonFileExporter` or `OTelExporter` give you the same per-step trace post-mortem.

## See also

- [Envelope](https://core.lazybridge.com/guides/basic/envelope/index.md) — `Envelope.ok` / `Envelope.error` semantics, `ErrorInfo` shape, retryable flag.
- [Session](https://core.lazybridge.com/guides/mid/session/index.md) — `events.query(...)` for pulling the trace of a failing run.
- [Exporters](https://core.lazybridge.com/guides/full/exporters/index.md) — `EventType.TOOL_TIMEOUT` vs `EventType.TOOL_ERROR`; per-event-type exporter wiring.
- [Checkpoint & resume](https://core.lazybridge.com/guides/full/checkpoint/index.md) — `ConcurrentPlanRunError` and the `on_concurrent` policy table.
- [Plan](https://core.lazybridge.com/guides/full/plan/index.md) — `PlanCompileError` taxonomy and the `max_iterations` safety net.
- [BaseProvider](https://core.lazybridge.com/guides/advanced/base-provider/index.md) — `UnsupportedNativeToolError` / `UnsupportedFeatureError` strict-mode behaviour.
# Why

# Why LazyBridge

Five things LazyBridge does differently — each grounded in code, not marketing.

______________________________________________________________________

## 1. Recursive composition

In LazyBridge, `Plan` is an engine. An `Agent(engine=Plan(...))` is a valid `Tool`. That means pipelines nest with no special syntax at any depth — you use the same `tools=[...]` parameter whether you're adding a plain function or a ten-step sub-pipeline.

```python
from lazybridge import Agent, LLMEngine, Plan, Step, Session, from_step

search    = Agent(engine=LLMEngine("gpt-5.4-mini"),      name="search")
summarise = Agent(engine=LLMEngine("gemini-2.5-pro"),    name="summarise")
writer    = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="write")

research = Agent(
    engine=Plan(Step("search"), Step("summarise")),
    tools=[search, summarise], name="research",
)
article = Agent(
    engine=Plan(Step("research"),
                Step("write", context=from_step("research"))),
    tools=[research, writer], session=Session(),
)
print(article("AI agents in 2026").text())
```

Cost, token counts, and OpenTelemetry spans roll up automatically across all levels via `Envelope.metadata.nested_*`. You pay no composition tax.

**Deep dive:** [Layered composition](https://core.lazybridge.com/concepts/layered-composition/index.md)

______________________________________________________________________

## 2. Plans fail at construction

Other frameworks surface wiring errors at runtime — after spending money on LLM calls. LazyBridge raises `PlanCompileError` the moment you construct a `Plan`, before any inference happens.

```python
from lazybridge import Agent, LLMEngine, Plan, Step, from_step

writer = Agent(engine=LLMEngine("claude-sonnet-4-6"), name="write")

# This raises PlanCompileError immediately — "search" is not in tools=[]
plan = Plan(
    Step("search"),
    Step("write", context=from_step("search")),
)
```

The compiler catches:

- Duplicate step names
- Tools referenced in `Step(target=...)` that are not in `tools=[]`
- Forward references to steps not yet declared
- Sentinels (`from_step`, `from_parallel`, `from_parallel_all`) pointing at incompatible neighbours in the same parallel band
- `from_parallel_all` on a step that is not a band-start
- Type drift between consecutive steps (structural mismatch)
- Malformed `routes=` / `routes_by=` declarations
- Invalid `after_branches=` references

All before a single token is generated.

**Deep dive:** [Plan guide](https://core.lazybridge.com/guides/full/plan/index.md)

______________________________________________________________________

## 3. One contract for everything

`Tool.wrap()` accepts anything that exposes a useful capability. There is one interface whether the capability is a function, an agent, a Plan-backed pipeline, an MCP server, or a native provider tool:

| Capability           | How it enters                        | What the agent sees |
| -------------------- | ------------------------------------ | ------------------- |
| Python function      | `Tool.wrap(fn)` or `tools=[fn]`      | `Tool`              |
| `Agent`              | `agent.as_tool()` or `tools=[agent]` | `Tool`              |
| `Agent(engine=Plan)` | `tools=[pipeline]`                   | `Tool`              |
| MCP server           | `tools=[MCP.stdio(...)]`             | `Tool`              |
| Native provider tool | `tools=[NativeTool(...)]`            | `Tool`              |
| Guarded variant      | `agent.as_tool(verify=judge)`        | `Tool`              |

The LLM engine sees the same schema regardless of what backs the tool. Swap a stub function for a full sub-pipeline without touching the outer agent.

**Deep dive:** [Everything is a tool](https://core.lazybridge.com/concepts/everything-is-a-tool/index.md)

______________________________________________________________________

## 4. Quality control at every node

Structured output, runtime validation, and cross-model verification compose at any level of the pipeline — not just at the top.

```python
from pydantic import BaseModel
from lazybridge import Agent, LLMEngine

class Report(BaseModel):
    title: str
    summary: str
    word_count: int

judge = Agent(engine=LLMEngine("gemini-2.5-pro"), name="judge")

writer = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
    output=Report,                          # schema-validated; re-prompts on failure
    output_validator=lambda r: r.word_count > 50,  # runtime check
    verify=judge,                           # cross-model quality gate
    max_verify=3,                           # up to 3 retry cycles
    name="write",
)
```

`output=` validates the schema and re-prompts automatically on parse failure. `output_validator=` runs an arbitrary Python check after parsing. `verify=` passes the output to a second agent for quality assessment; feedback flows via context, never onto the original task, and loops up to `max_verify` times. All three stack.

**Deep dive:** [verify= guide](https://core.lazybridge.com/guides/mid/verify/index.md)

______________________________________________________________________

## 5. Observability is native

Session and event logging are on by default. OpenTelemetry export with GenAI semantic conventions (`gen_ai.*` attributes) requires one line:

```python
from lazybridge import Agent, LLMEngine, Session
from lazybridge.ext.otel import OTelExporter

agent = Agent(
    engine=LLMEngine("claude-sonnet-4-6"),
    session=Session(exporters=[OTelExporter()]),
)
```

Every run emits spans carrying `gen_ai.system`, `gen_ai.request.model`, `gen_ai.usage.input_tokens`, `gen_ai.usage.output_tokens`, `nesting_level`, and parent-child span links. Cost rolls up across nested agents. SQLite-backed `Store` gives you a local event log with back-pressure policies and batching. Both are opt-out, not opt-in.

**Deep dive:** [Session guide](https://core.lazybridge.com/guides/mid/session/index.md)

______________________________________________________________________

## Next steps

[**Quickstart →**](https://core.lazybridge.com/quickstart/index.md) [Comparison with other frameworks](https://core.lazybridge.com/comparison/index.md) [Layered composition deep-dive](https://core.lazybridge.com/concepts/layered-composition/index.md)
# Comparison

# LazyBridge vs LangGraph vs CrewAI

Three different frameworks, three different bets. This page gives you a **decision framework** for picking the right tool for your project and is honest about when LazyBridge *isn't* it.

> The three frameworks optimise for **different things**. There's no winner in the abstract — there's the one that fits *your* project's shape, team, and constraints.

Comparing more than three?

This page is the focused, decision-tree head-to-head of LangGraph, CrewAI and LazyBridge. For the broader category view — Pydantic AI, LlamaIndex, Haystack, AutoGen, Microsoft Agent Framework and Google ADK alongside these — start from [Best Python AI Agent Frameworks in 2026](https://core.lazybridge.com/best-python-ai-agent-frameworks/index.md).

______________________________________________________________________

## What each framework optimises for

### LangGraph

**Optimised for: complex, stateful, long-running agentic workflows where graph control is the dominant concern.**

LangGraph is a graph DSL. You declare nodes, edges, state schemas, and reducers. The framework runs your graph; the framework gives you very fine-grained control over what happens between nodes, when state persists, when to interrupt, and what to stream.

It shines when:

- The workflow shape is genuinely a graph — multiple entry points, complex branching, cycles, fan-in/fan-out at non-trivial scales
- You need first-class **streaming, checkpointing, and resumability** — long-running agents that survive restarts, multi-day workflows, HITL with persisted state
- You're already in the **LangChain ecosystem** (300+ integrations) and want continuity with `langchain_*` packages
- **LangSmith** observability is a requirement (custom tracing UI, evaluation dashboards, prompt management)

It costs you:

- Verbose for simple cases — the graph wiring is overhead when your workflow is "do A, then B, then C"
- A learning curve — `StateGraph`, `Annotated[...]` reducers, `Send`, `Command`, conditional edges, checkpointer config
- Transitive dependencies through `langchain_core`, `langchain_*` provider adapters, and any tools you wire in

### CrewAI

**Optimised for: business-style "team of experts" mental models, where the abstractions match how humans describe the work.**

CrewAI builds on the metaphor *role, goal, backstory*. An Agent has a role like "Senior Researcher". A Task has a description and an expected output. A Crew runs Tasks via a `Process` — sequential or hierarchical.

It shines when:

- The use case fits the **roles + tasks + crew** metaphor well (research teams, content production, business workflows, structured pipelines that map cleanly to org-chart roles)
- You want a **strong opinion** that gives you guardrails and patterns out of the box — not a kit of primitives to assemble
- **Non-engineer stakeholders** read the agent code; the role/goal metaphor is more legible than "an Agent with system prompt and tools"

It costs you:

- Lock-in to the Crew/Task abstraction — escaping to the underlying provider API isn't a smooth path
- **No native conditional routing** — you reach for plain Python branching (Step 10's CrewAI comparison)
- **No parallel execution** of agents at the framework level — you drop to `asyncio.gather`
- HIL is limited to `human_input=True` on a Task; no web UI, no typed Pydantic forms, no timeout/default story
- The opinionated abstractions can chafe when your shape is **not** a team of experts

### LazyBridge

**Optimised for: composition primitives — small, orthogonal building blocks that combine into any workflow shape with minimal ceremony.**

LazyBridge has one core abstraction (`Agent = Engine + Tools + State`) and a small set of composition primitives that **all return the same shape** — every primitive returns an Envelope, every composition returns an Agent.

It shines when:

- You want **multi-provider out of the box** — change one string to switch between Claude, GPT, Gemini, DeepSeek, Mistral, LM Studio
- You want **cross-model verification** to be a one-line concern (`verify=judge_agent` works with any model family — Step 6)
- The workflow is composition-heavy but the *individual pieces* aren't unusual (chain, parallel, sub-agent-as-tool, simple routing)
- You want **minimal dependencies** (no LangChain object hierarchy, no Crew opinions) and a small mental model
- **Code review readability** matters — every LazyBridge primitive reads like "what is happening" not "what graph wiring is happening"
- HIL is a first-class engine swap, not a bolted-on flag

It costs you:

- A **smaller community** than LangGraph or CrewAI — fewer Stack Overflow answers, fewer tutorials in the wild, fewer integrations packaged
- **No first-party observability platform** like LangSmith yet — you bring your own exporters (OTel, structured logs, JSON files — all built in, but no hosted UI shipped by LazyBridge)
- **Less battle-tested** at the very-large-graph scale that LangGraph was built to handle

______________________________________________________________________

## The feature matrix at a glance

The eight dimensions most projects actually weigh when choosing:

| Dimension             | LangGraph                     | CrewAI                             | LazyBridge                                   |
| --------------------- | ----------------------------- | ---------------------------------- | -------------------------------------------- |
| Core abstraction      | Graph (nodes + edges + state) | Crew (agents + tasks + process)    | Agent (engine + tools + state)               |
| Multi-provider        | via `langchain_*` adapters    | via underlying SDK                 | **built-in (one string)**                    |
| Structured output     | manual State plumbing         | task `expected_output=`            | **`output=PydanticModel`**                   |
| Cross-model verify    | manual loop                   | manual loop                        | **`verify=judge_agent`**                     |
| Sub-agent composition | wrap a node                   | hierarchical process               | **`tools=[other_agent]` / chain / parallel** |
| Conditional routing   | `add_conditional_edges`       | none — plain Python                | **`routes=` / `routes_by=`**                 |
| Human-in-the-loop     | `interrupt()` + checkpointer  | `human_input=True` (terminal only) | **`HumanEngine` (terminal/web/custom)**      |
| Dependencies          | langchain-core + N adapters   | crewai + langchain-tools           | **stdlib + pydantic + httpx**                |

Twelve more dimensions, for the curious

Compile-time validation, built-in checkpointing, streaming, built-in OTel, MCP server tools, provider-native tools (web search / code-exec), parallel fan-out, sequential pipeline expression, tools schema generation, simple-call line count, ecosystem size, maturity:

| Dimension                      | LangGraph                | CrewAI                          | LazyBridge                    |
| ------------------------------ | ------------------------ | ------------------------------- | ----------------------------- |
| Simple call                    | ~10 lines (StateGraph)   | ~12 lines (Agent + Task + Crew) | **3 lines**                   |
| Tools (schema)                 | manual schema + ToolNode | `@tool` decorator               | **type hints + docstring**    |
| Sequential pipeline            | linear `add_edge` graph  | `Process.sequential`            | **`Agent.chain`**             |
| Parallel fan-out               | `Send` + reducer State   | drop to `asyncio.gather`        | **`Agent.parallel`**          |
| Compile-time validation        | yes (StateGraph compile) | partial                         | yes (Plan compile)            |
| Built-in checkpointing         | yes (multiple backends)  | no                              | yes (Plan checkpoint)         |
| Streaming                      | yes (rich modes)         | partial                         | yes (`agent.stream`)          |
| Built-in OTel                  | via LangSmith            | no                              | yes (`OTelExporter`)          |
| MCP server tools               | via tooling layer        | via custom                      | **built-in (`MCP.stdio`)**    |
| Native tools (web search etc.) | per-provider plumbing    | limited                         | **built-in (`NativeTool.*`)** |
| Ecosystem                      | very large (LangChain)   | medium                          | small (focused)               |
| Maturity                       | high                     | medium-high                     | early but stable              |

For most decisions, the eight dimensions above are enough. The decision tree below is a faster route to "which one" than the matrix.

______________________________________________________________________

## Same task, three frameworks — concept count *(simple linear pipeline)*

The classic three-step pipeline (research → write → edit) one last time, counting the **distinct concepts** a beginner has to learn:

|                        | LangGraph                                                                                                                     | CrewAI                                                                                   | LazyBridge              |
| ---------------------- | ----------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------- | ----------------------- |
| Composition expression | `add_node × 3 + add_edge × 4 + START + END`                                                                                   | `Crew(agents=, tasks=, process=)`                                                        | `Agent.chain(a, b, c)`  |
| Required concepts      | StateGraph, TypedDict, add_messages, Annotated reducer, nodes, edges, START, END, compile, invoke, state['messages'] plumbing | Agent, role, goal, backstory, Task, description, expected_output, context, Crew, process | Agent, LLMEngine, chain |
| Total concept count    | ~11                                                                                                                           | ~10                                                                                      | **3**                   |

What this count means

This isn't about lines of code — it's about how many things a new reader has to load into their head before they can change anything.

**The numbers above are for a 3-stage *linear* pipeline.** For a branching workflow with routing, parallel bands, and HIL, every framework expands — LangGraph adds reducers and conditional edges, CrewAI adds Process flavours and custom hooks, LazyBridge adds `routes_by=` / `after_branches=` / `Plan` / `HumanEngine`. The *gap* tends to stay roughly constant; LazyBridge doesn't stay at "3 concepts" as you grow.

LangGraph and CrewAI **earn the extra concept count** on non-trivial graphs — they're not paying for nothing.

______________________________________________________________________

## Choose your framework — decision tree

```text
Do you need long-running, stateful, persistable workflows
(workflows that survive restarts / span days)?
│
├── YES ──► LangGraph (checkpointer ecosystem is mature)
│
└── NO ──►  Does the team think in "role + goal + backstory" terms,
            and the workflow shape is mostly a team-of-experts pipeline?
            │
            ├── YES ──► CrewAI (role metaphor is its strength)
            │
            └── NO ──►  Do you need:
                        - multi-provider as a first-class concern, or
                        - cross-model verification, or
                        - small dependency footprint, or
                        - composition primitives that all stay the same shape?
                        │
                        ├── YES (any of) ──► LazyBridge
                        │
                        └── NO ──────────────► Pick the one your team already knows.
                                              Migration cost > framework choice cost.
```

______________________________________________________________________

## When LazyBridge is the wrong choice

To stay credible, four cases where you should reach for something else:

1. **You need first-party hosted observability today.** LangSmith (LangChain) has the most polished platform for tracing, evaluation, and prompt management. LazyBridge ships exporters (OTel / JSON / console / custom callback) but no hosted UI.
1. **Your workflow has > 30 nodes with complex branching.** At that scale LangGraph's explicit graph DSL pays for itself. LazyBridge's `Plan` can express the same thing but the resulting code is denser to read.
1. **Stakeholders read the agent code.** If non-engineers are reviewing the workflow definitions, CrewAI's role/goal/backstory metaphor is a real win — it reads like a job description.
1. **You're deeply invested in the LangChain ecosystem.** `langchain_*` adapters, `langchain-community` tools, LangSmith, LangGraph Cloud — if you've built infrastructure on this stack, the switching cost outweighs the surface-area savings.

These are real cases. Be honest about whether they apply.

______________________________________________________________________

## Migration paths

If you're already on one of the others, what does moving look like?

### From LangChain / LangGraph to LazyBridge

- **Agent core:** `ChatAnthropic / ChatOpenAI + bind_tools → Agent(engine=LLMEngine(...), tools=[...])`. One-to-one mapping.
- **Sequential graphs:** `StateGraph + add_edge × N → Agent.chain(a, b, c)`.
- **Conditional edges:** `add_conditional_edges + router → Step(routes=)` or `routes_by=`.
- **Tools:** `@tool` decorator → plain Python function with type hints + docstring.
- **State:** if you used `MessagesState`, switch to `Memory`. For typed cross-step state, the `Store` + sentinels combo replaces State reducers.
- **Watch out:** if you used LangSmith heavily, plan your observability story before migrating (OTel exporter is the recommended path).

### From CrewAI to LazyBridge

- **Agents:** `Agent(role=..., goal=..., backstory=...)` → fold role/goal/backstory into a `system="..."` string. Often shorter.
- **Tasks:** `Task(description=..., expected_output=..., context=[...])` → just call the agent with the input string; use `Agent.chain` for sequential, `Plan` for explicit DAG.
- **Sequential Crews:** `Crew(process=Process.sequential)` → `Agent.chain(...)`.
- **Hierarchical Crews:** `Process.hierarchical` → `Agent(engine=LLMEngine(...), tools=[sub_agent_1, sub_agent_2])` (sub-agent-as-tool).
- **Tools:** CrewAI tools (and `langchain-tools` ones) are wrappable; if they were `@tool`-decorated, the underlying function works directly in LazyBridge.

### From LazyBridge to LangGraph or CrewAI

This is just as legitimate. LazyBridge isn't a religion — if your project's shape changes (you grow into 50-node workflows, or your team prefers explicit roles), the lift is small in the other direction too because LazyBridge agents are thin wrappers around the same provider SDKs the others use.

______________________________________________________________________

## What LazyBridge intentionally doesn't do

A short list of features LazyBridge *deliberately* leaves out:

- **No proprietary tracing/eval platform.** Exporters are open; the hosted UI isn't part of the project. Use OTel + your favourite backend (Honeycomb, Datadog, Jaeger).
- **No first-class "memory store" abstraction beyond `Memory` + `Store`.** We don't ship vector DB adapters. Bring your own retriever; wrap it as a tool.
- **No prompt template engine.** System prompts are Python strings. When you need templating, `str.format` and `f-strings` are right there.
- **No agent marketplaces or pre-built personas.** Build your own agents from the primitives. Less to memorise, less to keep in sync.

This list is short because the philosophy is: ship the primitives, let users build the rest. Each thing we *don't* ship is a thing we *won't* break in a future release.

______________________________________________________________________

## Summary

| Framework      | Pick it when                                                                                                   |
| -------------- | -------------------------------------------------------------------------------------------------------------- |
| **LangGraph**  | Complex stateful graphs, long-running workflows with persistence, you're already in LangChain                  |
| **CrewAI**     | Team-of-experts metaphor fits, opinionated abstractions are an asset, stakeholders read the code               |
| **LazyBridge** | Multi-provider matters, cross-model verify matters, you want a small composition surface, minimal dependencies |

You now have all the context you need to choose. The last step points you to where the LazyBridge docs go after this tutorial — guides, recipes, reference, and the topics we deliberately left for later.

______________________________________________________________________

[**Get started →**](https://core.lazybridge.com/quickstart/index.md) [Why LazyBridge](https://core.lazybridge.com/why/index.md)