Runtime configs & testing¶

The 0.7-era AgentRuntimeConfig / ResilienceConfig / ObservabilityConfig wrapper-of-flat-kwargs configs were deleted in 0.7.9 — they bundled flat kwargs into shareable objects with a flat kwarg > config object > default precedence game that required a private _UNSET sentinel on every kwarg (a documented LLM trap, T14 in the audit).

For fleet management, use a Python dict spread:

PROD_DEFAULTS = dict(
    timeout=60,
    max_retries=5,
    max_output_retries=2,
    cache=True,
    verbose=False,
    session=session,
)

researcher = Agent(**PROD_DEFAULTS, engine=LLMEngine("model"), name="research")
writer     = Agent(**PROD_DEFAULTS, engine=LLMEngine("model"), name="write")

Same end-user value, no precedence-game complexity, no sentinel.

Cache config¶

CacheConfig carries real semantic value (enabled, ttl) consumed inside LLMEngine. As of the v1 API pass it lives under lazybridge.core.types (import it from there); the top-level re-export is deprecated since 0.10 and still present (with a DeprecationWarning) as of 1.0.1 — pending removal in a future major version, not yet scheduled.

lazybridge.core.types.CacheConfig `dataclass` ¶

CacheConfig(enabled: bool = True, ttl: str = '5m')

Mark the static prefix of a request (system prompt + tool definitions) as cacheable.

Providers with explicit prompt caching (Anthropic) need a cache_control marker on the last block of each cached segment; this makes it a one-flag opt-in instead of asking callers to hand-craft provider-specific content lists.

Provider-specific behaviour:

Anthropic — the system prompt is upgraded from a string to a [{type: "text", text, cache_control}] block; a cache breakpoint is also placed on the last tool definition if tools are present. Cache hits cost ~10% of input tokens; writes cost ~25% more. TTL options: "5m" (default) or "1h".
OpenAI — automatic for system prompts >1024 tokens; no user-visible opt-in is required. This config is a no-op but accepted for forward-compat.
Google Gemini — explicit Context Caching uses a different lifecycle (create a cache resource, reference by name). Not auto-wired from this config; pass via extra if needed.
DeepSeek — automatic; no-op.

Testing¶

lazybridge.MockAgent ¶

MockAgent(responses: Any, *, name: str = 'mock_agent', description: str | None = None, output: type = str, cycle: bool = False, delay_ms: float = 0.0, default_input_tokens: int = 10, default_output_tokens: int = 20, default_cost_usd: float = 0.0, default_latency_ms: float | None = None, default_model: str = 'mock', default_provider: str = 'mock')

Deterministic test double that quacks like :class:lazybridge.Agent.

Designed for testing pipeline composition and data transmission without touching a real provider. Drop-in compatible with:

Agent(..., tools=[mock]) — wrapped via _wrap_tool using the _is_lazy_agent duck-type marker; nested Envelope metadata rolls up through the tool boundary.
Plan(Step(target=mock, ...)) — PlanEngine detects _is_lazy_agent and calls target.run(env) directly.
mock.as_tool() returns a standard :class:Tool.
Agent.chain(mock_a, mock_b) / Agent.parallel(mock_a, mock_b).

Response specification. The responses argument is resolved per call in this order:

Callable fn(env) -> value (sync or async) — the incoming :class:Envelope is passed in. Whatever the callable returns is re-fed through rules 2–4.
Dict — keys are substrings checked against env.task in insertion order; "*" is a catch-all default. No match and no default raises :class:RuntimeError.
List — one response per call, in order. Exhausting the list raises :class:RuntimeError unless cycle=True.
Scalar — returned on every call.

Response value types (after callable resolution):

:class:Envelope — returned verbatim; payload + metadata + error preserved. Use this to test error-path propagation.
:class:ErrorInfo — returned as Envelope(error=...).
:class:BaseException instance — raised from run() so tests can assert on provider-style failures.
:class:~pydantic.BaseModel / str / dict / list / scalar — wrapped as the envelope payload with the mock's default metadata (tokens, cost, model, provider).

Recording. Every call is appended to :attr:calls. Use :meth:assert_called_with, :meth:assert_call_count, :attr:call_count, :attr:last_call for assertions. :meth:reset clears both the call log and the list/cycle cursor.

Example::

from lazybridge import Agent, Plan, Step
from lazybridge.testing import MockAgent

researcher = MockAgent(
    {"weather": "sunny", "market": "bullish", "*": "no data"},
    name="researcher",
    default_input_tokens=50, default_output_tokens=30,
)
writer = MockAgent(
    lambda env: f"Report based on: {env.text()}",
    name="writer",
)

plan = Plan(
    Step(target=researcher, task="weather today"),
    Step(target=writer),   # from_prev by default
)
agent = Agent(engine=plan)
env = agent("daily brief")

assert "sunny" in env.text()
assert researcher.call_count == 1
assert writer.call_count == 1
assert env.metadata.input_tokens + env.metadata.output_tokens > 0