Replay-mode managed-agents for engineering teams
Replay-mode managed-agents for engineering teams. CI, code review, incident triage. Part of the subagentdispatch fleet — same chassis, same replay layer, vertical-specific corpus. (2 cassettes on this site.)
Ticket-deflector — Triage a refund-after-cancel support ticket
Classification: (b) escalate to billing — duplicate-charge after a cancellation requires a finance review. Draft reply (held for billing review, not sent): Thanks for flagging this — I'm pulling the cancellation and the new charge now so billing can confirm both in one pass. You'll get a refund + a written confirmation that the Starter plan is fully closed within one business day.
Platform capabilities
Record once. Replay everywhere. Stream cassettes from Cloudflare R2 with zero tokens spent.
SSE replay from R2
Every demo streams a recorded cassette through /api/v1/replay/<slug>. The chunk shape, delta_ms timing, and tool-use boundaries match the original Anthropic response byte-for-byte.
MCP server
Connect from Claude Desktop, Cursor, or any MCP-compatible AI tool to ingest cassettes, query the replay layer, and trigger record-mode against your own Anthropic key.
Per-vertical fleet
subagentlegal, subagentfinance, subagentengineering — each fleet site runs the same chassis with a SITE_VERTICAL var that filters cassettes and themes the hero. Add a vertical in one commit.
Built on
Edge-boundary recording (Netflix replay pattern). Cassettes on R2. SSE streamed verbatim.
HAR 1.2 + _anthropic SSE chunks, keyed by sha256(method + url + body)
OpenNext on Cloudflare. Replay endpoint emits SSE with original delta_ms timing
Live / record traffic goes through Cloudflare AI Gateway — retries, observability, cost capture
Demo cassettes
2 in the subagentengineering vertical — every one streamed live from /api/v1/replay/<slug>.
How it works
Three steps from a live Anthropic API call to a zero-token replay.
Record once
REPLAY_MODE=record routes through Cloudflare AI Gateway and captures every Anthropic request + SSE response chunk as a HAR cassette in R2.
Match by request class
Cassettes key on sha256(method + url + normalized_body). Request IDs, UUIDs, and timestamps are stripped so the same logical request matches across runs.
Replay everywhere
REPLAY_MODE=replay never touches the network. GET /api/v1/replay/<slug> streams the recorded SSE chunks with original delta_ms timing. Zero API key, zero tokens.
Search the corpus
BM25-lite over every demo cassette. Type to jump to the matching replay.
Fleet
Same chassis, vertical-specific corpus. Each site runs the replay layer with its own SITE_VERTICAL.
Programmatic discovery: /api/v1/fleet