Server
Production checklist
Preflight checklist for AI Transport deployments.
Use this checklist before enabling AI Transport for production traffic.
Build and config
- Build with
v2,ai-transport, the selected shared adapter/cache/store features, andpushwhen push recipes are used. - Enable
[history],[versioned_messages], and[ai_transport]. - Use shared history, version store, and cache for horizontal deployments.
- Scope
[[ai_transport.channels]]to narrow prefixes. - Set history and version retention to cover reconnect, rewind, and support windows.
Auth
- Issue short-lived V2 capability tokens from a backend only.
- Keep capability patterns exact or tenant/user scoped.
- Use app-key HTTP only from trusted server workers.
- Revoke by
jtiorclient_idfor incident response. - Do not accept client-asserted identity fields.
Operations
- Scrape Prometheus metrics from every node.
- Alert on history degraded/reset-required channels, recovery failures, rollup flush latency, active stream leaks, horizontal transport drops, and push queue/provider failures.
- Run the AIT-S conformance suite before release.
- Run
scripts/ai-transport-bench-guard.shand review budgets. - Run
scripts/ai-transport-ga-gate.sh ci-evidenceon every release-candidate branch. - Run
scripts/ai-transport-ga-gate.sh release-evidencebefore any GA tag; it fails until the external S14 scale/chaos, rolling-upgrade, and full SDK compatibility manifests are committed. - Generate release evidence with
scripts/ai-transport-s14-release-evidence.sh,scripts/ai-transport-rolling-upgrade-redis.mjs --execute, andscripts/sdk-compat-full-matrix.mjs --execute. - Execute
test/loadandtools/chaosprofiles on production-like hardware before headline claims. - Build Docker images both with and without
ai-transportenabled.
Client behavior
- Treat Sockudo history as authoritative after reconnect or notification open.
- Use
until_attachfor late-join history to avoid gaps. - Reduce mutable-message events by
message_serialandversion.serial. - Send terminal stream status for completed/cancelled/error turns.
- Refresh tokens before expiry and back off on auth errors.
GA readiness
The canonical readiness record is
docs/specs/ai-transport-ga-readiness.md. Server releases ship
AI Transport defaults off before SDK releases that depend on the feature. Full product parity still
depends on the SDK plans in plans/ai-transport/02-sdk-prompts.md and the E4/E5 enablement work in
plans/ai-transport/03-existing-sdks-prompts.md.