It happened again. At 2:47 AM on a Tuesday, your on-call engineer gets paged. A major upstream model provider is down. Not degraded. Down. And now hundreds of tenant AI agents, mid-conversation, mid-workflow, mid-tool-call, are frozen in place. Some tenants have enterprise SLAs. Some are running autonomous agents that