| Dimension | Setting |
|---|---|
| Autonomy level | L2 Assisted |
| Approval gates | High-severity tickets require human confirmation before Tier-2 assignment; refunds over $500 require explicit approval |
| Override path | Support agent can override any routing decision via the triage UI override button; logged and audited |
| Escalation trigger | Confidence below 0.7, unrecognized category, customer sentiment flagged as high-distress, or any mention of legal/compliance terms |
| Fallback behavior | Route to human queue with full ticket context and agent reasoning summary; max 4-hour SLA |
| Instrument | Status | Notes |
|---|---|---|
| Distributed tracing (OpenTelemetry) | In Place | CHAIN, LLM, and TOOL spans instrumented. Trace IDs propagated to CRM and ticketing system calls. |
| LLM call logging | In Place | Prompt/completion pairs logged (PII stripped) with model, temperature, and token counts. |
| Token cost metrics | Partial | Total tokens per session logged; per-tool-call cost breakdown not yet available. Cost dashboards not yet configured. |
| Error rate alerting | In Place | PagerDuty alert on error rate > 5% over 5-minute window; escalates to on-call engineer. |
| Latency p95 dashboard | Partial | Overall latency measured; per-span breakdown (model inference vs tool vs network) not yet visible in dashboard. |
| Session replay | Missing | Full agent session replay (for post-incident diagnosis) not implemented. Logs available but no replay interface. |
| Audit trail (HITL actions) | In Place | All human override and approval actions logged with timestamp, agent ID, and routing decision delta. |
Deploy to canary at 5% traffic (read-only ticket classification only — disable CRM write tool in canary config) while blockers B1 and B2 are resolved. Monitor error rate, latency p95, and HITL escalation rate during canary. Once B1 and B2 are cleared and load test passes, expand to 25% then full traffic in 1-week increments. Do not enable CRM write tool at any traffic tier until B2 (circuit breaker) is implemented and tested.