Inference dominates everything
At 1M interactions/month, agent operations cost $59K-$151K, and LLM inference accounts for 70-80% of that. Cutting observability by 50% saves $1.5K-$4K. Cutting inference by 20% saves $9K-$24K.
| Component | Monthly Cost | % of Total |
|---|---|---|
| Inference (LLM API calls) | $45,000 - $120,000 | 70-80% |
| Agent compute | $8,000 - $15,000 | 10-13% |
| Observability | $3,000 - $8,000 | 5-6% |
| Governance | $2,000 - $5,000 | 3-4% |
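The comparison between trimming observability and trimming inference can be checked directly from the table rows. The sketch below is only illustrative: the `COSTS` dictionary and the `savings` helper are names made up for this example, and the figures are the low and high ends of each range in the table.

```python
# Monthly cost ranges (low, high) in USD, copied from the table rows.
COSTS = {
    "inference": (45_000, 120_000),
    "agent_compute": (8_000, 15_000),
    "observability": (3_000, 8_000),
    "governance": (2_000, 5_000),
}


def savings(component: str, cut_percent: int) -> tuple[int, int]:
    """Return the (low, high) monthly savings from cutting one component.

    Integer percentages keep the arithmetic exact.
    """
    low, high = COSTS[component]
    return low * cut_percent // 100, high * cut_percent // 100


if __name__ == "__main__":
    print(savings("observability", 50))  # (1500, 4000)
    print(savings("inference", 20))      # (9000, 24000)
```

Even at the low end of both ranges, a 20% inference cut is worth more than the entire observability budget.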
Right-sized intelligence
Trust Cascade Architecture
Confidence-Based Escalation
Each level has a confidence threshold. If Level 1 can make a decision with confidence at or above its threshold, the decision stays there. If not, it escalates to Level 2, and so on up the cascade. Only genuinely complex decisions reach the expensive levels.
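A minimal sketch of this escalation loop follows, under some assumptions. The `Level` dataclass, the `run_cascade` function, the two stub deciders, and the threshold values are all hypothetical, not part of any specific framework. The fallback to the last level's answer when no threshold is met is also an assumption; a real system might hand off to a human instead.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Level:
    name: str
    threshold: float  # minimum confidence required to accept this level's answer
    decide: Callable[[str], Tuple[str, float]]  # returns (decision, confidence)


def run_cascade(levels: List[Level], request: str) -> Tuple[str, str, float]:
    """Try levels cheapest-first; stop at the first one that is confident enough."""
    if not levels:
        raise ValueError("at least one level is required")
    for level in levels:
        decision, confidence = level.decide(request)
        if confidence >= level.threshold:
            return level.name, decision, confidence
    # Assumption: if no level meets its threshold, keep the last level's answer.
    return level.name, decision, confidence


# Hypothetical stand-ins for a cheap rule check and an expensive model call.
def cheap_rules(request: str) -> Tuple[str, float]:
    return ("approve", 0.95) if len(request) < 20 else ("unsure", 0.40)


def large_model(request: str) -> Tuple[str, float]:
    return ("review", 0.90)


LEVELS = [
    Level("Level 1", 0.90, cheap_rules),
    Level("Level 2", 0.80, large_model),
]

if __name__ == "__main__":
    print(run_cascade(LEVELS, "reset password"))
    print(run_cascade(LEVELS, "dispute a charge across two accounts"))
```

The short request is resolved at Level 1, while the longer one falls below the Level 1 threshold and escalates, so the expensive call is paid only when the cheap level is unsure.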