Back to CatalogHidden Mode Switching breaks user trust and reproducibility by silently swapping which model serves a request under the hood.
Agentic AI
Anti-Patterns
Hidden Mode Switching
Silently routing users to a cheaper model without telling them — eroding trust and making debugging impossible.
Intent & Description
🎯 Intent
Routing some traffic to a smaller/cheaper model to manage cost — without disclosing which model produced any given response.
📋 Context
Cost and capacity pressure push teams to mix flagship and smaller models in the backend. Nothing in the response, UI, or trace reveals the actual model used. Users discover the swap from degraded quality, not from you.
💡 Solution
Disclose model identity per response. Make routing decisions inspectable in traces and operator dashboards. Use multi-model-routing transparently.
Real-world Use Case
- Never use this; silent routing of model changes undermines reproducibility and trust.
- Use multi-model-routing transparently, with the resolved model identity disclosed per response.
- Make routing decisions inspectable in traces and operator dashboards.
Source
📌 TL;DR
Always disclose model identity per response — silent model swaps destroy reproducibility and trust.
Disadvantages
- Trust erodes when users notice quality shifts and you have no explanation to give
- Reproducibility breaks — identical requests return different quality across calls
- Eval results become misleading because you can’’t tell which model you’’re measuring