Learn
Model Routing
A gateway strategy for choosing the right model per task based on privacy, cost, latency, quality, and failure mode.
Why route
One model is rarely optimal for every task. Classification, retrieval reasoning, summarization, coding, and final answer generation have different cost and quality profiles.
- Private or local models for sensitive low-risk tasks
- Frontier models for high-complexity reasoning
- Fallbacks for outage or quality regression
- Cost and token telemetry per route