Model routing
Send one request. Cudator picks the right model based on price, latency, and quality, then load-balances and fails over across a pool of credentials.
- Policy-based routing by cost, latency, or task
- Weighted load-balancing & automatic failover
- One OpenAI-compatible endpoint for all vendors
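
To make the routing idea concrete, here is a minimal Python sketch of the two steps: pick a model by policy, then spread calls across a weighted credential pool with failover. It is illustrative only and does not show Cudator's actual implementation or API. Every name in it (`Model`, `Credential`, `pick_model`, `call_with_failover`, the policy strings, and the sample numbers) is a hypothetical stand-in.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_1k: float  # illustrative price per 1k tokens
    latency_ms: float   # illustrative typical latency
    quality: float      # illustrative score, higher is better


@dataclass(frozen=True)
class Credential:
    key_id: str
    weight: float  # relative share of traffic; must be positive


def pick_model(models, policy):
    """Return the model that best fits a simple, hypothetical policy name."""
    if policy == "cost":
        return min(models, key=lambda m: m.cost_per_1k)
    if policy == "latency":
        return min(models, key=lambda m: m.latency_ms)
    if policy == "quality":
        return max(models, key=lambda m: m.quality)
    raise ValueError(f"unknown policy: {policy}")


def call_with_failover(credentials, send, rng=random):
    """Try credentials in weighted-random order, failing over on any error."""
    remaining = list(credentials)
    last_error = None
    while remaining:
        # Weighted pick among the credentials that have not failed yet.
        weights = [c.weight for c in remaining]
        cred = rng.choices(remaining, weights=weights, k=1)[0]
        try:
            return send(cred)
        except Exception as exc:  # a real router would catch narrower errors
            last_error = exc
            remaining.remove(cred)
    raise RuntimeError("all credentials failed") from last_error


if __name__ == "__main__":
    models = [
        Model("small", cost_per_1k=0.1, latency_ms=200, quality=0.6),
        Model("large", cost_per_1k=1.0, latency_ms=900, quality=0.9),
    ]
    pool = [Credential("key-a", weight=3), Credential("key-b", weight=1)]

    def send(cred):
        # Stand-in for an upstream call; "key-a" is simulated as failing.
        if cred.key_id == "key-a":
            raise ConnectionError("simulated upstream failure")
        return f"ok via {cred.key_id}"

    print(pick_model(models, "cost").name)
    print(call_with_failover(pool, send))
```

In this sketch a failed credential is dropped only for the current request. A production router would more likely track health across requests and retry failed credentials after a cool-down.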