Lower cost per token
Every request triggers real-time bidding, so providers compete on price instead of locking you into one margin stack.
Compute OverhangRoute every prompt through open competition and capture the best available economics without sacrificing reliability.
Every request triggers real-time bidding, so providers compete on price instead of locking you into one margin stack.
If a winner fails before first token, requests are re-auctioned automatically once to keep responses moving.
Keep your OpenAI-style integration while routing to the best eligible provider at runtime.
API-key auth, idempotency, bounded in-memory controls, and durable SQLite event persistence are included by default.