Six months ago I was managing five separate AI API accounts, among them OpenAI for chat, Anthropic for code, and DeepSeek for cost-sensitive tasks. Each one had its own billing, its own rate limits, and its own dashboard.
The wake-up call came when I got a $1,200 invoice from one provider and realized I could have used a cheaper model for 80% of those calls.
The solution: a single API gateway that routes each request to the cheapest model that can handle it.
Here's what I built: https://dubhehub.com
The Architecture
Your App → Dubhe API Gateway → 6 models (Fast, Code, Agent, Plus, Vision, Reasoning)
One API key, one endpoint, one bill. The gateway handles:
- Automatic routing based on model capability
- Fallback when one provider rate-limits you
- Unified usage tracking across all models
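The routing and fallback behavior in the list above can be sketched in a few lines of Python. This is a minimal illustration, not Dubhe's actual implementation: the `Model` catalog, its capability tags, the `RateLimited` exception, and the `call` callback are all hypothetical names, and the prices are borrowed from the pricing table later in this post purely as a sort key.

```python
from dataclasses import dataclass


class RateLimited(Exception):
    """Hypothetical error a backend call raises when the provider throttles us."""


@dataclass(frozen=True)
class Model:
    name: str
    capabilities: frozenset
    input_price: float  # USD per 1M input tokens


# Hypothetical catalog: the capability tags are assumptions for illustration.
CATALOG = [
    Model("fast", frozenset({"chat"}), 0.20),
    Model("code", frozenset({"chat", "code"}), 0.80),
    Model("reasoning", frozenset({"chat", "code", "reasoning"}), 3.00),
]


def candidates(required):
    """Return the models that cover the required capability, cheapest first."""
    capable = [m for m in CATALOG if required in m.capabilities]
    return sorted(capable, key=lambda m: m.input_price)


def route(required, prompt, call):
    """Try capable models from cheapest up, moving on when one is rate-limited."""
    tried = []
    for model in candidates(required):
        try:
            return model.name, call(model, prompt)
        except RateLimited:
            tried.append(model.name)
    raise RuntimeError(f"no capable model available for {required!r}; tried {tried}")


def fake_call(model, prompt):
    """Stand-in backend that pretends the 'code' model is being throttled."""
    if model.name == "code":
        raise RateLimited()
    return f"{model.name} answered"


print(route("chat", "Hello!", fake_call))        # ('fast', 'fast answered')
print(route("code", "Write a sort", fake_call))  # ('reasoning', 'reasoning answered')
```

Sorting by price before trying anything keeps the "cheapest model that can handle it" rule and the fallback rule in one loop: the fallback is simply the next-cheapest capable model.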
The Results
- Monthly spend: $800 → $320 (60% reduction)
- Dev time saved: No more juggling SDKs
- Reliability: Automatic fallback means zero downtime from rate limits
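For readers who want to check the arithmetic behind the spend figure, here is a small sketch. The first function reproduces the 60% number from the list above. The second is a what-if model of my own, not something measured in this post: it assumes a share of the old spend moves to a model that costs a fixed fraction of the original price. Applying the 80% share from the invoice story to the monthly spend is also an assumption.

```python
def reduction(before, after):
    """Fractional drop in spend, e.g. 0.6 for a 60% reduction."""
    return (before - after) / before


def spend_after_rerouting(before, share_moved, price_ratio):
    """Spend when `share_moved` of the old spend moves to a model that costs
    `price_ratio` times as much. Both inputs are illustrative assumptions."""
    return before * ((1 - share_moved) + share_moved * price_ratio)


print(f"{reduction(800, 320):.0%}")                      # 60%
print(round(spend_after_rerouting(800, 0.8, 0.25), 2))   # 320.0
```

Under those assumptions, moving 80% of spend to a model that costs a quarter as much lands on the same $320, which gives a rough sense of how much cheaper the fallback tier has to be for savings of this size.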
Quick Start
curl https://dubhehub.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dubhe-fast",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
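If you would rather call the endpoint from code, the same request can be sketched with Python's standard library. The URL, headers, and body mirror the curl command above. The helper names `build_request` and `send` are mine, and since the response format is not described in this post, `send` just returns whatever JSON comes back.

```python
import json
import urllib.request

ENDPOINT = "https://dubhehub.com/v1/chat/completions"  # same URL as the curl call


def build_request(api_key, prompt, model="dubhe-fast"):
    """Build the POST request the curl command sends, without sending it."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON body as-is.
    The response schema is an unknown here, so nothing is unpacked."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


req = build_request("YOUR_API_KEY", "Hello!")
print(req.get_method(), req.full_url)
```

Call `send(req)` once you have replaced `YOUR_API_KEY` with a real key. Keeping request construction separate from sending makes it easy to inspect or log the payload before any tokens are spent.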
Pricing (per 1M tokens)
| Model | Input | Output |
|---|---|---|
| Fast | $0.20 | $0.60 |
| Code | $0.80 | $3.00 |
| Agent | $1.00 | $4.00 |
| Vision | $3.00 | $10.00 |
| Reasoning | $3.00 | $12.00 |
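To turn the table into a number for your own workload, a rough estimator might look like the sketch below. The prices are copied from the table above. The `estimate_cost` helper and the example token counts are illustrative assumptions, and Plus is left out because the table gives no price for it.

```python
# USD per 1M tokens as (input, output), copied from the pricing table.
PRICES = {
    "Fast": (0.20, 0.60),
    "Code": (0.80, 3.00),
    "Agent": (1.00, 4.00),
    "Vision": (3.00, 10.00),
    "Reasoning": (3.00, 12.00),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Estimated cost in USD for the given token counts on one model."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Example workload: 2M input tokens and 500K output tokens.
for name in ("Fast", "Reasoning"):
    print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
# Fast: $0.70
# Reasoning: $12.00
```

The gap between those two lines is the whole argument for routing: the same traffic costs roughly 17 times more on the most expensive tier than on the cheapest.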
Try It
Free tier gives you 100K tokens to test it out. No credit card needed.
Built by an indie dev who got tired of overpaying for AI APIs. Feedback welcome!