DEV Community

Dubhe

How to Cut Your AI API Costs by 60% Without Changing Your Code

Six months ago I was managing five separate AI API accounts, including OpenAI for chat, Anthropic for code, and DeepSeek for cost-sensitive tasks. Each one had its own billing, its own rate limits, and its own dashboard.

The wake-up call came when I got a $1,200 invoice from one provider and realized I could have used a cheaper model for 80% of those calls.

The solution: A single API gateway that routes each request to the cheapest model that can handle it.

Here's what I built: https://dubhehub.com

The Architecture

Your App → Dubhe API Gateway → 6 models (Fast, Code, Agent, Plus, Vision, Reasoning)

One API key, one endpoint, one bill. The gateway handles:

  • Automatic routing based on model capability
  • Fallback when one provider rate-limits you
  • Unified usage tracking across all models

The Results

  • Monthly spend: $800 → $320 (60% reduction)
  • Dev time saved: No more juggling SDKs
  • Reliability: Automatic fallback means zero downtime from rate limits

Quick Start

curl https://dubhehub.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dubhe-fast",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
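If you would rather make the same call from Python, the sketch below builds an equivalent request using only the standard library. The URL, headers, and JSON body mirror the curl command. The helper names `build_request` and `send` are my own, and the shape of the response is not documented in this post, so `send` just returns the decoded JSON without assuming any fields.

```python
import json
import urllib.request

API_URL = "https://dubhehub.com/v1/chat/completions"


def build_request(api_key, prompt, model="dubhe-fast"):
    """Build the same POST request as the curl example, without sending it."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON body as a dict."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `send(build_request("YOUR_API_KEY", "Hello!"))` performs the request; keeping the build and send steps separate makes it easy to inspect the payload before anything goes over the network.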

Pricing (per 1M tokens)

| Model | Input | Output |
|---|---|---|
| Fast | $0.20 | $0.60 |
| Code | $0.80 | $3.00 |
| Agent | $1.00 | $4.00 |
| Vision | $3.00 | $10.00 |
| Reasoning | $3.00 | $12.00 |
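To see what these prices mean for a real workload, a small estimator helps. This is a sketch that assumes billing is strictly linear in tokens at the listed rates; the `cost_usd` function and the example token counts are mine, not part of the service.

```python
# USD per 1M tokens, copied from the pricing list: (input, output).
PRICES = {
    "Fast": (0.20, 0.60),
    "Code": (0.80, 3.00),
    "Agent": (1.00, 4.00),
    "Vision": (3.00, 10.00),
    "Reasoning": (3.00, 12.00),
}


def cost_usd(model, input_tokens, output_tokens):
    """Estimate the cost of a workload, assuming purely per-token billing."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 10M input tokens and 2M output tokens.
    for name in PRICES:
        print(f"{name}: ${cost_usd(name, 10_000_000, 2_000_000):.2f}")
```

For that example workload, Fast comes to about $3.20 and Reasoning to about $54.00, which is why sending simple requests to the cheaper tier makes such a difference.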

Try It

Free tier gives you 100K tokens to test it out. No credit card needed.

→ https://dubhehub.com

Built by an indie dev who got tired of overpaying for AI APIs. Feedback welcome!
