DEV Community: Dubhe

How I connected OpenClaw to a Chinese frontier model and cut agent costs by 90%

Dubhe — Sat, 13 Jun 2026 17:26:51 +0000

I've been running OpenClaw for a few months. Love the agent workflow. Not so much the monthly bill.

Last week I set up a tiny routing layer that lets OpenClaw fall back to DeepSeek V4 for certain task types. The config change was 4 lines.

Here's what happened:

• Code review tasks: ~$0.15 → ~$0.002 each
• Batch processing: costs dropped from $20/day to ~$2/day
• The agent didn't notice the difference — same structured output, same format
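
If you want to sanity-check those numbers, here is the arithmetic as a small sketch. `percent_saved` is just an illustrative helper I made up for this post; it isn't part of OpenClaw.

```python
def percent_saved(before: float, after: float) -> float:
    """Return the cost reduction from `before` to `after` as a percentage."""
    return (before - after) / before * 100


# Figures quoted in the list above.
code_review = percent_saved(0.15, 0.002)  # cost per code review task
batch = percent_saved(20.0, 2.0)          # batch processing cost per day

print(f"code review: {code_review:.1f}% cheaper")
print(f"batch processing: {batch:.1f}% cheaper")
```

The batch figure is where the 90% in the title comes from; the per-task drop on code review is larger still.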

The setup is straightforward. You need any OpenAI-compatible endpoint that supports the models you want. Drop a provider block into your OpenClaw config, swap the model name, restart.

Not claiming this replaces Claude for complex reasoning. But for high-volume structured work, having a cheap fallback available through the same interface changes how aggressively you can let the agent run autonomously.

Happy to answer questions about the routing setup.

How to Cut Your AI API Costs by 60% Without Changing Your Code

Dubhe — Sun, 07 Jun 2026 23:41:46 +0000

Six months ago I was managing 5 separate AI API accounts. OpenAI for chat, Anthropic for code, DeepSeek for cost-sensitive tasks. Each one had its own billing, its own rate limits, its own dashboard.

The wake-up call came when I got a $1,200 invoice from one provider and realized I could have used a cheaper model for 80% of those calls.

The solution: A single API gateway that routes each request to the cheapest model that can handle it.

Here's what I built: https://dubhehub.com

The Architecture

Your App → Dubhe API Gateway → 6 models (Fast, Code, Agent, Plus, Vision, Reasoning)

One API key, one endpoint, one bill. The gateway handles:

Automatic routing based on model capability
Fallback when one provider rate-limits you
Unified usage tracking across all models
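
To make the fallback idea concrete, here's a minimal client-side sketch of "try the next model when one is rate-limited". It's an illustration of the pattern, not the gateway's actual code: the `RateLimited` exception, `complete_with_fallback`, and `fake_send` are all hypothetical names, and the real API call is replaced with a stand-in so the example runs offline.

```python
from typing import Callable, Sequence


class RateLimited(Exception):
    """Hypothetical error raised when a provider is rate-limiting us."""


def complete_with_fallback(
    models: Sequence[str],
    send: Callable[[str, str], str],
    prompt: str,
) -> tuple[str, str]:
    """Try each model in order and move on when one is rate-limited.

    Returns (model_used, reply). Raises RateLimited if every model is limited.
    """
    for model in models:
        try:
            return model, send(model, prompt)
        except RateLimited:
            continue  # this one is throttled, try the next candidate
    raise RateLimited("all candidate models are rate-limited")


# Stand-in for a real API call: pretend the first model is throttled.
def fake_send(model: str, prompt: str) -> str:
    if model == "dubhe-fast":
        raise RateLimited(model)
    return f"[{model}] reply to: {prompt}"


used, reply = complete_with_fallback(["dubhe-fast", "dubhe-plus"], fake_send, "Hello!")
print(used, reply)
```

Passing `send` in as a parameter keeps the retry logic separate from the HTTP client, so the same function works with whichever SDK you already use.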

The Results

Monthly spend: $800 → $320 (60% reduction)
Dev time saved: No more juggling SDKs
Reliability: Automatic fallback means zero downtime from rate limits

Quick Start

curl https://dubhehub.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dubhe-fast",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
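
If you'd rather not shell out to curl, the same request can be built with Python's standard library. This is a sketch that mirrors the curl command above; `build_chat_request` is a hypothetical helper name, and the request is only built here, not sent.

```python
import json
import urllib.request


def build_chat_request(api_key: str, model: str, user_message: str) -> urllib.request.Request:
    """Build (but do not send) the same POST request as the curl command."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        "https://dubhehub.com/v1/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_chat_request("YOUR_API_KEY", "dubhe-fast", "Hello!")
# To actually send it (needs a real key and network access):
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))
```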

Pricing (per 1M tokens)

Model	Input	Output
Fast	$0.20	$0.60
Code	$0.80	$3.00
Agent	$1.00	$4.00
Vision	$3.00	$10.00
Reasoning	$3.00	$12.00
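
To turn the table into a per-request number, multiply token counts by the per-1M prices. Here's a rough sketch: the prices are copied from the table above, while `estimate_cost` and the token counts in the example are made up for illustration.

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "Fast": (0.20, 0.60),
    "Code": (0.80, 3.00),
    "Agent": (1.00, 4.00),
    "Vision": (3.00, 10.00),
    "Reasoning": (3.00, 12.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical code review: 2,000 tokens in, 500 tokens out.
print(f"${estimate_cost('Code', 2_000, 500):.4f}")
```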

Try It

Free tier gives you 100K tokens to test it out. No credit card needed.

→ https://dubhehub.com

Built by an indie dev who got tired of overpaying for AI APIs. Feedback welcome! 🐉

I Built a Unified AI API With 6 Models — Here's What I Learned From 6 Months of Running It

Dubhe — Thu, 04 Jun 2026 11:07:55 +0000

6 months ago, I was managing 5 separate AI API accounts. OpenAI for chat, Anthropic for code, DeepSeek for cost-sensitive tasks. Each with its own SDK, its own billing, its own dashboard.

It was a mess.

So I built Dubhe AI — a single API gateway that routes requests to the best model for each task, all under one account, one API key, one bill.

What's Under the Hood

Fast (general purpose chat)
Reasoning (deep thinking & complex logic)
Code (programming & software engineering)
Vision (image understanding)
Agent (autonomous workflows)
Omni (long-context tasks)

All OpenAI-compatible. Switching from your current provider takes about 5 minutes.

Pricing That Actually Makes Sense

Plan	Price	Tokens/Month
Free	$0	100K
Starter	$9	3M
Pro	$29	12M
Business	$49	20M
Pro Plus	$99	40M

All plans include all 6 models. No hidden fees.
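
One way to compare the paid plans is the effective price per 1M tokens, assuming you use the full monthly allowance. A quick sketch, with the plan figures copied from the table above and `price_per_million` as an illustrative helper:

```python
# (monthly price in USD, monthly allowance in millions of tokens)
PLANS = {
    "Starter": (9, 3),
    "Pro": (29, 12),
    "Business": (49, 20),
    "Pro Plus": (99, 40),
}


def price_per_million(plan: str) -> float:
    """Effective USD per 1M tokens if the whole allowance is used."""
    price, millions = PLANS[plan]
    return price / millions


for plan in PLANS:
    print(f"{plan}: ${price_per_million(plan):.2f} per 1M tokens")

cheapest = min(PLANS, key=price_per_million)
```

On these figures Starter works out to $3.00 per 1M tokens and Pro to roughly $2.42, with the two larger plans landing slightly above Pro.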

What I Learned

Price matters more than brand. Most indie devs choose providers based on cost first. Being 5-10x cheaper than OpenAI is a real advantage.
One API key is a big deal. Developers hate managing multiple credentials.
Global payments are hard. PayPal rejected us 3 times. We went with Paddle as our merchant of record.

Try it out: https://dubhehub.com

Would love your feedback — what's your biggest pain point with AI APIs today?

I Built a Unified AI API With 6 Models — What I Learned From 6 Months of Running It

Dubhe — Tue, 02 Jun 2026 19:34:16 +0000

Why I Built This

6 months ago, I was managing 5 separate AI API accounts. OpenAI for chat, Anthropic for code, DeepSeek for cost-sensitive tasks, Google for vision. Each with its own SDK, its own billing, its own dashboard.

It was a mess.

So I built Dubhe AI — a single API gateway that routes requests to the best model for each task, all under one account, one API key, one bill.

What's Under the Hood

Fast (DeepSeek V4 Flash) — general-purpose, great for chat
Vision (MiMo V2.5) — image understanding
Code (GLM 4.7) — programming tasks
Agent (MiniMax M2.7) — complex agent workflows
Plus (Qwen 3.6) — balanced performance
Reasoning (DeepSeek V4 Pro) — complex reasoning

All OpenAI-compatible. Switching from your current provider takes about 5 minutes.

Pricing That Actually Makes Sense

Plan	Price	Tokens/Month
Starter	$9	3M
Pro	$29	12M
Business	$49	20M
Pro Plus	$99	40M

All plans include access to all 6 models. No hidden fees, no per-model surcharges.

What I Learned

1. Price matters more than brand. Most indie devs I talked to choose providers based on cost first, performance second. Being 5-10x cheaper than OpenAI/Anthropic is a real advantage.

2. One API key is a surprisingly big deal. Developers hate managing multiple credentials. The "just works" factor is real.

3. Global payments are hard. PayPal rejected our application 3 times. We eventually went with Paddle as our merchant of record — they handle VAT, sales tax, and payment methods in 100+ countries.

Try It Out

If you're an indie dev or solo builder tired of juggling multiple AI providers: dubhehub.com

Would love your feedback — what's your biggest pain point with AI APIs today?

Your AI API Bill Is Too High — And the Fix Is One Line of Code

Dubhe — Sun, 31 May 2026 08:11:44 +0000

Here's the cold truth: if you're building anything with LLMs right now, you're probably overpaying by 10-40x.

I was spending $1,200/month on GPT-4o for my side project. A friend showed me his DeepSeek V4 Flash bill: $31. Same workload. Same quality.

The fix? Change one line of code.

from openai import OpenAI

# Before
client = OpenAI(api_key="sk-...")

# After
client = OpenAI(
    api_key="sk-...",
    base_url="https://dubhehub.com/v1"  # ← this line
)

That's it. Same SDK. Same parameters. Same response format. Just a different endpoint.

The math that made me switch

Model	Input (per 1M)	Output (per 1M)	Cost vs GPT-4o
GPT-4o	$2.50	$10.00	—
dubhe-fast (DeepSeek V4)	$0.30	$0.60	~8-17x cheaper
dubhe-code (GLM-4.7)	$0.80	$3.00	~3x cheaper
dubhe-reasoning (DeepSeek V4 Pro)	$6.00	$18.00	~1.8-2.4x more expensive
dubhe-vision (multimodal)	$5.00	$15.00	—
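
If you want to check the comparison yourself, divide the GPT-4o price by each model's price. Here's a small sketch using the list prices from the table; the short dictionary keys and `ratio_vs_gpt4o` are just labels I picked for the example.

```python
# USD per 1M tokens (input, output), copied from the table above.
GPT_4O = (2.50, 10.00)
MODELS = {
    "fast": (0.30, 0.60),
    "code": (0.80, 3.00),
    "reasoning": (6.00, 18.00),
    "vision": (5.00, 15.00),
}


def ratio_vs_gpt4o(model: str) -> tuple[float, float]:
    """Return (input_ratio, output_ratio) relative to GPT-4o.

    A ratio above 1 means the model is cheaper than GPT-4o by that factor;
    a ratio below 1 means it costs more.
    """
    input_price, output_price = MODELS[model]
    return GPT_4O[0] / input_price, GPT_4O[1] / output_price


for name in MODELS:
    input_ratio, output_ratio = ratio_vs_gpt4o(name)
    print(f"{name}: input {input_ratio:.2f}x, output {output_ratio:.2f}x")
```

Ratios below 1 mean the model costs more per token than GPT-4o at these list prices. That is the case for the reasoning and vision tiers, so the savings come from sending most traffic to the fast and code tiers.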

What actually works

I've been running on this for a month. Here's what works:

Chat & streaming — identical. SSE, function calling, JSON mode, all work the same way.

Multi-model routing — the biggest hidden win. I use dubhe-fast for 80% of my traffic (chat, summarization) and switch to dubhe-code for code review. Total cost: ~$40/month instead of $1,200.

Vision — works with image URLs and base64. Same format as OpenAI.

What doesn't

Fine-tuning, Assistants API, and TTS aren't available yet. If you need those, keep an OpenAI key handy. For the other 95% of use cases, you won't notice the difference.

The real reason I'm not going back

My app does ~200K requests/month. With GPT-4o: $1,200. With multi-model routing: $33.42.
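
To put those two bills on a per-request footing, here's the arithmetic as a quick sketch. The request volume and both monthly totals are the ones quoted above; `per_request` is an illustrative helper.

```python
REQUESTS_PER_MONTH = 200_000  # "~200K requests/month" from above


def per_request(monthly_usd: float) -> float:
    """Average cost of a single request in USD."""
    return monthly_usd / REQUESTS_PER_MONTH


gpt4o = per_request(1200.00)
routed = per_request(33.42)
savings_factor = 1200.00 / 33.42

print(f"GPT-4o: ${gpt4o:.4f} per request")
print(f"Routed: ${routed:.6f} per request")
print(f"Roughly {savings_factor:.0f}x cheaper")
```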

The money I save in a month pays for my whole cloud infrastructure.

You can start with 100K free tokens — no credit card, no commitment. Test it with your actual workload. If the quality isn't there, you've lost nothing but 10 minutes.

Start building at dubhehub.com

Note: I'm the founder of Dubhe Hub. All numbers are from my actual usage.

How to Build a Multi-Model AI App in 10 Minutes (With Code)

Dubhe — Fri, 29 May 2026 05:34:43 +0000

Have you ever wanted to use different AI models for different tasks — fast for simple queries, reasoning for complex problems, vision for images — without managing multiple API keys?

Here's how to build a multi-model app in 10 minutes.

The Concept

Instead of picking one model, you route each request to the model that fits best:

Simple chat → Fast model ($0.30/M input tokens)
Code review → Code model ($0.80/M input tokens)
Image analysis → Vision model ($5.00/M input tokens)
Complex logic → Reasoning model ($6.00/M input tokens)

Step 1: Install the SDK

npm install openai

Step 2: Set Up the Client

import OpenAI from 'openai';

const dubhe = new OpenAI({
  baseURL: 'https://dubhehub.com/v1',
  apiKey: process.env.DUBHE_API_KEY
});

Step 3: Route Requests by Type

async function routeRequest(task, content) {
  const modelMap = {
    'chat': 'dubhe-fast',
    'code': 'dubhe-code',
    'reason': 'dubhe-reasoning',
    'vision': 'dubhe-vision'
  };

  return await dubhe.chat.completions.create({
    model: modelMap[task] || 'dubhe-fast',
    messages: [{ role: 'user', content }]
  });
}

Step 4: Use It

// Simple chat — costs ~$0.00022
const chat = await routeRequest('chat', 'Explain what Docker is');

// Code review — costs ~$0.0028  
const review = await routeRequest('code', 'Review this function for bugs: ...');

// Deep reasoning — costs ~$0.009
const analysis = await routeRequest('reason', 'Analyze this business case...');

Why This Matters

Switching between models lets you optimize for cost and quality at the same time. One API key, one SDK, multiple price points.

The free tier (100K tokens) is enough to build and test your entire prototype.

👉 Try it at dubhehub.com

Need an AI API? Here's 100K Free Tokens & 6 Models to Start Building

Dubhe — Thu, 28 May 2026 09:44:21 +0000

Building with AI shouldn't cost you a dime to start. Here's how to get 100K free tokens and access 6 different AI models through a single API — no credit card required.

What You Get for Free

100,000 tokens to test any of 6 models:

Model	Best For
dubhe-fast	General chat, text generation
dubhe-code	Programming, debugging
dubhe-agent	Function calling, tool use
dubhe-plus	Long context (up to 1M tokens)
dubhe-vision	Image analysis, OCR
dubhe-reasoning	Complex problem-solving

Getting Started in 30 Seconds

Step 1: Go to dubhehub.com and click "Get Started"

Step 2: Sign up with your email — no credit card needed

Step 3: Create an API key from your dashboard

Step 4: Start coding:

import openai

client = openai.OpenAI(
    base_url="https://dubhehub.com/v1",
    api_key="your-key-here"
)

response = client.chat.completions.create(
    model="dubhe-fast",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Why Start Here?

One API key for 6 models — no juggling multiple providers
Pay only if you scale — 100K free tokens, then pay-as-you-go from $0.30/M tokens
OpenAI-compatible — same SDK, just change the base URL
No credit card — really. Just sign up and build.

The free tier is enough to build a prototype, test different models, and decide what works for your use case. When you're ready to scale, your API key and code stay exactly the same.

👉 Start building at dubhehub.com

Hey DEV! Building an affordable multi-model API — ask me anything 🐉

Dubhe — Thu, 28 May 2026 06:18:45 +0000

Hey everyone! 👋

I'm the founder of Dubhe Hub — an OpenAI-compatible API gateway that gives devs access to 6 different AI models (from fast chat to deep reasoning + vision) through a single endpoint.

Some background:

I've been building developer tools for a while
I noticed teams were either stuck paying OpenAI prices or juggling 3-4 different API keys
So I built Dubhe: one API key, 6 models, pay-as-you-go pricing starting from free

I'm here to share what I'm learning about:

Building and scaling an API platform
Multi-model routing strategies
Cost optimization for AI apps
Real-world AI usage patterns (not hype)

Happy to answer any questions about running an AI API or building in this space. Also always open to feedback!

👉 dubhehub.com — free tier available (100K tokens to start)

6 AI Models, 6 Use Cases: A Developer's Guide to Choosing the Right One

Dubhe — Thu, 28 May 2026 03:17:13 +0000

When you're building with LLMs, there's no single "best" model. The best model depends entirely on what you're building.

Here's a practical guide to which model fits which job — with real code examples and cost breakdowns.

The 6 Model Types

⚡ Fast — General Chat & Text

Best for: Customer support chatbots, content generation, summarization

This is your workhorse. Fast responses, lowest cost. Use it for the 80% of requests that don't need deep reasoning.

const completion = await client.chat.completions.create({
  model: 'dubhe-fast', // $0.30/M input
  messages: [{ role: 'user', content: 'Write a product description' }]
});

Cost per request: ~$0.00022
When to use: Default for all simple queries

💻 Code — Programming Tasks

Best for: Code generation, debugging, code review

Specialized for programming. Handles complex code patterns and multiple languages.

const completion = await client.chat.completions.create({
  model: 'dubhe-code', // $0.80/M input
  messages: [{ role: 'user', content: 'Write a React hook that debounces API calls' }]
});

🤖 Agent — Automation

Best for: Function calling, multi-step workflows, tool use

Optimized for agentic patterns — structured outputs, tool calling, and complex instructions.

⊕ Plus — Long Context

Best for: Document analysis, full-codebase review, long conversations

Handles up to 1M tokens of context.

👁 Vision — Image Understanding

Best for: Image analysis, OCR, screenshot understanding

const completion = await client.chat.completions.create({
  model: 'dubhe-vision',
  messages: [
    { role: 'user', content: [
      { type: 'text', text: 'What error is shown?' },
      { type: 'image_url', image_url: { url: 'https://example.com/error.png' } }
    ]}
  ]
});

🧠 Reasoner — Deep Reasoning

Best for: Complex problem-solving, math, logic, analysis

Spends extra tokens on "thinking" before it answers.

Decision Tree

Is the task code-related?
  → Yes → Use Code model
  → No → Does it need images?
    → Yes → Use Vision model
    → No → Complex reasoning?
      → Yes → Use Reasoner model
      → No → Long context needed?
        → Yes → Use Plus model
        → No → Agentic (tool use)?
          → Yes → Use Agent model
          → No → Use Fast model
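
The decision tree above translates almost line for line into code. Here's a sketch in Python: the `Task` flags and `choose_model` are hypothetical names for illustration, and you'd still need your own logic to decide which flags apply to a given request.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Flags describing a request. How you set them is up to your app."""
    is_code: bool = False
    has_images: bool = False
    needs_reasoning: bool = False
    long_context: bool = False
    uses_tools: bool = False


def choose_model(task: Task) -> str:
    """Walk the decision tree top to bottom and return a model name."""
    if task.is_code:
        return "dubhe-code"
    if task.has_images:
        return "dubhe-vision"
    if task.needs_reasoning:
        return "dubhe-reasoning"
    if task.long_context:
        return "dubhe-plus"
    if task.uses_tools:
        return "dubhe-agent"
    return "dubhe-fast"


print(choose_model(Task(has_images=True)))
print(choose_model(Task()))
```

Because the checks run in order, a task that is both code-related and image-based goes to the Code model, exactly as the tree reads.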

Cost Strategy

Route simple queries to Fast model (saves 10-20x)
Route coding questions to Code model
Fall back to Reasoner only when uncertain
Use Vision only when images are submitted

This approach can cut API costs by 60-80%.
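
The "fall back to Reasoner only when uncertain" rule can be sketched as a two-step call: ask the Fast model first, and re-ask the Reasoning model only if the first reply looks shaky. Everything here is illustrative. `answer_with_escalation`, `fake_ask`, and `looks_uncertain` are made-up names, and the uncertainty check is a deliberately naive stand-in for whatever signal your app actually uses.

```python
from typing import Callable


def answer_with_escalation(
    prompt: str,
    ask: Callable[[str, str], str],
    is_uncertain: Callable[[str], bool],
) -> tuple[str, str]:
    """Ask the Fast model first; re-ask the Reasoning model only if needed.

    Returns (model_used, reply).
    """
    reply = ask("dubhe-fast", prompt)
    if is_uncertain(reply):
        return "dubhe-reasoning", ask("dubhe-reasoning", prompt)
    return "dubhe-fast", reply


# Stand-ins so the sketch runs offline. Both are hypothetical.
def fake_ask(model: str, prompt: str) -> str:
    return "I'm not sure." if model == "dubhe-fast" else "42"


def looks_uncertain(reply: str) -> bool:
    return "not sure" in reply.lower()


model, reply = answer_with_escalation("What is 6 * 7?", fake_ask, looks_uncertain)
print(model, reply)
```

Note that an escalated request pays for both calls, so this only saves money when most requests stop at the Fast model.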

Try It Yourself

All these models are available through a single API endpoint at Dubhe Hub — just change the model name in your code.

Pricing starts at $0.30/M input tokens. Free tier: 100K tokens to test before committing.

How to Switch From OpenAI to a Multi-Model AI API in 5 Minutes

Dubhe — Wed, 27 May 2026 07:27:36 +0000

If you're like most developers, you probably started with OpenAI's API. It's the default choice — great documentation, solid models. But as your project grows, you might notice some friction:

Costs add up fast. A single chat-heavy app can burn through hundreds of dollars.
No fallback. If OpenAI goes down, your app goes down.
One model doesn't fit all. Sometimes you need a fast model for simple queries and a reasoning model for complex tasks.

The solution? A multi-model API gateway that gives you access to multiple providers through a single OpenAI-compatible endpoint.

Dubhe Hub is powered by some of the most advanced AI models from China's leading labs, giving you enterprise-grade performance at a fraction of the usual cost.

What You'll Need

An API key from Dubhe Hub (free tier includes 100K tokens)
Any OpenAI SDK (Python, Node.js, curl — they all work)

Step 1: Get Your API Key

Go to dubhehub.com, sign up, and create an API key. The dashboard shows you six models to choose from.

Step 2: Make Your First Call

Here's a Node.js example. Notice that the code looks exactly like OpenAI's SDK:

import OpenAI from 'openai';

const dubhe = new OpenAI({
  baseURL: 'https://dubhehub.com/v1',
  apiKey: 'your-key-here'
});

// Fast model for quick responses
const quick = await dubhe.chat.completions.create({
  model: 'dubhe-fast',
  messages: [{ role: 'user', content: 'Explain REST APIs in 3 bullet points' }]
});
console.log(quick.choices[0].message.content);

Step 3: Swap Models Without Changing Code

The real power is switching models by changing just the model name:

// Use dubhe-reasoning for complex logic
const analysis = await dubhe.chat.completions.create({
  model: 'dubhe-reasoning', 
  messages: [{ role: 'user', content: 'Write a debounce function in TypeScript' }]
});

// Use dubhe-vision for image analysis
const vision = await dubhe.chat.completions.create({
  model: 'dubhe-vision',
  messages: [
    { role: 'user', content: [
      { type: 'text', text: "What's in this image?" },
      { type: 'image_url', image_url: { url: 'https://example.com/photo.jpg' } }
    ]}
  ]
});

Model Overview

Model	Best For	Input/1M tokens	Output/1M tokens	Context
dubhe-fast	General chat, text	$0.30	$0.60	1M
dubhe-code	Programming tasks	$0.80	$3.00	200K
dubhe-agent	Automation, agents	$1.00	$4.00	205K
dubhe-plus	Long context, analysis	$2.00	$8.00	1M
dubhe-vision	Image understanding	$5.00	$15.00	1M
dubhe-reasoning	Deep reasoning	$6.00	$18.00	1M
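
Since the context windows differ, it can help to check that a prompt fits before picking a model. Here's a small sketch in Python. The window sizes come from the table above, reading "1M" as 1,000,000 tokens and "200K" as 200,000 (an assumption about the exact limits), and `models_that_fit` is a hypothetical helper.

```python
# Context windows in tokens, taken from the table above.
# Assumption: "1M" means 1,000,000 and "200K" means 200,000.
CONTEXT_WINDOW = {
    "dubhe-fast": 1_000_000,
    "dubhe-code": 200_000,
    "dubhe-agent": 205_000,
    "dubhe-plus": 1_000_000,
    "dubhe-vision": 1_000_000,
    "dubhe-reasoning": 1_000_000,
}


def models_that_fit(prompt_tokens: int, reserved_for_output: int = 0) -> list[str]:
    """Return the models whose context window can hold the request."""
    needed = prompt_tokens + reserved_for_output
    return [name for name, window in CONTEXT_WINDOW.items() if window >= needed]


# A 300K-token prompt rules out the Code and Agent models.
print(models_that_fit(300_000))
```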

Why Multi-Model?

Cost optimization: Use cheap models for simple tasks, expensive ones only when needed
Provider diversity: If one provider has an outage, switch to another instantly
Performance matching: Match model capability to task complexity

The migration takes about 5 minutes — just change the baseURL and apiKey. Everything else (streaming, function calling, embeddings) works the same way.

Try it yourself at dubhehub.com.