DEV Community

Cover image for China's Biggest AI Just Started Charging Users. DeepSeek Cut API Prices the Same Week.
tokenmixai
tokenmixai

Posted on • Originally published at tokenmix.ai

China's Biggest AI Just Started Charging Users. DeepSeek Cut API Prices the Same Week.

If you've been wondering when the "Chinese AI free-forever" era would end, the answer landed on May 4, 2026 with almost no fanfare. ByteDance updated Doubao's Apple App Store page with three paid tiers — 68元 ($9.5)/200元 ($28)/500元 ($70) per month — and let it sit for almost four weeks before the Chinese tech press caught it on June 1.

DeepSeek spent the same window cutting V4-Flash to 1元 per million input tokens (~$0.14).

Two of China's biggest AI labs just publicly committed to opposite theories of how to make this business work. Both are real bets. Both will probably be right for different reasons. And neither directly raises your API bill if you're building outside China — but the macro signal matters.

TL;DR

  • Doubao (ByteDance, 345M monthly users) launched 3-tier paid C-end subscription: $9.5 / $28 / $70 per month. Free tier preserved.
  • 120 trillion daily tokens consumed — up from ~60T three months ago. Estimated $3-5M daily inference cost.
  • DeepSeek cut V4-Flash pricing the same week. Opposite strategy: race to the API floor.
  • Your stack doesn't change if you build on Chinese model APIs internationally — Doubao API rates are unaffected by consumer subscription.
  • What does change: ByteDance just signaled that even the largest Chinese consumer AI provider needs revenue mechanisms. Free forever was always temporary.

The pricing in plain numbers

ByteDance verified across three Chinese tech outlets (36Kr, Sina Finance, The Paper). The Apple App Store filing is the primary source:

Tier Monthly RMB Monthly USD Annual RMB Annual USD
Standard 68 $9.5 688 $95
Enhanced 200 $28 2,048 $285
Pro 500 $70 5,088 $710

For reference:

  • ChatGPT Plus: $20
  • ChatGPT Pro: $200
  • Claude Pro: $20
  • Claude Max: $100-$200
  • Google AI Plus: $8
  • Google AI Ultra: $99.99

Doubao Standard at $9.5 slots between ChatGPT Go ($8) and ChatGPT Plus ($20). Doubao Pro at $70 is materially cheaper than the closest Western premium tier (Google AI Ultra at $100, ChatGPT Pro at $200).

Free tier survives. ByteDance was explicit: daily chat, Q&A, content writing, simple image generation stay free. The premium tiers are positioned as additive features (PPT generation at scale, data analysis, video editing — workloads the 36Kr coverage explicitly flags as "professional users burning tokens daily").

The cost math nobody talks about

Here's the number that drove this entire decision:

120 trillion tokens per day.

Three months ago it was ~60T/day. That growth curve is doubling every quarter. In industry inference cost estimates, 120T daily tokens translates to roughly 50,000-80,000 H100 GPU equivalents and $3-5M in daily inference cost.

ByteDance's 2026 AI budget got raised from 160B to 200B RMB ($28B) — about $76M/day in total AI spend including capex, opex, and talent. Inference alone is one of the larger line items.

If 1% of Doubao's 345M users convert to paid at an average ~700元/year, that's:

345,000,000 × 1% × 700 = 23.7 billion RMB/year
                       = ~$3.3 billion ARR
Enter fullscreen mode Exit fullscreen mode

Now compare: OpenAI ran ~$25B ARR in 2024 against ~$5B operating loss. So even with strong conversion, subscription revenue may not fully cover total inference cost at scale. Doubao's subscription play is partial offset, not full cost coverage.

The lesson Western devs should take from this: the "free forever" era was never going to scale. The only question was whether monetization arrived as price cuts (DeepSeek's bet), consumer subscriptions (Doubao's bet), or premium tiers (Anthropic's Mythos play).

What this means if you're a developer

Building on Chinese model APIs?

# If you're using Doubao API today:
# - No price change
# - No throttling change
# - No feature removal
# - Continue normally

client = OpenAI(
    base_url="https://api.tokenmix.ai/v1",  # or Volcengine direct
    api_key=os.getenv("DOUBAO_KEY")
)
# Cost-per-million-tokens stays exactly the same as last week
Enter fullscreen mode Exit fullscreen mode

The consumer subscription only affects the Doubao consumer app on iOS/Android. API customers (you) are completely unaffected.

Watching Chinese AI as a market signal?

This is the inflection point. The pattern I'd expect over the next 6 months:

Provider Likely move Why
Kimi Hold tiers, may compress price ranges Already had 39-559元 tiers; Doubao validates the structure
Zhipu (ChatGLM) Already executing — both C-end VIP + API price hikes Most aggressive monetization path
Qwen (Alibaba) Launch C-end + commerce-bundled tier Alibaba ecosystem leverage
MiniMax Maintain overseas focus Won't follow Doubao domestically
DeepSeek Continue API price cuts Explicit strategy divergence

For builders, the takeaway is split: if you depend on Chinese model APIs, route through stable providers (Volcengine, DeepSeek, gateway aggregators). If you care about Chinese model app UX for end-user products, plan for a less-free, more-segmented landscape.

Cross-referencing global pricing pressure

Doubao going paid doesn't directly raise Western consumer AI prices, but it removes the "but Chinese AI is free, so we can't charge more" argument from product debates. Expect modest upward pressure on ChatGPT Plus, Claude Pro, and Gemini consumer tiers over the next 6-12 months as competitive ground for "free is sustainable" disappears.

For B2B API customers — you and me — the dynamic is opposite. The same week Doubao went paid on the consumer side, DeepSeek cut V4-Flash to 1元 per million tokens input. That's roughly $0.14. For comparison, GPT-5.5 is $5/M and Claude Opus 4.8 is $5/M. The price war on API rates continues independent of consumer subscription rollouts.

Two theories, one cost structure

The most interesting part of all this is watching three different theories of AI monetization compete in public:

Theory Champion Mechanism
Consumer subscription pays for compute Doubao, ChatGPT Plus High-volume, low-margin C-end
Premium tier extracts value from heavy users Anthropic Mythos, ChatGPT Pro Specialized capability at premium price
API price war forces volume DeepSeek, Qwen on B-end Race to zero on per-token cost

All three theories have the same underlying cost structure (inference is expensive, demand is growing exponentially). The difference is which side of the supply-demand equation they're betting will give first.

My read after a year of watching this: consumer subscription wins on ARR, API price wars win on developer mindshare, premium tiers win on margin. The interesting companies are running all three plays simultaneously — Anthropic is doing exactly that with Claude free / Pro / Max + Mythos-class + API pricing tiers.

What I'm watching this week

For developers building right now:

  1. Don't refactor your Chinese API stack. No price change is coming. Doubao API rates hold.
  2. Watch Kimi, Zhipu, Qwen for C-end follow-ons. Expect 2-3 announcements over the next 8 weeks.
  3. Lock your DeepSeek price baseline. API price war means the floor keeps dropping — but only if you have a baseline to measure against.
  4. Plan abstraction layers. When pricing structures diverge this quickly, hard-coded model strings are technical debt. Use config-driven model selection.
# Bad — locks you to one provider's price point
client.chat.completions.create(model="doubao-pro", ...)

# Good — survives pricing structure changes
MODEL = os.getenv("LLM_MODEL", "doubao-pro")
client.chat.completions.create(model=MODEL, ...)
Enter fullscreen mode Exit fullscreen mode

If you want to swap between Chinese (Doubao, Kimi, Qwen, DeepSeek) and Western (OpenAI, Anthropic, Google) models through one OpenAI-compatible endpoint without managing six API keys, that's roughly what TokenMix does. (Disclosure: I work on the research side — the full data-cited breakdown is on the original article.)

Bottom line

Doubao going paid is the most important Chinese AI commercialization signal of 2026. It doesn't immediately change your stack if you're building outside China. It does signal that "free forever" was always temporary, and the question of how AI labs make money is moving from theory to public bet.

Three theories now competing in real time. The next 6 months will tell us which one (or which combination) actually pays the bills at frontier-model scale.

What's your read — is Doubao's bet the right one, or is DeepSeek's API price-floor strategy going to outlast it? Drop a comment.

Top comments (0)