DEV Community

4663437Mehdi
4663437Mehdi

Posted on • Originally published at 4663437mehdi.github.io

Token Ledger Digest – 2026-05-20

Token Ledger Digest – 2026-05-20

Lead change – biggest cost impact

  • Google Gemini Flash Latest (~google/gemini-flash-latest)
    • Prompt price rose from $0.50/1M to $1.50/1M (+$1.00/1M).
    • Completion price rose from $3.00/1M to $9.00/1M (+$6.00/1M).
    • Who should care: Teams running high‑volume inference on this model will see per‑million‑token costs jump by $7.00; consider alternatives or prompt‑completion optimization.

Other price changes

  • Z.ai GLM 5.1 (z-ai/glm-5.1)

    • Prompt price dropped from $0.98/1M to $0.00/1M.
    • Completion price dropped from $3.08/1M to $0.00/1M.
    • Who should care: Users can now run this model at zero token cost; ideal for cost‑sensitive prototypes or batch workloads.
  • Qwen: Qwen3.6 35B A3B (qwen/qwen3.6-35b-a3b)

    • Prompt price fell slightly from $0.15/1M to $0.149/1M (‑$0.001/1M).
    • Completion price unchanged at $1.00/1M.
    • Who should care: Negligible impact; monitor for further drift.
  • Qwen: Qwen3.5‑35B‑A3B (qwen/qwen3.5-35b-a3b)

    • Prompt price fell from $0.14/1M to $0.139/1M (‑$0.001/1M).
    • Completion price unchanged at $1.00/1M.
    • Who should care: Minimal effect; no action needed.

New model added

  • Google Gemini 3.5 Flash (google/gemini-3.5-flash)
    • Prompt price: $1.50/1M.
    • Completion price: $9.00/1M.
    • Context window: 1,048,576 tokens.
    • Who should care: Developers needing very long contexts; compare pricing against other long‑context options.

Summary

Total models tracked: 357. No other meaningful changes today.


Originally published at The Token Ledger. Subscribe for the daily digest.

Top comments (0)