DEV Community

4663437Mehdi

Posted on • Originally published at 4663437mehdi.github.io

Token Ledger — 2026-06-26

The Token Ledger Digest – 2026-06-26

Lead change – biggest cost impact

  • MiniMax: MiniMax M2.5
    • Prompt price fell from $0.15/1M to $0.12/1M (‑20%).
    • Completion price fell from $0.90/1M to $0.48/1M (‑47%).
    • Who should care: Teams running high‑volume completion workloads on MiniMax M2.5. Costs drop by $0.42 per 1M completion tokens and by $0.03 per 1M prompt tokens.
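
To translate the cut into a concrete bill, the percentages and per-million deltas above can be reproduced and applied to a workload. The following is a minimal Python sketch: the prices come from this digest, while the monthly token volumes and the helper names `pct_change` and `workload_cost` are hypothetical and only there for illustration.

```python
def pct_change(old: float, new: float) -> float:
    """Relative price change in percent (negative means a price cut)."""
    return (new - old) / old * 100


def workload_cost(prompt_price: float, completion_price: float,
                  prompt_tokens: int, completion_tokens: int) -> float:
    """Cost in USD, with both prices quoted per one million tokens."""
    total = prompt_price * prompt_tokens + completion_price * completion_tokens
    return total / 1_000_000


# Assumed monthly volume (hypothetical, not from the digest).
PROMPT_TOKENS = 2_000_000_000
COMPLETION_TOKENS = 500_000_000

# MiniMax M2.5 prices before and after the change, as listed above.
old_cost = workload_cost(0.15, 0.90, PROMPT_TOKENS, COMPLETION_TOKENS)
new_cost = workload_cost(0.12, 0.48, PROMPT_TOKENS, COMPLETION_TOKENS)

print(f"prompt change:     {pct_change(0.15, 0.12):.1f}%")
print(f"completion change: {pct_change(0.90, 0.48):.1f}%")
print(f"monthly cost: ${old_cost:.2f} -> ${new_cost:.2f}")
```

Because the completion cut is much deeper than the prompt cut, the overall saving depends on the prompt-to-completion mix; swap in your own token counts to see the effect.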

Other price adjustments

  • MiniMax: MiniMax M2.7
    • Prompt: $0.24 → $0.18/1M (‑25%).
    • Completion: $0.96 → $0.72/1M (‑25%).
    • Relevant for latency‑sensitive apps already using this model: prompt and completion rates drop by the same 25%.
  • NVIDIA: Nemotron 3 Super
    • Prompt: $0.09 → $0.085/1M (‑5.6%).
    • Completion: $0.45 → $0.40/1M (‑11%).
    • Affects users of NVIDIA-hosted Nemotron workloads.
  • OpenAI: gpt-oss-120b
    • Prompt: $0.039 → $0.03/1M (‑23%).
    • Completion: $0.18 → $0.15/1M (‑17%).
    • Notable for cost‑conscious developers on OpenAI’s open‑weight offering.

No model additions or removals today.

Three cheapest models (per‑million‑token, listed in order of prompt price)

  1. inclusionAI: Ling-2.6-flash – Prompt $0.01/1M, Completion $0.03/1M.
  2. IBM: Granite 4.0 Micro – Prompt $0.017/1M, Completion $0.112/1M.
  3. Meta: Llama 3.1 8B Instruct – Prompt $0.02/1M, Completion $0.03/1M.
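
The order above follows prompt price, but the cheapest model for a given workload also depends on how many completion tokens it generates. As a rough sketch, the three models can be re-ranked by a blended price; the `prompt_share` parameter (the fraction of all tokens that are prompt tokens) and the function names are assumptions made for illustration, not part of the digest's methodology.

```python
# (prompt price, completion price) in USD per 1M tokens, as listed above.
MODELS = {
    "inclusionAI: Ling-2.6-flash": (0.01, 0.03),
    "IBM: Granite 4.0 Micro": (0.017, 0.112),
    "Meta: Llama 3.1 8B Instruct": (0.02, 0.03),
}


def blended_price(prompt_price: float, completion_price: float,
                  prompt_share: float) -> float:
    """Average price per 1M tokens for a given prompt/completion mix."""
    return prompt_price * prompt_share + completion_price * (1 - prompt_share)


def rank(prompt_share: float) -> list[str]:
    """Model names sorted from cheapest to most expensive blended price."""
    return sorted(
        MODELS,
        key=lambda name: blended_price(*MODELS[name], prompt_share),
    )


# Prompt-only traffic reproduces the order shown in the list.
print(rank(1.0))
# With a quarter of the tokens being completions, Llama 3.1 8B Instruct
# moves ahead of Granite 4.0 Micro.
print(rank(0.75))
```

With these prices, Granite 4.0 Micro only undercuts Llama 3.1 8B Instruct when roughly 96% or more of the tokens are prompt tokens, so completion-heavy workloads may want to rank on the blended figure instead.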

Total models tracked: 339.


Originally published at The Token Ledger. Subscribe for the daily digest.
