The Token Ledger Digest – 2026-06-26
Lead change – biggest cost impact
-
MiniMax: MiniMax M2.5
- Prompt price fell from $0.15/1M to $0.12/1M (‑20%).
- Completion price fell from $0.90/1M to $0.48/1M (‑47%).
- Who should care: Teams running high‑volume completion workloads on MiniMax M2.5 see per‑million‑token costs drop by $0.42 for completions and $0.03 for prompts.
Other price adjustments
-
MiniMax: MiniMax M2.7
- Prompt: $0.24 → $0.18/1M (‑25%).
- Completion: $0.96 → $0.72/1M (‑25%).
- Relevant for latency‑sensitive apps using this model.
-
NVIDIA: Nemotron 3 Super
- Prompt: $0.09 → $0.085/1M (‑5.6%).
- Completion: $0.45 → $0.40/1M (‑11%).
- Affects users of NVIDIA-hosted Nemotron workloads.
-
OpenAI: gpt-oss-120b
- Prompt: $0.039 → $0.03/1M (‑23%).
- Completion: $0.18 → $0.15/1M (‑17%).
- Notable for cost‑conscious developers on OpenAI’s open‑source offering.
No model additions or removals today.
Three cheapest models (per‑million‑token)
- inclusionAI: Ling-2.6-flash – Prompt $0.01/1M, Completion $0.03/1M.
- IBM: Granite 4.0 Micro – Prompt $0.017/1M, Completion $0.112/1M.
- Meta: Llama 3.1 8B Instruct – Prompt $0.02/1M, Completion $0.03/1M.
Total models tracked: 339.
Originally published at The Token Ledger. Subscribe for the daily digest.
Top comments (0)