DEV Community

4663437Mehdi

Posted on Jun 26 • Originally published at 4663437mehdi.github.io

Token Ledger — 2026-06-26

#ai #llm #api #news

The Token Ledger Digest – 2026-06-26

Lead change – biggest cost impact

MiniMax: MiniMax M2.5
- Prompt price fell from $0.15/1M to $0.12/1M (‑20%).
- Completion price fell from $0.90/1M to $0.48/1M (‑47%).
- Who should care: Teams running high‑volume completion workloads on MiniMax M2.5 see per‑million‑token costs drop by $0.42 for completions and $0.03 for prompts.
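
The percentages and per-million savings above can be reproduced with a few lines of arithmetic. This is a minimal sketch; the helper name `pct_change` is made up for illustration, and the prices are the MiniMax M2.5 figures quoted in this digest.

```python
def pct_change(old: float, new: float) -> float:
    """Relative change from old to new in percent; negative means a price cut."""
    return (new - old) / old * 100.0

# MiniMax M2.5 prices in USD per 1M tokens, as listed above.
old_prompt, new_prompt = 0.15, 0.12
old_completion, new_completion = 0.90, 0.48

prompt_pct = pct_change(old_prompt, new_prompt)
completion_pct = pct_change(old_completion, new_completion)

# Absolute saving per 1M tokens.
prompt_saving = old_prompt - new_prompt
completion_saving = old_completion - new_completion

print(f"prompt: {prompt_pct:.0f}% (${prompt_saving:.2f} less per 1M)")
print(f"completion: {completion_pct:.0f}% (${completion_saving:.2f} less per 1M)")
```

The completion cut works out to about 46.7%, which the digest rounds to 47%.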

Other price adjustments

MiniMax: MiniMax M2.7
- Prompt: $0.24 → $0.18/1M (‑25%).
- Completion: $0.96 → $0.72/1M (‑25%).
- Relevant for latency‑sensitive apps using this model.

NVIDIA: Nemotron 3 Super
- Prompt: $0.09 → $0.085/1M (‑5.6%).
- Completion: $0.45 → $0.40/1M (‑11%).
- Affects users of NVIDIA-hosted Nemotron workloads.

OpenAI: gpt-oss-120b
- Prompt: $0.039 → $0.03/1M (‑23%).
- Completion: $0.18 → $0.15/1M (‑17%).
- Notable for cost‑conscious developers using OpenAI’s open‑weight model.
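
To see what these smaller cuts mean for a bill, the sketch below prices one fixed workload before and after each change. The monthly volume (200M prompt tokens, 50M completion tokens) is a made-up assumption for illustration, not a figure from the digest; the per-1M prices are the ones listed above.

```python
# (old_prompt, old_completion, new_prompt, new_completion) in USD per 1M tokens,
# copied from the entries above.
PRICES = {
    "MiniMax M2.7": (0.24, 0.96, 0.18, 0.72),
    "Nemotron 3 Super": (0.09, 0.45, 0.085, 0.40),
    "gpt-oss-120b": (0.039, 0.18, 0.03, 0.15),
}

# Hypothetical monthly volume in millions of tokens (an assumption).
PROMPT_MTOK = 200
COMPLETION_MTOK = 50

def workload_cost(prompt_price: float, completion_price: float) -> float:
    """Monthly cost in USD for the assumed token volume."""
    return prompt_price * PROMPT_MTOK + completion_price * COMPLETION_MTOK

savings = {}
for model, (old_p, old_c, new_p, new_c) in PRICES.items():
    before = workload_cost(old_p, old_c)
    after = workload_cost(new_p, new_c)
    savings[model] = before - after
    print(f"{model}: ${before:.2f} -> ${after:.2f} (saves ${savings[model]:.2f})")
```

Changing `PROMPT_MTOK` and `COMPLETION_MTOK` to match a real traffic mix shows quickly whether a given cut is worth a migration.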

No model additions or removals today.

Three cheapest models (prices per 1M tokens, listed by prompt price)

inclusionAI: Ling-2.6-flash – Prompt $0.01/1M, Completion $0.03/1M.
IBM: Granite 4.0 Micro – Prompt $0.017/1M, Completion $0.112/1M.
Meta: Llama 3.1 8B Instruct – Prompt $0.02/1M, Completion $0.03/1M.
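
Which of these three is cheapest in practice depends on the prompt-to-completion mix. The sketch below ranks them by a blended price; the `prompt_share` parameter and the helper names are assumptions introduced here, and the comparison covers only the three models listed above, not the full tracked set.

```python
# (prompt, completion) prices in USD per 1M tokens, as listed above.
CHEAPEST = {
    "Ling-2.6-flash": (0.01, 0.03),
    "Granite 4.0 Micro": (0.017, 0.112),
    "Llama 3.1 8B Instruct": (0.02, 0.03),
}

def blended_price(prompt_price: float, completion_price: float,
                  prompt_share: float) -> float:
    """Average USD per 1M tokens when prompt_share of all tokens are prompt tokens."""
    return prompt_price * prompt_share + completion_price * (1.0 - prompt_share)

def rank(prompt_share: float) -> list[str]:
    """Model names sorted from cheapest to most expensive for the given mix."""
    return sorted(
        CHEAPEST,
        key=lambda name: blended_price(*CHEAPEST[name], prompt_share),
    )

print(rank(1.0))   # prompt tokens only
print(rank(0.75))  # three prompt tokens for every completion token
```

With prompt tokens only, the order matches the list above. At a 3:1 prompt-to-completion mix, Llama 3.1 8B Instruct moves ahead of Granite 4.0 Micro because of Granite's higher completion price.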

Total models tracked: 339.

Originally published at The Token Ledger. Subscribe for the daily digest.