AI API Pricing Digest – 2026-06-03
Most cost‑impacting change
- inclusionAI: Ring-2.6-1T – Prompt price fell from $0.30 / 1M to $0.075 / 1M (‑75 %); completion price fell from $2.50 / 1M to $0.625 / 1M (‑75 %). Who should care: Teams running high‑volume inference on this model can cut token costs by three‑quarters.
Other price changes
- Z.ai: GLM 5 – Prompt unchanged at $0.60 / 1M; completion price dropped from $2.08 / 1M to $1.92 / 1M (‑7.7 %). Who should care: Users sensitive to completion cost see modest savings.
Added model
- OpenRouter: Fusion – Prompt and completion prices listed as ‑$1.00 / token (placeholder indicating free access). Who should care: Developers seeking a zero‑cost experimental model; verify actual billing before production use.
Cheapest models today (per‑million‑token rates)
- inclusionAI: Ling-2.6-flash – Prompt $0.01 / 1M, Completion $0.03 / 1M
- IBM: Granite 4.0 Micro – Prompt $0.017 / 1M, Completion $0.112 / 1M
- Meta: Llama 3.1 8B Instruct – Prompt $0.02 / 1M, Completion $0.05 / 1M
Total models tracked: 343.
Originally published at The Token Ledger. Subscribe for the daily digest.
Top comments (0)