DEV Community

4663437Mehdi

Originally published at 4663437mehdi.github.io

AI API Pricing Digest – 2026-06-03

Most cost‑impacting change

  • inclusionAI: Ring-2.6-1T – Prompt price fell from $0.30 / 1M to $0.075 / 1M (‑75 %); completion price fell from $2.50 / 1M to $0.625 / 1M (‑75 %). Who should care: Teams running high‑volume inference on this model can cut token costs by three‑quarters.
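The percentages above follow from the usual relative-change formula. As a rough sketch, the snippet below recomputes them and applies both price points to a workload. The monthly token volumes are hypothetical, chosen only for illustration; the prices are the ones quoted in the bullet.

```python
def percent_change(old: float, new: float) -> float:
    """Relative change from old to new, in percent (negative means a drop)."""
    return (new - old) / old * 100.0


def workload_cost(prompt_tokens: int, completion_tokens: int,
                  prompt_per_m: float, completion_per_m: float) -> float:
    """Dollar cost of a workload, given prices in $ per 1M tokens."""
    return (prompt_tokens / 1e6 * prompt_per_m
            + completion_tokens / 1e6 * completion_per_m)


# Hypothetical monthly volume (an assumption, not from the digest).
PROMPT_TOKENS = 2_000_000_000
COMPLETION_TOKENS = 500_000_000

# Ring-2.6-1T prices before and after the change, as quoted above.
old_cost = workload_cost(PROMPT_TOKENS, COMPLETION_TOKENS, 0.30, 2.50)
new_cost = workload_cost(PROMPT_TOKENS, COMPLETION_TOKENS, 0.075, 0.625)

print(f"prompt change:     {percent_change(0.30, 0.075):.1f}%")
print(f"completion change: {percent_change(2.50, 0.625):.1f}%")
print(f"monthly cost:      ${old_cost:,.2f} -> ${new_cost:,.2f}")
```

Because both rates fell by the same fraction, the total bill falls by 75 % regardless of the prompt-to-completion mix; that would not hold if the two rates had moved by different amounts.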

Other price changes

  • Z.ai: GLM 5 – Prompt unchanged at $0.60 / 1M; completion price dropped from $2.08 / 1M to $1.92 / 1M (‑7.7 %). Who should care: Users sensitive to completion cost see modest savings.

Added model

  • OpenRouter: Fusion – Prompt and completion prices listed as ‑$1.00 / token. A negative price is a placeholder, not a real rate; it may indicate free access, but it can also mean variable or not‑yet‑published pricing. Who should care: Developers looking for a zero‑cost experimental model; verify actual billing before production use.
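A negative per-token price cannot be a real rate, so any script that consumes listings like this one should treat it as "unknown" instead of feeding it into cost arithmetic. A minimal sketch follows, assuming prices arrive as dollars per token; the function name `normalize_price` is hypothetical and not part of any provider's API.

```python
from typing import Optional


def normalize_price(raw_per_token: float) -> Optional[float]:
    """Convert a $/token price to $/1M tokens.

    Negative values are treated as placeholders and mapped to None,
    so they never get mistaken for a discount or a free tier.
    """
    if raw_per_token < 0:
        return None
    return raw_per_token * 1_000_000


fusion = normalize_price(-1.0)    # placeholder listing -> None
free = normalize_price(0.0)       # an actual free listing -> 0.0
paid = normalize_price(0.0000006) # roughly $0.60 per 1M tokens

for label, price in [("placeholder", fusion), ("free", free), ("paid", paid)]:
    shown = "unknown (check billing)" if price is None else f"${price:.2f} / 1M"
    print(f"{label}: {shown}")
```

Keeping `None` distinct from `0.0` keeps a placeholder separate from a model that is actually priced at zero.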

Cheapest models today (per‑million‑token rates)

  1. inclusionAI: Ling-2.6-flash – Prompt $0.01 / 1M, Completion $0.03 / 1M
  2. IBM: Granite 4.0 Micro – Prompt $0.017 / 1M, Completion $0.112 / 1M
  3. Meta: Llama 3.1 8B Instruct – Prompt $0.02 / 1M, Completion $0.05 / 1M
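The digest does not state how it ranks "cheapest"; the order above matches ascending prompt price. As a sketch of how the ranking can shift with the workload, the snippet below also sorts the same three models by a blended rate. The 75 % prompt share is an assumed traffic mix, not a figure from the digest.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    name: str
    prompt_per_m: float      # $ per 1M prompt tokens
    completion_per_m: float  # $ per 1M completion tokens


# Rates copied from the list above.
MODELS = [
    ModelPrice("inclusionAI: Ling-2.6-flash", 0.01, 0.03),
    ModelPrice("IBM: Granite 4.0 Micro", 0.017, 0.112),
    ModelPrice("Meta: Llama 3.1 8B Instruct", 0.02, 0.05),
]


def blended_rate(m: ModelPrice, prompt_share: float = 0.75) -> float:
    """Average $ per 1M tokens for a given share of prompt tokens (assumed mix)."""
    return prompt_share * m.prompt_per_m + (1 - prompt_share) * m.completion_per_m


by_prompt = sorted(MODELS, key=lambda m: m.prompt_per_m)
by_blended = sorted(MODELS, key=blended_rate)

for m in by_blended:
    print(f"{m.name}: ${blended_rate(m):.5f} / 1M blended")
```

Under this assumed mix, Llama 3.1 8B Instruct moves ahead of Granite 4.0 Micro because Granite's completion rate is more than twice as high, so completion-heavy workloads may want to rank by their own ratio.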

Total models tracked: 343.


Originally published at The Token Ledger. Subscribe for the daily digest.
