AI API Pricing Digest – 2026-07-04
Z.ai: GLM 5.2 – Prompt price fell from $0.930 / 1M to $0.909 / 1M (‑2.3%); completion price dropped from $3.000 / 1M to $2.856 / 1M (‑4.8%).
Who should care: Teams running large‑scale inference or fine‑tuning pipelines where completion tokens dominate cost; the ~4.8% reduction can shave thousands off monthly bills at high volume.
NVIDIA: Nemotron 3 Super – Prompt price decreased from $0.085 / 1M to $0.080 / 1M (‑5.9%); completion price rose from $0.400 / 1M to $0.450 / 1M (+12.5%).
Who should care: Users who rely heavily on completion output (e.g., chatbots, code generation) should monitor the 12.5% increase; prompt‑heavy workloads benefit slightly from the lower prompt rate.
Originally published at The Token Ledger. Subscribe for the daily digest.
Top comments (0)