<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: 4663437Mehdi</title>
    <description>The latest articles on DEV Community by 4663437Mehdi (@4663437mehdi).</description>
    <link>https://dev.to/4663437mehdi</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3933985%2Fd12e89b2-cc42-404c-b494-5ebd7577086c.png</url>
      <title>DEV Community: 4663437Mehdi</title>
      <link>https://dev.to/4663437mehdi</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/4663437mehdi"/>
    <language>en</language>
    <item>
      <title>Token Ledger Digest – 2026-06-06</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Sat, 06 Jun 2026 09:30:57 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/token-ledger-digest-2026-06-06-3fac</link>
      <guid>https://dev.to/4663437mehdi/token-ledger-digest-2026-06-06-3fac</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger Digest – 2026-06-06
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Removed Models
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Sao10k: Llama 3 Euryale 70B v2.1&lt;/strong&gt; – model no longer available. Prior pricing: &lt;strong&gt;$1.48 / 1M&lt;/strong&gt; prompt, &lt;strong&gt;$1.48 / 1M&lt;/strong&gt; completion (8192‑token context). Teams using this 70B variant for high‑throughput workloads must seek alternatives.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;NousResearch: Hermes 2 Pro – Llama‑3 8B&lt;/strong&gt; – model no longer available. Prior pricing: &lt;strong&gt;$0.14 / 1M&lt;/strong&gt; prompt, &lt;strong&gt;$0.14 / 1M&lt;/strong&gt; completion (8192‑token context). Developers who relied on the low‑cost 8B Hermes version need to adjust their model selection.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Google: Gemma 4 31B&lt;/strong&gt; – completion price lowered from &lt;strong&gt;$0.37 / 1M&lt;/strong&gt; to &lt;strong&gt;$0.36 / 1M&lt;/strong&gt;; prompt price unchanged at &lt;strong&gt;$0.12 / 1M&lt;/strong&gt;. Completion‑heavy workloads on Gemma 4 31B see a modest cost reduction of up to ~2.7%.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Total models in catalog: &lt;strong&gt;344&lt;/strong&gt;. No other additions or modifications recorded for today.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-06" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-06-05</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Fri, 05 Jun 2026 10:54:45 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-05-3ok9</link>
      <guid>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-05-3ok9</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-06-05
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Most cost‑impacting change&lt;/strong&gt;: Meta: Llama 3.1 8B Instruct completion price fell from $0.05 to $0.03 per 1M tokens (prompt unchanged at $0.02/1M). Completion tokens are 40% cheaper, so generation‑heavy workloads see close to 40% lower cost.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Added models&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;NVIDIA: Nemotron 3.5 Content Safety (free) – prompt $0.00/1M, completion $0.00/1M, context 128k. Ideal for zero‑cost safety filtering.&lt;/li&gt;
&lt;li&gt;NVIDIA: Nemotron 3 Ultra (free) – prompt $0.00/1M, completion $0.00/1M, context 1M. Suitable for applications needing massive context at no cost.&lt;/li&gt;
&lt;li&gt;NVIDIA: Nemotron 3 Ultra – prompt $0.50/1M, completion $2.50/1M, context 1M. Targets enterprises requiring paid ultra‑large‑context capability.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Cheapest models today&lt;/strong&gt; (per‑million):&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;inclusionAI: Ling-2.6-flash – prompt $0.01/1M, completion $0.03/1M.&lt;/li&gt;
&lt;li&gt;IBM: Granite 4.0 Micro – prompt $0.017/1M, completion $0.112/1M.&lt;/li&gt;
&lt;li&gt;Meta: Llama 3.1 8B Instruct – prompt $0.02/1M, completion $0.03/1M.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Total models tracked: 346.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-05" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-06-04</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Thu, 04 Jun 2026 10:45:11 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-04-5elc</link>
      <guid>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-04-5elc</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-06-04
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Removed
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI: GPT-4 (older v0314)&lt;/strong&gt; – model removed. Previously $30.00 / 1M prompt tokens, $60.00 / 1M completion tokens. Users relying on this high‑cost legacy model must switch to alternatives.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Added
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3.7 Plus&lt;/strong&gt; – new model with 1M context. Prompt $0.40 / 1M tokens, completion $1.60 / 1M tokens. Attractive for long‑context workloads needing low cost.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3 30B A3B Instruct 2507&lt;/strong&gt; – prompt price rose from $0.0428 to $0.04815 / 1M (+$0.00535 / 1M); completion price rose from $0.1716 to $0.19305 / 1M (+$0.02145 / 1M). ~12.5% increase. Users of this model should budget for higher per‑token cost.&lt;/li&gt;
&lt;/ul&gt;
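
&lt;p&gt;The percentage in this entry can be re-derived from the quoted prices. A minimal Python sketch follows; the helper name &lt;code&gt;pct_change&lt;/code&gt; is illustrative and not part of any Token Ledger tooling.&lt;/p&gt;

```python
def pct_change(old, new):
    """Relative change from old to new, in percent (negative means cheaper)."""
    if old == 0:
        raise ValueError("old price must be non-zero")
    return (new - old) / old * 100.0


# Qwen3 30B A3B Instruct 2507 prices quoted above, in USD per 1M tokens.
prompt_change = pct_change(0.0428, 0.04815)
completion_change = pct_change(0.1716, 0.19305)

print(f"prompt: {prompt_change:+.1f}%, completion: {completion_change:+.1f}%")
```

&lt;p&gt;Both values come out at 12.5%, matching the increase quoted above.&lt;/p&gt;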




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-04" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>AI API Pricing Digest – 2026-06-03</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Wed, 03 Jun 2026 12:09:25 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/ai-api-pricing-digest-2026-06-03-3hc8</link>
      <guid>https://dev.to/4663437mehdi/ai-api-pricing-digest-2026-06-03-3hc8</guid>
      <description>&lt;h1&gt;
  
  
  AI API Pricing Digest – 2026-06-03
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Most cost‑impacting change&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;inclusionAI: Ring-2.6-1T&lt;/strong&gt; – Prompt price fell from &lt;strong&gt;$0.30 / 1M&lt;/strong&gt; to &lt;strong&gt;$0.075 / 1M&lt;/strong&gt; (‑75 %); completion price fell from &lt;strong&gt;$2.50 / 1M&lt;/strong&gt; to &lt;strong&gt;$0.625 / 1M&lt;/strong&gt; (‑75 %).
&lt;em&gt;Who should care:&lt;/em&gt; Teams running high‑volume inference on this model can cut token costs by three‑quarters.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Other price changes&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Z.ai: GLM 5&lt;/strong&gt; – Prompt unchanged at &lt;strong&gt;$0.60 / 1M&lt;/strong&gt;; completion price dropped from &lt;strong&gt;$2.08 / 1M&lt;/strong&gt; to &lt;strong&gt;$1.92 / 1M&lt;/strong&gt; (‑7.7 %).
&lt;em&gt;Who should care:&lt;/em&gt; Users sensitive to completion cost see modest savings.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Added model&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;OpenRouter: Fusion&lt;/strong&gt; – Prompt and completion prices listed as &lt;strong&gt;‑$1.00 / token&lt;/strong&gt; (a negative placeholder value rather than a real rate, read here as free access).
&lt;em&gt;Who should care:&lt;/em&gt; Developers seeking a zero‑cost experimental model; verify actual billing before production use.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Cheapest models today (per‑million‑token rates)&lt;/strong&gt;  &lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;inclusionAI: Ling-2.6-flash – Prompt &lt;strong&gt;$0.01 / 1M&lt;/strong&gt;, Completion &lt;strong&gt;$0.03 / 1M&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;IBM: Granite 4.0 Micro – Prompt &lt;strong&gt;$0.017 / 1M&lt;/strong&gt;, Completion &lt;strong&gt;$0.112 / 1M&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Meta: Llama 3.1 8B Instruct – Prompt &lt;strong&gt;$0.02 / 1M&lt;/strong&gt;, Completion &lt;strong&gt;$0.05 / 1M&lt;/strong&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;em&gt;Total models tracked: 343.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-03" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-06-02</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Tue, 02 Jun 2026 11:44:20 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-02-24fj</link>
      <guid>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-02-24fj</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-06-02
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Most Impactful Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3 30B A3B Instruct 2507&lt;/strong&gt; – Prompt price fell from &lt;strong&gt;$0.0900&lt;/strong&gt; to &lt;strong&gt;$0.0428&lt;/strong&gt; /1M tokens; completion price fell from &lt;strong&gt;$0.3000&lt;/strong&gt; to &lt;strong&gt;$0.1716&lt;/strong&gt; /1M tokens.
&lt;em&gt;Who should care:&lt;/em&gt; Teams running cost‑sensitive inference on this model see ~52% lower prompt and ~43% lower completion costs.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Changes
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Tencent: Hy3 preview&lt;/strong&gt; – Prompt &lt;strong&gt;$0.0660 → $0.0630&lt;/strong&gt; /1M; Completion &lt;strong&gt;$0.2600 → $0.2100&lt;/strong&gt; /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Users of Hy3 preview benefit from modest savings on both prompt and completion.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MiniMax: MiniMax M2.7&lt;/strong&gt; – Prompt &lt;strong&gt;$0.2600 → $0.2790&lt;/strong&gt; /1M (↑$0.0190); Completion unchanged at &lt;strong&gt;$1.2000&lt;/strong&gt; /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Slight increase in prompt cost; completion cost stable.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DeepSeek: DeepSeek V3.2&lt;/strong&gt; – Prompt &lt;strong&gt;$0.2520 → $0.2288&lt;/strong&gt; /1M; Completion &lt;strong&gt;$0.3780 → $0.3432&lt;/strong&gt; /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Moderate reductions across both prompt and completion.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DeepSeek: DeepSeek V3&lt;/strong&gt; – Prompt &lt;strong&gt;$0.2288 → $0.2002&lt;/strong&gt; /1M; Completion &lt;strong&gt;$0.9144 → $0.8001&lt;/strong&gt; /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Prompt and completion both drop ~12.5%, a notable saving for chat workloads.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Removed Models
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Baidu: ERNIE 4.5 300B A47B&lt;/strong&gt; – No longer available; previously &lt;strong&gt;$0.2800&lt;/strong&gt; prompt /1M, &lt;strong&gt;$1.1000&lt;/strong&gt; completion /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Users needing this large Baidu model must migrate to alternatives.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Google: Gemini 2.0 Flash Lite&lt;/strong&gt; – Removed; previously &lt;strong&gt;$0.0750&lt;/strong&gt; prompt /1M, &lt;strong&gt;$0.3000&lt;/strong&gt; completion /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Applications relying on this low‑latency model need replacement.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Google: Gemini 2.0 Flash&lt;/strong&gt; – Removed; previously &lt;strong&gt;$0.1000&lt;/strong&gt; prompt /1M, &lt;strong&gt;$0.4000&lt;/strong&gt; completion /1M.
&lt;em&gt;Who should care:&lt;/em&gt; Users of the standard Flash model must adjust their provider list.&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-02" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-06-01</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Mon, 01 Jun 2026 12:50:07 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-01-56h8</link>
      <guid>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-06-01-56h8</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-06-01
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Most cost‑impacting change&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;inclusionAI: Ring-2.6-1T&lt;/strong&gt; – Prompt price rose from &lt;strong&gt;$0.075/1M&lt;/strong&gt; to &lt;strong&gt;$0.30/1M&lt;/strong&gt; (4×); completion price rose from &lt;strong&gt;$0.625/1M&lt;/strong&gt; to &lt;strong&gt;$2.50/1M&lt;/strong&gt; (4×).
&lt;em&gt;Who should care:&lt;/em&gt; Teams running high‑volume inference on this model will see a four‑fold increase in per‑token cost; consider re‑evaluating usage or switching to alternatives.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Added models&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MiniMax: MiniMax M3&lt;/strong&gt; – Prompt &lt;strong&gt;$0.30/1M&lt;/strong&gt;, completion &lt;strong&gt;$1.20/1M&lt;/strong&gt;, 1M‑token context.
&lt;em&gt;Who should care:&lt;/em&gt; Applications needing very long context (up to 1M tokens) with moderate pricing.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Upstage: Solar Pro 3&lt;/strong&gt; – Prompt &lt;strong&gt;$0.15/1M&lt;/strong&gt;, completion &lt;strong&gt;$0.60/1M&lt;/strong&gt;, 128k‑token context.
&lt;em&gt;Who should care:&lt;/em&gt; Users seeking a cheaper mid‑range model with decent context length.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Removed models&lt;/strong&gt; (no longer available)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DeepSeek V4 Flash (free) – $0/$0
&lt;/li&gt;
&lt;li&gt;Xiaomi MiMo‑V2‑Omni – $0.40/1M prompt, $2.00/1M completion
&lt;/li&gt;
&lt;li&gt;Xiaomi MiMo‑V2‑Pro – $1.00/1M prompt, $3.00/1M completion
&lt;/li&gt;
&lt;li&gt;Mistral Devstral Medium – $0.40/1M prompt, $2.00/1M completion
&lt;/li&gt;
&lt;li&gt;Mistral Devstral Small 1.1 – $0.10/1M prompt, $0.30/1M completion
&lt;/li&gt;
&lt;li&gt;Mistral Large 2411 – $2.00/1M prompt, $6.00/1M completion
&lt;/li&gt;
&lt;li&gt;Mistral Pixtral Large 2411 – $2.00/1M prompt, $6.00/1M completion
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;Who should care:&lt;/em&gt; Anyone calling the models listed above; migrate workloads off these IDs to avoid broken calls.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Other price adjustments&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Tencent: Hy3 preview&lt;/strong&gt; – Prompt $0.063→$0.066/1M (+5%); completion $0.21→$0.26/1M (+24%).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Z.ai: GLM 5&lt;/strong&gt; – Prompt unchanged $0.60/1M; completion $1.92→$2.08/1M (+8%).
&lt;em&gt;Who should care:&lt;/em&gt; Minor cost uplift; monitor if usage scales significantly.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Total models tracked: &lt;strong&gt;345&lt;/strong&gt;.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-06-01" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>Token Ledger Digest – 2026-05-31</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Sun, 31 May 2026 09:51:35 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-31-4ofo</link>
      <guid>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-31-4ofo</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger Digest – 2026-05-31
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Cost‑impacting change
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Qwen: Qwen3 235B A22B Thinking 2507&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Prompt:&lt;/strong&gt; fell from $0.1495/1M to $0.10/1M (‑33%).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Completion:&lt;/strong&gt; fell from $1.495/1M to $0.10/1M (‑93%).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Teams running large‑scale generation workloads where output token cost dominates; this cut reduces per‑million completion expense by ~$1.40.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Other price changes
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;MiniMax: MiniMax M2.7&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt: $0.279/1M → $0.26/1M (‑7%). Completion unchanged at $1.20/1M.
&lt;/li&gt;
&lt;li&gt;Relevant for users prioritizing prompt‑heavy tasks.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;OpenAI: gpt-oss-20b&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt: $0.03/1M → $0.029/1M (‑3%). Completion unchanged at $0.14/1M.
&lt;/li&gt;
&lt;li&gt;Minor saving for prompt‑heavy apps using this model.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Model removals (6)
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model&lt;/th&gt;
&lt;th&gt;Context&lt;/th&gt;
&lt;th&gt;Prompt ($/1M)&lt;/th&gt;
&lt;th&gt;Completion ($/1M)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;MiniMax: MiniMax M2.5 (free)&lt;/td&gt;
&lt;td&gt;204,800&lt;/td&gt;
&lt;td&gt;0.00&lt;/td&gt;
&lt;td&gt;0.00&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Upstage: Solar Pro 3&lt;/td&gt;
&lt;td&gt;128,000&lt;/td&gt;
&lt;td&gt;0.15&lt;/td&gt;
&lt;td&gt;0.60&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Baidu: ERNIE 4.5 21B A3B Thinking&lt;/td&gt;
&lt;td&gt;131,072&lt;/td&gt;
&lt;td&gt;0.07&lt;/td&gt;
&lt;td&gt;0.28&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Baidu: ERNIE 4.5 21B A3B&lt;/td&gt;
&lt;td&gt;131,072&lt;/td&gt;
&lt;td&gt;0.07&lt;/td&gt;
&lt;td&gt;0.28&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;AlfredPros: CodeLLaMa 7B Instruct Solidity&lt;/td&gt;
&lt;td&gt;4,096&lt;/td&gt;
&lt;td&gt;0.80&lt;/td&gt;
&lt;td&gt;1.20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Mistral: Mistral 7B Instruct v0.1&lt;/td&gt;
&lt;td&gt;4,096&lt;/td&gt;
&lt;td&gt;0.11&lt;/td&gt;
&lt;td&gt;0.19&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Developers relying on any of these models must migrate to alternatives. Only the MiniMax M2.5 (free) entry was a free tier; the other five were paid.&lt;/p&gt;

&lt;h2&gt;
  
  
  Cheapest models today (per‑million)
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;inclusionAI: Ling-2.6-flash&lt;/strong&gt; – Prompt $0.01, Completion $0.03
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;IBM: Granite 4.0 Micro&lt;/strong&gt; – Prompt $0.017, Completion $0.112
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Meta: Llama 3.1 8B Instruct&lt;/strong&gt; – Prompt $0.02, Completion $0.05
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Total models tracked: 350. No other meaningful changes recorded.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-31" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>Token Ledger Digest – 2026-05-30</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Sat, 30 May 2026 09:26:00 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-30-k26</link>
      <guid>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-30-k26</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger Digest – 2026-05-30
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Removed Model
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI: GPT-4o Audio&lt;/strong&gt; – model no longer available.

&lt;ul&gt;
&lt;li&gt;Previous pricing: &lt;strong&gt;$2.50 / 1M prompt tokens&lt;/strong&gt;, &lt;strong&gt;$10.00 / 1M completion tokens&lt;/strong&gt;.
&lt;/li&gt;
&lt;li&gt;Who should care: Teams relying on audio‑enabled GPT‑4o must migrate to alternatives.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Decreases
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;MoonshotAI Kimi Latest&lt;/strong&gt; (&lt;code&gt;~moonshotai/kimi-latest&lt;/code&gt;)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt: &lt;strong&gt;$0.73 → $0.68 / 1M&lt;/strong&gt; (−6.3%).
&lt;/li&gt;
&lt;li&gt;Completion: &lt;strong&gt;$3.49 → $3.42 / 1M&lt;/strong&gt; (−2.0%).
&lt;/li&gt;
&lt;li&gt;Who should care: Cost‑sensitive applications using Kimi for long‑form generation.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;MoonshotAI Kimi K2.6&lt;/strong&gt; (&lt;code&gt;moonshotai/kimi-k2.6&lt;/code&gt;) – identical changes to Kimi Latest.  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt: &lt;strong&gt;$0.73 → $0.68 / 1M&lt;/strong&gt;.
&lt;/li&gt;
&lt;li&gt;Completion: &lt;strong&gt;$3.49 → $3.42 / 1M&lt;/strong&gt;.
&lt;/li&gt;
&lt;li&gt;Who should care: Users of the K2.6 variant see the same savings.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;DeepSeek V4 Flash&lt;/strong&gt; (&lt;code&gt;deepseek/deepseek-v4-flash&lt;/code&gt;)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt: &lt;strong&gt;$0.10 → $0.098 / 1M&lt;/strong&gt; (−1.7%).
&lt;/li&gt;
&lt;li&gt;Completion: &lt;strong&gt;$0.20 → $0.197 / 1M&lt;/strong&gt; (−1.7%).
&lt;/li&gt;
&lt;li&gt;Who should care: Developers running high‑volume flash workloads benefit marginally.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Increase
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Qwen Qwen3.5‑35B‑A3B&lt;/strong&gt; (&lt;code&gt;qwen/qwen3.5-35b-a3b&lt;/code&gt;)

&lt;ul&gt;
&lt;li&gt;Prompt: &lt;strong&gt;$0.139 → $0.140 / 1M&lt;/strong&gt; (+0.7%).
&lt;/li&gt;
&lt;li&gt;Completion: unchanged at &lt;strong&gt;$1.00 / 1M&lt;/strong&gt;.
&lt;/li&gt;
&lt;li&gt;Who should care: Slight uptick for prompt‑heavy Qwen usage; monitor budget impact.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-30" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>2026-05-29 Digest</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Fri, 29 May 2026 10:55:08 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/2026-05-29-digest-3ep3</link>
      <guid>https://dev.to/4663437mehdi/2026-05-29-digest-3ep3</guid>
      <description>&lt;h1&gt;
  
  
  2026-05-29 Digest
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Most Impactful Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; DeepSeek: DeepSeek V3.2 Speciale
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Removed from the model catalog
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Prompt price was &lt;strong&gt;$0.287 / 1M tokens&lt;/strong&gt;, completion price &lt;strong&gt;$0.431 / 1M tokens&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Teams running high‑volume, cost‑sensitive workloads; loss of this low‑cost option may raise effective inference spend and prompt a search for cheaper substitutes.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Added Models
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Model:&lt;/strong&gt; StepFun: Step 3.7 Flash  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Newly available
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Prompt &lt;strong&gt;$0.20 / 1M&lt;/strong&gt;, completion &lt;strong&gt;$1.15 / 1M&lt;/strong&gt; (context 256k)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Users needing large‑context generation at moderate cost; suitable for document‑level tasks.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Model:&lt;/strong&gt; Anthropic: Claude Opus 4.8 (Fast)  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Newly available
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Prompt &lt;strong&gt;$10.00 / 1M&lt;/strong&gt;, completion &lt;strong&gt;$50.00 / 1M&lt;/strong&gt; (context 1M)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Enterprises prioritizing top‑tier reasoning speed and willing to pay a premium.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Model:&lt;/strong&gt; Anthropic: Claude Opus 4.8  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Newly available
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Prompt &lt;strong&gt;$5.00 / 1M&lt;/strong&gt;, completion &lt;strong&gt;$25.00 / 1M&lt;/strong&gt; (context 1M)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Organizations needing high‑quality outputs at half the Fast variant's per‑token price, where the extra speed is not required.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h2&gt;
  
  
  Removed Model (aside from the lead)
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; Baidu: Qianfan-OCR-Fast

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Removed
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Prompt &lt;strong&gt;$0.68 / 1M&lt;/strong&gt;, completion &lt;strong&gt;$2.81 / 1M&lt;/strong&gt; (context 65k)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; OCR‑focused pipelines that relied on this model’s pricing; may need to adjust OCR service costs.
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h2&gt;
  
  
  Price Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; Z.ai: GLM 4.5 Air

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Completion price increased
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Numbers:&lt;/strong&gt; Old completion &lt;strong&gt;$0.84 / 1M&lt;/strong&gt;, new completion &lt;strong&gt;$0.85 / 1M&lt;/strong&gt; (prompt unchanged at &lt;strong&gt;$0.125 / 1M&lt;/strong&gt;)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Developers using this model for completion‑heavy workloads; budget impact is minimal (~1% rise).
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h2&gt;
  
  
  Cheapest Models Today (for reference)
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;inclusionAI: Ling-2.6-flash – &lt;strong&gt;$0.01 / 1M&lt;/strong&gt; prompt, &lt;strong&gt;$0.03 / 1M&lt;/strong&gt; completion
&lt;/li&gt;
&lt;li&gt;IBM: Granite 4.0 Micro – &lt;strong&gt;$0.017 / 1M&lt;/strong&gt; prompt, &lt;strong&gt;$0.112 / 1M&lt;/strong&gt; completion
&lt;/li&gt;
&lt;li&gt;Meta: Llama 3.1 8B Instruct – &lt;strong&gt;$0.02 / 1M&lt;/strong&gt; prompt, &lt;strong&gt;$0.05 / 1M&lt;/strong&gt; completion
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;em&gt;Total models tracked: 357.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-29" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>2026-05-28 Token Ledger Digest</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Thu, 28 May 2026 11:03:02 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/2026-05-28-token-ledger-digest-1ik4</link>
      <guid>https://dev.to/4663437mehdi/2026-05-28-token-ledger-digest-1ik4</guid>
      <description>&lt;h1&gt;
  
  
  2026-05-28 Token Ledger Digest
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Price Change
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; Tencent: Hy3 preview
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Prompt price fell from $0.066/1M to $0.063/1M tokens; completion price fell from $0.26/1M to $0.21/1M tokens.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Developers running Hy3 preview in cost‑sensitive pipelines.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Added
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; MoonshotAI: Kimi K2.6 (free)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; New free model released with a 262,144‑token context window.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Teams needing zero‑cost long‑context generation.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Removed
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; Baidu Qianfan: CoBuddy (free)
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;What changed:&lt;/strong&gt; Free model with a 131,072‑token context window removed from the catalog.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Users who depended on this free offering; must migrate to an alternative.
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Total models tracked:&lt;/strong&gt; 356.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-28" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>Token Ledger Digest – 2026-05-27</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Wed, 27 May 2026 11:03:45 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-27-52l9</link>
      <guid>https://dev.to/4663437mehdi/token-ledger-digest-2026-05-27-52l9</guid>
      <description>&lt;h1&gt;
  
  
  Token Ledger Digest – 2026-05-27
&lt;/h1&gt;

&lt;p&gt;The most cost‑impacting change is a 50% price cut for &lt;strong&gt;Qwen: Qwen3.7 Max&lt;/strong&gt;, reducing both prompt and completion costs by half.&lt;/p&gt;

&lt;h2&gt;
  
  
  Price Changes
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3.7 Max&lt;/strong&gt; – Prompt price fell from $2.50 to $1.25 per 1M tokens; completion price fell from $7.50 to $3.75 per 1M tokens. &lt;em&gt;Who should care:&lt;/em&gt; Large‑scale Qwen users save $1.25 per 1M prompt tokens and $3.75 per 1M completion tokens ($5.00 for 1M of each).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3.6 35B A3B&lt;/strong&gt; – Prompt price down 6.7% from $0.15 to $0.14 per 1M; completion unchanged at $1.00 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; Minor savings for prompt‑heavy workloads.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Qwen: Qwen3.6 27B&lt;/strong&gt; – Prompt price down 3.3% from $0.30 to $0.29 per 1M; completion unchanged at $3.20 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; Small reduction for prompt‑intensive tasks.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Xiaomi: MiMo‑V2.5‑Pro&lt;/strong&gt; – Prompt price cut 56.5% from $1.00 to $0.435 per 1M; completion price cut 71% from $3.00 to $0.87 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; Significant savings for both input and output‑heavy applications.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Xiaomi: MiMo‑V2.5&lt;/strong&gt; – Prompt price down 65% from $0.40 to $0.14 per 1M; completion price down 86% from $2.00 to $0.28 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; Large cost reduction, especially for completion‑heavy workloads.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Z.ai: GLM 4.5 Air&lt;/strong&gt; – Prompt price down 3.8% from $0.13 to $0.125 per 1M; completion price down 1.2% from $0.85 to $0.84 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; Negligible impact; relevant only for very high‑volume users.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DeepSeek: DeepSeek V3&lt;/strong&gt; – Prompt price down 28.5% from $0.32 to $0.2288 per 1M; completion price up 2.7% from $0.89 to $0.9144 per 1M. &lt;em&gt;Who should care:&lt;/em&gt; A workload with equal prompt and completion volume saves $0.0668 for every 1M prompt plus 1M completion tokens; heavily completion‑weighted workloads can see a slight net increase.&lt;/li&gt;
&lt;/ul&gt;
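
&lt;p&gt;The percentages and savings above can be checked with a few lines of arithmetic. The following is a minimal sketch in Python, not part of the Token Ledger tooling: the helper names &lt;code&gt;pct_change&lt;/code&gt; and &lt;code&gt;blended_saving&lt;/code&gt; and the &lt;code&gt;prompt_share&lt;/code&gt; workload mixes are assumptions made for illustration; only the DeepSeek V3 prices come from the list.&lt;/p&gt;

```python
from decimal import Decimal


def pct_change(old, new):
    """Percent change from old to new price; a negative value is a cut."""
    old, new = Decimal(old), Decimal(new)
    return (new - old) / old * 100


def blended_saving(old_prompt, new_prompt, old_completion, new_completion, prompt_share):
    """Saving in USD per 1M total tokens.

    prompt_share is the fraction of all tokens that are prompt tokens
    (a hypothetical workload parameter, not a figure from the digest).
    """
    share = Decimal(prompt_share)
    old = share * Decimal(old_prompt) + (1 - share) * Decimal(old_completion)
    new = share * Decimal(new_prompt) + (1 - share) * Decimal(new_completion)
    return old - new


# DeepSeek V3 prices from the list above (USD per 1M tokens).
print("prompt change %:", round(pct_change("0.32", "0.2288"), 1))
print("completion change %:", round(pct_change("0.89", "0.9144"), 1))

# Hypothetical workload mixes: 90%, 50%, and 10% prompt tokens.
for share in ("0.9", "0.5", "0.1"):
    saving = blended_saving("0.32", "0.2288", "0.89", "0.9144", share)
    print("prompt share", share, "saving per 1M total tokens:", saving)
```

&lt;p&gt;Under these assumptions, a 50/50 mix saves $0.0334 per 1M total tokens, while a 10% prompt share yields a negative saving, meaning the bill goes up slightly.&lt;/p&gt;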

&lt;h2&gt;
  
  
  Removed Model
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Arcee AI: Trinity Large Thinking (free)&lt;/strong&gt; – Model removed from catalog; previously offered zero‑cost access with a 262k‑token context. &lt;em&gt;Who should care:&lt;/em&gt; Users who relied on a free large‑context model must now switch to a paid alternative.&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-27" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
    <item>
      <title>The Token Ledger Digest – 2026-05-25</title>
      <dc:creator>4663437Mehdi</dc:creator>
      <pubDate>Mon, 25 May 2026 11:17:49 +0000</pubDate>
      <link>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-05-25-211k</link>
      <guid>https://dev.to/4663437mehdi/the-token-ledger-digest-2026-05-25-211k</guid>
      <description>&lt;h1&gt;
  
  
  The Token Ledger Digest – 2026-05-25
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Removed Model
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Model:&lt;/strong&gt; Tongyi DeepResearch 30B A3B (&lt;code&gt;alibaba/tongyi-deepresearch-30b-a3b&lt;/code&gt;)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Change:&lt;/strong&gt; Removed from the model catalog&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Details:&lt;/strong&gt; Context length 131,072 tokens; Prompt price &lt;strong&gt;$0.09 / 1M tokens&lt;/strong&gt;; Completion price &lt;strong&gt;$0.45 / 1M tokens&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Who should care:&lt;/strong&gt; Developers and teams using this model for deep‑research or long‑context tasks should plan migration to alternative models to avoid service disruption.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No models were added and no price changes were recorded today. The catalog now lists 357 models.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://4663437Mehdi.github.io/token-ledger/entry.html?d=2026-05-25" rel="noopener noreferrer"&gt;The Token Ledger&lt;/a&gt;. Subscribe for the daily digest.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>api</category>
      <category>news</category>
    </item>
  </channel>
</rss>
