<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: keeper</title>
    <description>The latest articles on DEV Community by keeper (@lanternproton).</description>
    <link>https://dev.to/lanternproton</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3921069%2F1021b254-811e-486b-bf4d-526f3004db30.png</url>
      <title>DEV Community: keeper</title>
      <link>https://dev.to/lanternproton</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/lanternproton"/>
    <language>en</language>
    <item>
      <title>The Router Couldn't See My NAS — A 3-Hour Debug Into a Silent Intel NIC Bug</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Fri, 29 May 2026 16:31:13 +0000</pubDate>
      <link>https://dev.to/lanternproton/the-router-couldnt-see-my-nas-a-3-hour-debug-into-a-silent-intel-nic-bug-55g7</link>
      <guid>https://dev.to/lanternproton/the-router-couldnt-see-my-nas-a-3-hour-debug-into-a-silent-intel-nic-bug-55g7</guid>
      <description>&lt;h2&gt;
  
  
  The Router Couldn't See My NAS
&lt;/h2&gt;

&lt;p&gt;My TRIM NAS at &lt;code&gt;192.168.3.135&lt;/code&gt; had been running Hermes Agent 24/7 for months — handling Telegram gateway, proxy routing (mihomo), cron jobs, and file serving. It's a solid Debian 12 box with a custom 6.12.18 kernel.&lt;/p&gt;

&lt;p&gt;Then one evening I noticed I couldn't reach Telegram. Then my Windows machine (T2) lost internet. Then even the router's device list showed the NAS as offline.&lt;/p&gt;

&lt;p&gt;Ping to the router gateway (&lt;code&gt;192.168.3.1&lt;/code&gt;) worked fine from the NAS itself. But the router couldn't see it.&lt;/p&gt;

&lt;p&gt;This is the story of what I found — and how the real culprit was a feature meant to save electricity.&lt;/p&gt;

&lt;h2&gt;
  
  
  Phase 1: The Obvious Suspect — Mihomo
&lt;/h2&gt;

&lt;p&gt;The first thing I checked was mihomo, my proxy daemon. It's the gateway between my LAN and the outside world. If it crashes, everything behind it loses connectivity — Telegram, web browsing, API calls — the whole stack.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;May 29 23:52:17 trim-0c5b mihomo[692755]: level=fatal msg="Parse config error:
  rules[14] [DOMAIN-SUFFIX,api.telegram.org,📱DEFAULT] error:
  proxy [📱DEFAULT] not found"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Clear enough. The config had rules referencing proxy groups (&lt;code&gt;📱DEFAULT&lt;/code&gt; and &lt;code&gt;📱Telegram&lt;/code&gt;) that no longer existed — likely from a subscription update that replaced the proxy-groups section but left the old rules untouched. Mihomo refused to start. Fix was straightforward: point those rules to an existing group (&lt;code&gt;🚀节点选择&lt;/code&gt;).&lt;/p&gt;

&lt;p&gt;But this was the &lt;em&gt;consequence&lt;/em&gt;, not the root cause. Why did I reset the NAS in the first place?&lt;/p&gt;

&lt;h2&gt;
  
  
  Phase 2: Looking Deeper — System Logs Tell a Story
&lt;/h2&gt;

&lt;p&gt;The reset happened at &lt;code&gt;22:47&lt;/code&gt;. Working backwards through &lt;code&gt;/var/log/syslog.1&lt;/code&gt;, I found the real timeline:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;21:00 — node[368932]: [fetch-timeout] fetch timeout after 10000ms
         url=https://api.telegram.org/bot***/getMe
21:01 — same timeout
21:02 — same timeout
...repeats every 60 seconds until 22:47...
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This was the Hermes Telegram gateway — a Node.js process — unable to reach Telegram's API. Each request timed out after 10 seconds. Meanwhile:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;21:09 — mihomo error: 🇺🇸美国圣何塞06 failed health check: context deadline exceeded
21:19 — 🇯🇵日本东京03, 🇯🇵日本东京04 also failing
21:24 — 🇹🇼台湾, 🇺🇸洛杉矶 also failing
...proxy node health checks kept failing in waves...
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The proxy nodes were dropping one by one. Curl through the proxy to Google worked fine, but sustained connections kept timing out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key insight&lt;/strong&gt;: The NAS was &lt;em&gt;technically&lt;/em&gt; online (I could SSH in), but under high proxy load, the NIC was becoming invisible to the router. The router's ARP table expired and couldn't re-resolve the NAS's MAC address.&lt;/p&gt;

&lt;p&gt;No kernel panics. No OOM. No driver crash. No link-down events in dmesg, syslog, or kern.log. The NIC just... stopped responding at layer 2 under sustained load.&lt;/p&gt;

&lt;h2&gt;
  
  
  Phase 3: Hardware Fingerprint
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;03:00.0 Ethernet controller: Intel Corporation I211 Gigabit Network Connection
Driver: igb (in-kernel, version 6.12.18-trim)
Firmware: 0.4-1
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Intel I211. Not the notoriously buggy I225/I226 that plagues many 2.5GbE boards — just a plain, reliable old 1GbE chip. I211 has been shipping since 2012. It should be bulletproof.&lt;/p&gt;

&lt;p&gt;But the boot params told a different story:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;GRUB_CMDLINE_LINUX="modprobe.blacklist=pcspkr pcie_aspm=off"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;code&gt;pcie_aspm=off&lt;/code&gt; — someone had already encountered PCIe power management issues with this NIC before me.&lt;/p&gt;

&lt;h2&gt;
  
  
  Phase 4: The Real Culprit — EEE
&lt;/h2&gt;

&lt;p&gt;I checked the NIC's Energy Efficient Ethernet settings:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;EEE status: enabled - inactive
Supported EEE link modes: 100baseT/Full, 1000baseT/Full
Advertised EEE link modes: 100baseT/Full, 1000baseT/Full
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;EEE (IEEE 802.3az) lets the NIC drop into a low-power idle state between packets. On paper it saves a few watts. In practice, on the igb driver with an I211, the NIC sometimes fails to properly re-establish the link when exiting EEE under high connection churn.&lt;/p&gt;

&lt;p&gt;This is exactly what was happening:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;The Telegram gateway was creating and closing connections at high frequency (one timeout-retry every 60 seconds × hours = hundreds of connections)&lt;/li&gt;
&lt;li&gt;Each connection teardown triggers EEE negotiation&lt;/li&gt;
&lt;li&gt;At some point, the link doesn't come back cleanly&lt;/li&gt;
&lt;li&gt;The router sees the port as active (switch lights are on) but gets no response to ARP queries&lt;/li&gt;
&lt;li&gt;From the router's perspective: NAS is gone&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;EEE is the most common cause of "link light is on but device is unreachable" on Intel NICs. It affects I210, I211, I225, and I226 to varying degrees.&lt;/p&gt;

&lt;h2&gt;
  
  
  Phase 5: The Three Fixes
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Fix 1 — Kill EEE (immediate)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;ethtool &lt;span class="nt"&gt;--set-eee&lt;/span&gt; enp3s0 eee off
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This disables EEE immediately without a reboot. The NIC stays in full-power mode and never enters low-power idle. Confirmed:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;EEE status: disabled
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Fix 2 — Double the Ring Buffers
&lt;/h3&gt;

&lt;p&gt;The default ring buffer on the igb driver is 256 descriptors for both RX and TX. Maximum is 4096. Under sustained proxy load with hundreds of concurrent connections, 256 is a bottleneck — the NIC runs out of buffer space and starts dropping packets.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;ethtool &lt;span class="nt"&gt;-G&lt;/span&gt; enp3s0 rx 2048 tx 2048
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This increases the buffer to 2048 descriptors each. The NIC now has 8× more room to queue packets before dropping them.&lt;/p&gt;

&lt;h3&gt;
  
  
  Fix 3 — Make It Stick (persistence)
&lt;/h3&gt;

&lt;p&gt;For the boot params:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;# /etc/default/grub
GRUB_CMDLINE_LINUX="modprobe.blacklist=pcspkr pcie_aspm=off igb.eee=0"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;code&gt;igb.eee=0&lt;/code&gt; tells the igb kernel module to never enable EEE, regardless of what the link partner advertises.&lt;/p&gt;

&lt;p&gt;For the ring buffer and EEE state, I created a systemd oneshot service:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight ini"&gt;&lt;code&gt;&lt;span class="nn"&gt;[Unit]&lt;/span&gt;
&lt;span class="py"&gt;Description&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;NIC tuning - Intel I211 fixes&lt;/span&gt;
&lt;span class="py"&gt;After&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;network.target&lt;/span&gt;
&lt;span class="py"&gt;Before&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;network-online.target&lt;/span&gt;

&lt;span class="nn"&gt;[Service]&lt;/span&gt;
&lt;span class="py"&gt;Type&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;oneshot&lt;/span&gt;
&lt;span class="py"&gt;ExecStart&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;/usr/sbin/ethtool --set-eee enp3s0 eee off&lt;/span&gt;
&lt;span class="py"&gt;ExecStart&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;/usr/sbin/ethtool -G enp3s0 rx 2048 tx 2048&lt;/span&gt;
&lt;span class="py"&gt;RemainAfterExit&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;true&lt;/span&gt;

&lt;span class="nn"&gt;[Install]&lt;/span&gt;
&lt;span class="py"&gt;WantedBy&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;multi-user.target&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This runs before the network is declared online, so every service that follows sees the tuned NIC.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Learned
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;The hardest bugs leave no logs.&lt;/strong&gt; The NIC didn't crash, didn't report errors, didn't trigger a kernel oops. It just silently stopped responding to ARP. If I hadn't checked the EEE status, I'd still be blaming mihomo or the router.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;EEE is a false economy.&lt;/strong&gt; The power savings on a 1GbE desktop NIC are negligible — maybe 0.3-0.5 watts. The stability cost far exceeds the benefit. For any always-on server, NAS, or gateway running the igb driver: turn EEE off.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Ring buffer defaults are tuned for desktops, not servers.&lt;/strong&gt; 256 descriptors is fine for a web browser but chokes under proxy load. Doubling or quadrupling it costs zero overhead in practice and eliminates an entire class of packet-drop edge cases.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The mihomo config bug was a distraction.&lt;/strong&gt; It was the &lt;em&gt;symptom&lt;/em&gt; that I noticed, but the real problem was at layer 2. If I'd just fixed mihomo and moved on, the EEE drop would have come back within days.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current State
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight ini"&gt;&lt;code&gt;&lt;span class="err"&gt;EEE&lt;/span&gt; &lt;span class="err"&gt;status:&lt;/span&gt;          &lt;span class="err"&gt;disabled&lt;/span&gt;
&lt;span class="err"&gt;Ring&lt;/span&gt; &lt;span class="err"&gt;buffer&lt;/span&gt; &lt;span class="err"&gt;RX/TX:&lt;/span&gt;   &lt;span class="err"&gt;2048&lt;/span&gt; &lt;span class="err"&gt;/&lt;/span&gt; &lt;span class="err"&gt;2048&lt;/span&gt;
&lt;span class="err"&gt;Boot&lt;/span&gt; &lt;span class="err"&gt;param:&lt;/span&gt;          &lt;span class="py"&gt;pcie_aspm&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="s"&gt;off igb.eee=0&lt;/span&gt;
&lt;span class="err"&gt;Systemd&lt;/span&gt; &lt;span class="err"&gt;service:&lt;/span&gt;     &lt;span class="err"&gt;hermes-nic.service&lt;/span&gt; &lt;span class="err"&gt;(enabled)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Router has been seeing the NAS continuously for 24+ hours since the fixes. Telegram gateway stable. Proxy health checks clean.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Using this? A ⭐ or a one-word issue tells me what to build next — helps more than you'd think.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>hardware</category>
      <category>linux</category>
      <category>networking</category>
      <category>devops</category>
    </item>
    <item>
      <title>Windows vs Linux for Local AI: My Radeon 890M Has 96GB of RAM, but Windows Only Lets Me Use 3.5GB</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Mon, 25 May 2026 23:40:54 +0000</pubDate>
      <link>https://dev.to/lanternproton/windows-vs-linux-for-local-ai-my-radeon-890m-has-96gb-of-ram-but-windows-only-lets-me-use-35gb-39ln</link>
      <guid>https://dev.to/lanternproton/windows-vs-linux-for-local-ai-my-radeon-890m-has-96gb-of-ram-but-windows-only-lets-me-use-35gb-39ln</guid>
      <description>&lt;h2&gt;
  
  
  The Setup
&lt;/h2&gt;

&lt;p&gt;I run a Ryzen AI 9 HX 370 mini PC as my daily AI workstation. 96GB of system RAM, a Radeon 890M integrated GPU (gfx1150, RDNA 3.5). On paper, this should be a capable local inference box — 96GB is enough to run Gemma 4, Llama 3 70B, even Mixtral 8x22B at reasonable quantization.&lt;/p&gt;

&lt;p&gt;There's just one problem: &lt;strong&gt;Windows refuses to let the iGPU use more than ~3.5GB of that memory.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I spent the better part of a week trying every workaround I could find. Here's what I learned — and why Linux won this round.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Windows VRAM Wall
&lt;/h2&gt;

&lt;p&gt;The Radeon 890M is an integrated GPU. It has no dedicated VRAM. On Windows, the GPU driver allocates a fixed &lt;strong&gt;shared GPU memory&lt;/strong&gt; budget from system RAM. For the 890M, that budget caps at roughly &lt;strong&gt;3.5GB&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;I tried everything:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Registry hacks&lt;/strong&gt; (&lt;code&gt;GpuMemoryAllocation&lt;/code&gt;, &lt;code&gt;HwSchMode&lt;/code&gt;) → no effect&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DirectML via PyTorch&lt;/strong&gt; → OOM on anything above a 7B model&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;llama.cpp with Vulkan backend&lt;/strong&gt; → same 3.5GB limit, enforced by the driver&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Disabling Memory Integrity / VBS&lt;/strong&gt; → freed up ~500MB, still hit the wall&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;BIOS UMA Frame Buffer tweaking&lt;/strong&gt; → the 890M's firmware-based allocation is hard-coded&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The root cause? &lt;strong&gt;Windows' WDDM driver model + Hypervisor-based security (VBS)&lt;/strong&gt;. When Memory Integrity is enabled (and on modern Windows 11 installs, it is by default), every GPU memory allocation goes through an extra hypervisor verification layer. The driver responds by clamping shared GPU memory to a conservative default. And there's no "unlock" switch — even in Group Policy or AMD's own Adrenalin control panel.&lt;/p&gt;

&lt;p&gt;For a 7B model at Q4_K_M (~4.5GB VRAM needed), that 3.5GB wall means the model doesn't fit. Period. The alternative is CPU-only inference — using all 96GB of RAM, but at maybe 2-3 tokens per second for anything larger than 7B.&lt;/p&gt;

&lt;p&gt;Windows gives you a choice: tiny models on GPU, or glacial models on CPU. There's no middle ground.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Linux Promise (and Its Own Problem)
&lt;/h2&gt;

&lt;p&gt;Linux doesn't have this VRAM cap. The Mesa Vulkan driver for AMD GPUs (&lt;code&gt;amdgpu&lt;/code&gt;, &lt;code&gt;radv&lt;/code&gt;) lets the GPU use as much system memory as needed. You can allocate 64GB for a model and the driver won't blink.&lt;/p&gt;

&lt;p&gt;So I set up a dual-boot, installed ROCm, and ran a few tests. The performance difference was dramatic — llama.cpp with Vulkan on Linux could load a 14B Q4_K_M model entirely on the GPU, no VRAM wall.&lt;/p&gt;

&lt;p&gt;But there was a catch. &lt;strong&gt;ROCm does not support the Radeon 890M.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;AMD's official stance, confirmed by the Framework community (Framework Laptop 13 Ryzen AI 9 HX 370):&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"AMD ROCm does not support the Radeon 890M (gfx1150). PyTorch cannot run with ROCm on this GPU."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The Vulkan backend in llama.cpp does work on Linux. It's faster than CPU and has no VRAM cap. But it's nowhere near as fast as native ROCm acceleration would be. You're getting maybe 40-60% of the performance you'd get with a supported GPU stack.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Decision Matrix
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Option&lt;/th&gt;
&lt;th&gt;Works?&lt;/th&gt;
&lt;th&gt;Performance&lt;/th&gt;
&lt;th&gt;VRAM Limit&lt;/th&gt;
&lt;th&gt;Verdict&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Windows + DirectML&lt;/td&gt;
&lt;td&gt;✅&lt;/td&gt;
&lt;td&gt;Low&lt;/td&gt;
&lt;td&gt;3.5GB hard cap&lt;/td&gt;
&lt;td&gt;Fine for 3B-7B, useless for anything bigger&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Windows + llama.cpp Vulkan&lt;/td&gt;
&lt;td&gt;✅&lt;/td&gt;
&lt;td&gt;Medium-low&lt;/td&gt;
&lt;td&gt;3.5GB hard cap&lt;/td&gt;
&lt;td&gt;Same cap, slightly better perf&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Linux + ROCm&lt;/td&gt;
&lt;td&gt;❌&lt;/td&gt;
&lt;td&gt;N/A&lt;/td&gt;
&lt;td&gt;N/A&lt;/td&gt;
&lt;td&gt;ROCm doesn't support gfx1150&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Linux + llama.cpp Vulkan&lt;/td&gt;
&lt;td&gt;✅&lt;/td&gt;
&lt;td&gt;Medium&lt;/td&gt;
&lt;td&gt;No cap&lt;/td&gt;
&lt;td&gt;Best option for iGPU inference today&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Linux + CPU-only (96GB)&lt;/td&gt;
&lt;td&gt;✅&lt;/td&gt;
&lt;td&gt;Slow (2-3 tok/s)&lt;/td&gt;
&lt;td&gt;No cap&lt;/td&gt;
&lt;td&gt;Works for everything, patience required&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;The HX 370 is AMD's flagship mobile AI chip. It's built on a 4nm process, has an XDNA 2 NPU rated for 50+ TOPS, and pairs with a capable RDNA 3.5 iGPU. AMD clearly wants this chip in the "AI PC" category.&lt;/p&gt;

&lt;p&gt;But the reality:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AMD's GPU software stack is fragmented.&lt;/strong&gt; ROCm works on their discrete GPUs (RX 7900 series, some W-series) and Instinct cards. It does not work on RDNA 3.5 integrated GPUs. Period. The Windows HIP SDK also doesn't support it. If you bought an HX 370 machine thinking ROCm would be available, you bought a paperweight for AI workloads.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Windows' driver model is actively hostile to shared-memory GPUs.&lt;/strong&gt; The 3.5GB hard cap isn't a bug. It's a deliberate safety boundary in WDDM. Apple Silicon Macs can share 64GB+ between CPU and GPU seamlessly. The HX 370 with 96GB of RAM should be able to do the same. It cannot. Not on Windows.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Linux works, but not well enough.&lt;/strong&gt; The Vulkan backend lifts the VRAM cap, which is the most important win. But without ROCm, you're leaving performance on the table — roughly 40-60% of what a properly supported GPU would deliver.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  So What Does Work?
&lt;/h2&gt;

&lt;p&gt;For this exact machine (HX 370 + Radeon 890M), the realistic stack is:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;For production workloads / multi-model serving:&lt;/strong&gt; Linux + CPU-only. 96GB RAM handles Gemma 4 27B, Llama 3 70B, Qwen 2.5 72B at Q3/Q4. Slow but reliable.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;For single-user inference / experimentation:&lt;/strong&gt; Linux + llama.cpp Vulkan. Best balance of VRAM headroom and acceleration. No hard cap, no heavy tuning.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;For portability / Windows-only environments:&lt;/strong&gt; Windows + llama.cpp Vulkan. Stuck at the 3.5GB cap. Fine for 3B models, frustrating for anything larger.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Structural Problem
&lt;/h2&gt;

&lt;p&gt;This isn't about Windows vs Linux fanboyism. The structural issue is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;AMD's software commitment stops at their discrete GPU line.&lt;/strong&gt; Integrated GPUs are second-class citizens for AI — despite being in every Ryzen AI chip shipping today.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Windows' driver architecture assumes dGPUs with dedicated VRAM.&lt;/strong&gt; Shared-memory GPUs are an afterthought, and the security hypervisor layer makes it worse.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Linux avoids the cap but can't fill the acceleration gap.&lt;/strong&gt; Without ROCm, you're running on Vulkan — which is a translation layer, not a native compute stack.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The HX 370 is a great CPU with a mediocre AI story. If you're building a local inference box today, you're better off pairing it with a discrete GPU — or just skipping Windows entirely for AI workloads.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Using an HX 370 for LLMs? A ⭐ or a one-word issue tells me what to build next — helps more than you'd think.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>hardware</category>
      <category>linux</category>
      <category>windows</category>
    </item>
    <item>
      <title>Visualize Your AI-Written Code as an Interactive Knowledge Graph</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Mon, 25 May 2026 13:00:33 +0000</pubDate>
      <link>https://dev.to/lanternproton/visualize-your-ai-written-code-as-an-interactive-knowledge-graph-28ic</link>
      <guid>https://dev.to/lanternproton/visualize-your-ai-written-code-as-an-interactive-knowledge-graph-28ic</guid>
      <description>&lt;p&gt;▸ &lt;strong&gt;Understand-Anything-Public&lt;/strong&gt; is a new open-source GitHub project that converts your entire codebase into an interactive knowledge graph. If you've ever inherited a 200K-line project with zero documentation, you know the pain — this tool draws you a map instead of forcing you to read file by file.&lt;/p&gt;

&lt;p&gt;▸ The same project highlights a growing trend in AI-assisted development: when bugs surface, devs now respond with "I need to check — I don't know how the AI wrote it either." It's a candid acknowledgment that AI-generated code is increasingly opaque, making tools like Understand-Anything-Public essential for maintenance.&lt;/p&gt;

&lt;p&gt;▸ By mapping code structure visually, these tools help teams onboard faster, identify dead code paths, and trace dependencies without manually following imports. For teams shipping AI-generated code at velocity, a knowledge graph layer isn't a luxury — it's becoming table stakes.&lt;/p&gt;

&lt;p&gt;▸ As LLM-generated code proliferates, the bottleneck isn't writing — it's understanding. The next wave of devtools will focus less on generation and more on comprehension.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; @https1024 (互联网从业者充电站)&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Two items today from Chinese tech Telegram channels — merged into one briefing.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>devtools</category>
      <category>opensource</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Kirin 2026 vs Snapdragon 2026: The Full Chip Gap Landscape</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Mon, 25 May 2026 10:02:02 +0000</pubDate>
      <link>https://dev.to/lanternproton/kirin-2026-vs-snapdragon-2026-the-full-chip-gap-landscape-5671</link>
      <guid>https://dev.to/lanternproton/kirin-2026-vs-snapdragon-2026-the-full-chip-gap-landscape-5671</guid>
      <description>&lt;h2&gt;
  
  
  Executive Summary
&lt;/h2&gt;

&lt;p&gt;At ISCAS 2026 (May 25, 2026), Huawei board member and Semiconductor Business President He Tingbo unveiled two things:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Tau (τ) Law&lt;/strong&gt; — A design philosophy that replaces geometry scaling with signal propagation delay (τ) optimization&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Kirin 2026&lt;/strong&gt; — The first chip using "logic folding" technology, shipping fall 2026&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Key specs of Kirin 2026:&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Spec&lt;/th&gt;
&lt;th&gt;Kirin 9030 Pro (current)&lt;/th&gt;
&lt;th&gt;Kirin 2026 (fall)&lt;/th&gt;
&lt;th&gt;Improvement&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;P-core frequency&lt;/td&gt;
&lt;td&gt;2.75 GHz&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;3.1 GHz&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;+12.7%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Transistor density&lt;/td&gt;
&lt;td&gt;~155 MTr/mm² (est.)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;238 MTr/mm²&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;+53.5%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;P-core efficiency&lt;/td&gt;
&lt;td&gt;baseline&lt;/td&gt;
&lt;td&gt;+41%&lt;/td&gt;
&lt;td&gt;+41%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;First milestone&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;First Kirin over 3GHz&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;To evaluate these numbers, we need a complete 2026 mobile chip landscape.&lt;/p&gt;




&lt;h2&gt;
  
  
  The 2026 Mobile SoC Leaderboard
&lt;/h2&gt;

&lt;p&gt;Data source: unite4buy AnTuTu 11 ranking, May 2026&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;#&lt;/th&gt;
&lt;th&gt;Chip&lt;/th&gt;
&lt;th&gt;AnTuTu 11&lt;/th&gt;
&lt;th&gt;GB6 (SC/MC)&lt;/th&gt;
&lt;th&gt;Process&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Snapdragon 8 Elite Gen 6 Pro&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;4,587,993&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;14,273 / 4,388&lt;/td&gt;
&lt;td&gt;2nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Snapdragon 8 Elite Gen 5 LV&lt;/td&gt;
&lt;td&gt;3,834,473&lt;/td&gt;
&lt;td&gt;12,658 / 3,873&lt;/td&gt;
&lt;td&gt;3nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Snapdragon 8 Elite Gen 5&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;3,717,334&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;12,216 / 3,768&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;3nm&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;Snapdragon 8 Gen 5&lt;/td&gt;
&lt;td&gt;3,518,837&lt;/td&gt;
&lt;td&gt;9,894 / 2,744&lt;/td&gt;
&lt;td&gt;3nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;MediaTek Dimensity 9500&lt;/td&gt;
&lt;td&gt;3,168,848&lt;/td&gt;
&lt;td&gt;10,155 / 3,451&lt;/td&gt;
&lt;td&gt;3nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;Snapdragon 8 Elite (Gen 4)&lt;/td&gt;
&lt;td&gt;3,109,884&lt;/td&gt;
&lt;td&gt;10,111 / 3,211&lt;/td&gt;
&lt;td&gt;3nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;13&lt;/td&gt;
&lt;td&gt;Samsung Exynos 2500&lt;/td&gt;
&lt;td&gt;2,421,783&lt;/td&gt;
&lt;td&gt;8,999 / 2,525&lt;/td&gt;
&lt;td&gt;3nm&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;~14&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Kirin 2026 (est.)&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~2,400,000&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~6,500 / ~1,900&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Domestic + Logic Fold&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;18&lt;/td&gt;
&lt;td&gt;HiSilicon Kirin 9030 Pro&lt;/td&gt;
&lt;td&gt;2,099,856&lt;/td&gt;
&lt;td&gt;5,808 / 1,677&lt;/td&gt;
&lt;td&gt;Domestic process&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;SD 8 Elite Gen 5 (currently shipping flagship) core config:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;2x Phoenix @ 4.61 GHz&lt;/strong&gt; (Prime cores)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;6x Phoenix @ 3.63 GHz&lt;/strong&gt; (Performance cores)&lt;/li&gt;
&lt;li&gt;Adreno 840 GPU&lt;/li&gt;
&lt;li&gt;TSMC N3E (3nm)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Gap, Quantified
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Frequency — The Least Painful Dimension
&lt;/h3&gt;

&lt;p&gt;Kirin 2026's 3.1 GHz is its P-core (performance) frequency:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Comparison&lt;/th&gt;
&lt;th&gt;Level&lt;/th&gt;
&lt;th&gt;Gap&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Elite 5 Perf @ 3.63 GHz&lt;/td&gt;
&lt;td&gt;P-core vs P-core&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;-15%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Elite 5 Prime @ 4.61 GHz&lt;/td&gt;
&lt;td&gt;P-core vs Prime&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;-33%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Elite 6 Pro @ 5.2 GHz&lt;/td&gt;
&lt;td&gt;P-core vs peak&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;-40%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Frequency is actually the &lt;em&gt;least&lt;/em&gt; of Kirin's problems. The gap is in IPC and microarchitecture.&lt;/p&gt;

&lt;h3&gt;
  
  
  Single-Core — The Bleeding Edge
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Geekbench 6 single-core comparison:&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Chip&lt;/th&gt;
&lt;th&gt;GB6 SC&lt;/th&gt;
&lt;th&gt;vs Kirin 2026 (est.)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;SD Elite Gen 6 Pro&lt;/td&gt;
&lt;td&gt;14,273&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.46×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD Elite Gen 5&lt;/td&gt;
&lt;td&gt;12,216&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.53×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Apple A19 Pro&lt;/td&gt;
&lt;td&gt;10,728&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.61×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Dimensity 9500&lt;/td&gt;
&lt;td&gt;10,155&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.64×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD Elite Gen 4&lt;/td&gt;
&lt;td&gt;10,111&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.64×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Gen 5&lt;/td&gt;
&lt;td&gt;9,894&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.66×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Kirin 2026 (est.)&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~6,500&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;1.00×&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Kirin 9030 Pro&lt;/td&gt;
&lt;td&gt;5,808&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;1.12×&lt;/strong&gt; improvement&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;At ~6,500 GB6 single-core, Kirin 2026's CPU performance is roughly equivalent to a &lt;strong&gt;2022 Snapdragon 8 Gen 2&lt;/strong&gt; (~6,800 points). In peak raw CPU performance, Huawei trails by about &lt;strong&gt;4 years&lt;/strong&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  Transistor Density — The One Dimension Where Kirin Leads
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Chip&lt;/th&gt;
&lt;th&gt;Density&lt;/th&gt;
&lt;th&gt;Process&lt;/th&gt;
&lt;th&gt;vs Kirin 2026&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Kirin 2026&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;238 MTr/mm²&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Domestic + Logic Fold&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;—&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Elite Gen 5&lt;/td&gt;
&lt;td&gt;~200 MTr/mm²&lt;/td&gt;
&lt;td&gt;N3E&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;19% behind&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SD 8 Gen 3&lt;/td&gt;
&lt;td&gt;~180 MTr/mm²&lt;/td&gt;
&lt;td&gt;N4P&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;32% behind&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;On a sanctioned process, Kirin 2026 achieves &lt;strong&gt;higher transistor density than TSMC N3E&lt;/strong&gt;. This is the direct payoff of logic folding — exactly what τ Law predicts.&lt;/p&gt;




&lt;h2&gt;
  
  
  The τ Law Lens
&lt;/h2&gt;

&lt;p&gt;τ Law says: "When geometry scaling hits the wall, stop making transistors smaller. Make the signal between them faster instead."&lt;/p&gt;

&lt;p&gt;Kirin 2026 is the &lt;strong&gt;first production validation&lt;/strong&gt; of this thesis.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Grade&lt;/th&gt;
&lt;th&gt;Note&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Density&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;A&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;238 MTr/mm² beats N3E&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Efficiency&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;A-&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;P-core efficiency +41%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Raw CPU perf&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;C&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;~4 years behind&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Peak experience&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;D&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Most apps will feel the gap&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;This is exactly what you'd expect from a first-gen logic folding product. You can't simultaneously optimize density, frequency, power, and IPC on a brand-new design methodology. The real test will be the &lt;strong&gt;iteration curve&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Can Gen 2 (2027) push single-core to ~8,000? → This path is viable&lt;/li&gt;
&lt;li&gt;Is it stuck at 6,500-7,000? → This path is "least bad under sanctions"&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What This Really Matters
&lt;/h2&gt;

&lt;p&gt;Kirin 2026 is a mirror with two reflections:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Mirror 1:&lt;/strong&gt; Under sanctions, with no access to TSMC's leading edge, no full EDA tooling, and limited ARM architecture access — Huawei delivered a chip whose transistor density beats everyone. That is a world-class engineering achievement.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Mirror 2:&lt;/strong&gt; But density doesn't run apps. Single-core performance does. And at ~50% of Qualcomm's latest, users will notice. On every app launch. Every scroll. Every load.&lt;/p&gt;

&lt;p&gt;Both reflections are true. They don't contradict each other.&lt;/p&gt;




&lt;h2&gt;
  
  
  Bottom Line
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Kirin 2026 proves τ Law works for density. It has yet to prove it works for performance.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The logic folding path is one of the most interesting engineering bets in the post-Moore era. But it needs 2-3 generations of iteration before it can threaten Qualcomm on user-perceptible benchmarks. Until then, the gap graph tells the real story: Huawei is competing in a different league, with a different playbook, and the scoreboard reads what the sanctions intended it to read.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Sources: cnBeta 1563522 (ISCAS 2026 coverage) | unite4buy Mobile Processor Ranking 2026&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Written 2026-05-25&lt;/em&gt;&lt;/p&gt;

</description>
      <category>chip</category>
      <category>huawei</category>
      <category>qualcomm</category>
      <category>hardware</category>
    </item>
    <item>
      <title>Windows Has an AI Problem. Can HarmonyOS PC Be the Answer?</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Sun, 24 May 2026 18:18:32 +0000</pubDate>
      <link>https://dev.to/lanternproton/windows-has-an-ai-problem-can-harmonyos-pc-be-the-answer-3p9n</link>
      <guid>https://dev.to/lanternproton/windows-has-an-ai-problem-can-harmonyos-pc-be-the-answer-3p9n</guid>
      <description>&lt;h1&gt;
  
  
  Windows Has an AI Problem. Can HarmonyOS PC Be the Answer?
&lt;/h1&gt;

&lt;p&gt;In May 2025, a mini PC with a Ryzen 7 8845HS sat on my desk. It had an NPU rated at 16 TOPS — just enough for Microsoft's Copilot+ checklist. I was trying to run a 14B local model on it. The result: 2.1 tokens per second on CPU, the iGPU was bottlenecked by shared memory bandwidth, and the NPU was completely unusable — it only worked with Microsoft's proprietary QNN API, and no open-source LLM runtime supported it.&lt;/p&gt;

&lt;p&gt;This isn't a hardware problem. It's a structural one.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Windows AI Trap
&lt;/h2&gt;

&lt;p&gt;Windows has a fundamental architectural disadvantage for local AI that no amount of NPU TOPS marketing can fix. It stems from three layers:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Fragmented Memory Architecture
&lt;/h3&gt;

&lt;p&gt;The defining feature of the M-series Mac is &lt;strong&gt;unified memory&lt;/strong&gt; — a single pool of high-bandwidth, low-latency RAM shared between CPU, GPU, and Neural Engine. A MacBook with 64GB unified memory can run a 70B parameter model (quantized to Q4, ~40GB) because the GPU has full access to all 64GB at 400+ GB/s.&lt;/p&gt;

&lt;p&gt;Windows PCs have no equivalent. The architecture is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;System RAM&lt;/strong&gt;: 16-64GB (DDR5/LPDDR5, 50-80 GB/s)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;GPU VRAM&lt;/strong&gt;: 8-24GB (GDDR6/7, 400-800 GB/s)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;NPU memory&lt;/strong&gt;: Shared with system, but bandwidth-constrained and API-locked&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The GPU is the best inference engine, but it's limited to VRAM. A 14B Q4 model needs ~8GB — fits on a 12GB+ card. A 70B Q4 needs ~40GB — only RTX 6000 Ada ($6,800) or server GPUs. The system RAM has the capacity but runs at 1/10th the bandwidth. The NPU has the TOPS but no ecosystem.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Result&lt;/strong&gt;: Every Windows AI PC is a "here's the capacity, here's the bandwidth, here's the TOPS — but you can't use all three at once" machine. Apple gets to use all three simultaneously.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. OEM Incentive Misalignment
&lt;/h3&gt;

&lt;p&gt;Microsoft doesn't make the hardware — Dell, Lenovo, HP, ASUS do. Each OEM competes on price. The cheapest SSD, the slowest RAM, the smallest battery. NPU is a checkbox component, not a system-level optimization. No OEM invests in unified memory because it requires a custom SoC and motherboard — which means they can't differentiate on any other dimension.&lt;/p&gt;

&lt;p&gt;Qualcomm's Snapdragon X Elite was supposed to fix this. It has unified memory (LPDDR5X, up to 64GB, 135 GB/s). But Windows on ARM has its own problems: x86 emulation overhead, driver compatibility, and the same OEM cost-cutting pressure that makes Lenovo ship a 45W charger with a 28W SoC.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. The NPU Ecosystem Is Still Born
&lt;/h3&gt;

&lt;p&gt;Every Windows NPU needs a different SDK:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Intel NPU&lt;/strong&gt;: OpenVINO&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AMD NPU&lt;/strong&gt;: Ryzen AI / DirectML&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Qualcomm NPU&lt;/strong&gt;: QNN / ONNX Runtime&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Microsoft's own&lt;/strong&gt;: DirectML (if it supports your hardware)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No major open-source LLM runtime — not llama.cpp, not MLX, not ExLlamaV2 — supports any of these NPUs for text generation. The 16-45 TOPS on the spec sheet is a marketing number. In practice, those TOPS are only accessible through Microsoft's proprietary pipeline for Copilot+ features like real-time captions or Windows Studio Effects. Try running Llama 3.2 on the NPU. You can't.&lt;/p&gt;

&lt;h2&gt;
  
  
  Apple's Moat Is Getting Deeper
&lt;/h2&gt;

&lt;p&gt;The M4 Ultra makes 192GB unified memory available to the GPU. A single Mac Studio can run Llama 3.1 405B (Q2, ~100GB) entirely locally. Not fast, but it works. With MLX, a 70B model runs at 15-20 tok/s on an M4 Ultra.&lt;/p&gt;

&lt;p&gt;Apple doesn't compete on peak TOPS or FLOPs. It competes on &lt;strong&gt;usable architecture&lt;/strong&gt; — the hardware memory model that makes inference practical. And this advantage compounds as models grow: a 2027-era 1T-parameter MoE model will need 200-300GB. Only Apple's architecture can deliver that at a consumer price point.&lt;/p&gt;

&lt;p&gt;Microsoft's response is to push NPUs harder — but a 100 TOPS NPU with 16GB of slow shared memory is still worse than a 30 TOPS GPU with 64GB of fast unified memory. The bottleneck is bandwidth and capacity, not multiply-accumulate operations.&lt;/p&gt;

&lt;h2&gt;
  
  
  Enter HarmonyOS PC: A Clean Sheet
&lt;/h2&gt;

&lt;p&gt;Huawei launched HarmonyOS PC on May 19, 2025, powered by the Kirin X90 — a 5nm SoC with 14 CPU cores, an integrated GPU (Maleoon), and a self-developed NPU. This is a &lt;strong&gt;mobile-SoC-derived architecture&lt;/strong&gt; — which means it's inherently unified memory. The CPU, GPU, and NPU share the same LPDDR5 pool.&lt;/p&gt;

&lt;p&gt;This is the only non-Apple PC architecture that natively supports unified memory. And that matters.&lt;/p&gt;

&lt;h3&gt;
  
  
  Kirin X90: What We Know
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Dimension&lt;/th&gt;
&lt;th&gt;Kirin X90&lt;/th&gt;
&lt;th&gt;Apple M4&lt;/th&gt;
&lt;th&gt;Snapdragon X Elite&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Process&lt;/td&gt;
&lt;td&gt;5nm (SMIC)&lt;/td&gt;
&lt;td&gt;3nm (TSMC)&lt;/td&gt;
&lt;td&gt;4nm (TSMC)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;CPU&lt;/td&gt;
&lt;td&gt;14-core&lt;/td&gt;
&lt;td&gt;10-core&lt;/td&gt;
&lt;td&gt;12-core&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;GPU&lt;/td&gt;
&lt;td&gt;Maleoon (custom)&lt;/td&gt;
&lt;td&gt;10-40 core&lt;/td&gt;
&lt;td&gt;Adreno&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;NPU&lt;/td&gt;
&lt;td&gt;Custom, AI +200% vs prev gen&lt;/td&gt;
&lt;td&gt;38 TOPS&lt;/td&gt;
&lt;td&gt;45 TOPS&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Memory&lt;/td&gt;
&lt;td&gt;LPDDR5 (unified)&lt;/td&gt;
&lt;td&gt;LPDDR5 (unified)&lt;/td&gt;
&lt;td&gt;LPDDR5x (unified)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;AI Ops (claimed)&lt;/td&gt;
&lt;td&gt;~60-80 TOPS*&lt;/td&gt;
&lt;td&gt;38 TOPS&lt;/td&gt;
&lt;td&gt;45 TOPS&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;em&gt;*Estimated from Huawei's "200% improvement over previous gen" claim, assuming Kirin 9010 NPU baseline of ~30 TOPS.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The architecture is right — unified memory is the non-negotiable prerequisite for local AI inference. But capacity is the open question.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The 14B model test&lt;/strong&gt;: A 14B Q4 model needs ~8GB. If the X90 supports 32GB LPDDR5 — MacBook Air territory — it fits comfortably. The GPU (Maleoon) handles inference via Vulkan/compute shaders, while both the GPU and CPU share the full memory bandwidth. Inference speed would depend on GPU optimization, but 10-20 tok/s is plausible with proper runtime support.&lt;/p&gt;

&lt;p&gt;At 16GB, it fits but with memory pressure from the OS and browser. At 8GB (phone territory), forget it.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Supply Chain Math: CXMT and the Memory Squeeze
&lt;/h2&gt;

&lt;p&gt;Here's where the theoretical architecture meets reality. HarmonyOS PC needs LPDDR5 memory. Huawei's HBM technology (the HiZQ 2.0 used in Ascend 950) supplies — and competes for — the same DRAM fab capacity.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;CXMT (长鑫存储) is the only Chinese DRAM IDM.&lt;/strong&gt; By mid-2026, their three 12-inch fabs (two in Hefei, one in Beijing) reach approximately 300K wafers/month total capacity.&lt;/p&gt;

&lt;p&gt;The allocation problem:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Product&lt;/th&gt;
&lt;th&gt;Die Consumption&lt;/th&gt;
&lt;th&gt;Monthly Wafer Need (Est.)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;strong&gt;Ascend 950 HBM&lt;/strong&gt; (HiZQ 2.0)&lt;/td&gt;
&lt;td&gt;Each 144GB chip = 4 stacks of 12-hi = 48 DRAM dies + base dies&lt;/td&gt;
&lt;td&gt;30K-50K wafers (at 3x DDR die consumption per GB)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;strong&gt;Kirin X90 LPDDR5&lt;/strong&gt; (5M units/year)&lt;/td&gt;
&lt;td&gt;5M × 4-8 dies/device = 20-40M dies total&lt;/td&gt;
&lt;td&gt;~25K-50K wafers (one-time for first batch)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;DDR5 commodity&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Industry contracts, server, PC aftermarket&lt;/td&gt;
&lt;td&gt;Remaining 200K-240K wafers&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;The squeeze&lt;/strong&gt;: HBM consumes roughly &lt;strong&gt;3x the wafer capacity per GB&lt;/strong&gt; compared to standard DDR (per Tom's Hardware analysis — yield loss from stacking, smaller dies, base die overhead). A single Atlas 950 SuperNode (8,192 Ascend 950 chips, 1.1 PB HBM) uses as much DRAM wafer capacity as millions of PCs.&lt;/p&gt;

&lt;p&gt;CXMT's profitability adds pressure. In H1 2026, CXMT reported ¥50-57 billion net profit, driven by the global DRAM shortage. The highest-margin product is HBM, not LPDDR. From a business perspective, CXMT's incentive is to allocate more capacity to HBM for Ascend and less to commodity LPDDR.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Verdict on availability&lt;/strong&gt;: CXMT's 300K wafer/month capacity &lt;em&gt;can&lt;/em&gt; supply both HBM and LPDDR5, but at the volumes Huawei needs, the LPDDR5 allocation is likely constrained. The better question isn't "can CXMT make enough" — it's "how much is Huawei willing to pay vs NAND margins on the spot market."&lt;/p&gt;

&lt;p&gt;If Huawei procures CXMT LPDDR5 at above-market prices (internal transfer pricing as a related-party transaction), they secure supply at the cost of lower margins. If they try to source from Samsung/SK Hynix, they face US export controls and uncertain allocation (both Korean vendors prioritize HBM for NVIDIA).&lt;/p&gt;

&lt;p&gt;The pragmatic path: CXMT supplies just enough LPDDR5 for Kirin X90 volumes (~25K wafers/month at scale), while the bulk of HBM capacity goes to Ascend 950 — which has higher margins and strategic AI infrastructure importance.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bottom line&lt;/strong&gt;: A HarmonyOS PC with 16GB RAM is likely. One with 32GB is possible but expensive. One with 64GB (Mac-level) is unlikely at launch.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Real Disruption: Distributed AI
&lt;/h2&gt;

&lt;p&gt;The most interesting possibility for HarmonyOS PC isn't local performance — it's what no other platform can do.&lt;/p&gt;

&lt;p&gt;HarmonyOS has a &lt;strong&gt;distributed hardware abstraction layer&lt;/strong&gt; that treats all devices in a user's ecosystem as a single resource pool. This was originally designed for file sharing and phone-as-webcam use cases. Applied to AI inference, it becomes something genuinely different:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Your phone's NPU + your PC's NPU + your tablet's NPU = pooled inference&lt;/li&gt;
&lt;li&gt;Model layers sharded across devices over high-speed local interconnect&lt;/li&gt;
&lt;li&gt;A 14B model's attention layers run on the PC GPU, embedding layers on the phone NPU&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Windows can't do this. macOS can't do this (Continuity doesn't extend to GPU compute pooling). HarmonyOS's distributed architecture is unique.&lt;/p&gt;

&lt;p&gt;The practical challenge: interconnect latency. Even WiFi 7 at 5Gbps has 1-3ms latency between devices, which is 10-100x slower than on-chip memory access. Real-time layer-sharded inference requires sub-microsecond synchronization. This limits distributed AI to batch/offline workloads (background summarization, async data processing) rather than interactive chat.&lt;/p&gt;

&lt;p&gt;But for the use case of "leave your PC processing a large model overnight while sharing the workload with your phone" — it works.&lt;/p&gt;

&lt;h2&gt;
  
  
  Three Scenarios for HarmonyOS PC
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Scenario 1: The "Good Enough" (Likely)
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;16GB LPDDR5, unified memory, ~60 TOPS NPU&lt;/li&gt;
&lt;li&gt;Runs 7B models at 15-25 tok/s via GPU inference&lt;/li&gt;
&lt;li&gt;Runs 14B models at 5-10 tok/s (usable but slow)&lt;/li&gt;
&lt;li&gt;Distributed AI for async/background workloads&lt;/li&gt;
&lt;li&gt;Price: ¥3,999-4,999 ($550-700)&lt;/li&gt;
&lt;li&gt;Verdict: A legitimate third option alongside Mac and Windows, but not a "Mac killer"&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Scenario 2: The Memory Expansion (Possible)
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;32GB LPDDR5 (CXMT secures allocation, higher BOM cost)&lt;/li&gt;
&lt;li&gt;Runs 14B Q4 models at 15-20 tok/s&lt;/li&gt;
&lt;li&gt;Runs 32B Q4 models at 5-8 tok/s&lt;/li&gt;
&lt;li&gt;Distributed AI with phone NPU for real-time assistance&lt;/li&gt;
&lt;li&gt;Price: ¥5,999-6,999 ($830-970)&lt;/li&gt;
&lt;li&gt;Verdict: Genuinely competitive with MacBook Air M4 for local AI workloads&lt;/li&gt;
&lt;li&gt;Key risk: CXMT LPDDR5 allocation, BOM margin pressure&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Scenario 3: The Distributed Breakthrough (Long Shot)
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;16-32GB LPDDR5&lt;/li&gt;
&lt;li&gt;HarmonyOS 6+ with native distributed inference API&lt;/li&gt;
&lt;li&gt;"AI router" mode: PC serves as local inference server for all household HarmonyOS devices&lt;/li&gt;
&lt;li&gt;Third-party model runtime (llama.cpp port, MindSpore Lite + Vulkan)&lt;/li&gt;
&lt;li&gt;Verdict: Unique value proposition Windows and Mac can't replicate&lt;/li&gt;
&lt;li&gt;Key unlock: Software ecosystem maturity, developer adoption&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What Needs to Happen
&lt;/h2&gt;

&lt;p&gt;For HarmonyOS PC to matter in the AI era, three things must align:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Memory capacity must exceed 16GB.&lt;/strong&gt; Without this, the unified memory advantage is theoretical. A 14B model barely fits, leaving no room for the OS or applications. 32GB is the sweet spot.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. An open runtime must exist.&lt;/strong&gt; If the only way to run AI is through Huawei's MindSpore/CANN pipeline, developer adoption will be slow — the same trap Windows fell into with proprietary NPU SDKs. A Vulkan-based llama.cpp port would be transformative.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. The distributed inference API must ship as a first-party feature.&lt;/strong&gt; Not a developer preview. Not an enterprise SKU. A system-level API that any app can call: &lt;code&gt;harmonyos.distribute.infer(model, input, devices=[pc, phone])&lt;/code&gt;. This is the feature that differentiates HarmonyOS from every other platform.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Honest Assessment
&lt;/h2&gt;

&lt;p&gt;HarmonyOS PC solves the architectural problem Windows has — unified memory — and has a theoretical distributed computing advantage no other platform can match. But it faces three constraints:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Manufacturing&lt;/strong&gt;: 5nm at SMIC is behind TSMC's 3nm. Performance per watt will trail Apple M4.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ecosystem&lt;/strong&gt;: No Adobe, no major games, limited developer tools. This limits adoption to first-wave enthusiasts and government/enterprise procurement.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Memory supply&lt;/strong&gt;: CXMT capacity is strained by HBM demand. 32GB LPDDR5 at scale is not guaranteed.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;The best case for HarmonyOS PC&lt;/strong&gt;: It becomes what Windows should have been in the AI era — a platform where local AI inference is architecturally natural, not bolted on. But it only happens if Huawei prioritizes memory capacity over NPU TOPS, and opens the runtime to the community.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The worst case&lt;/strong&gt;: It's another Windows — impressive NPU TOPS on paper, functionally inaccessible for real AI workloads, held back by proprietary SDKs and memory constraints.&lt;/p&gt;

&lt;p&gt;Right now, the data points toward Scenario 1: a good product that's competitive but not transformative. The switch flips to transformative at Scenario 2 — and that requires CXMT to deliver 32GB-class LPDDR5 at scale, which is a supply chain question, not a technology one.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;This analysis is a best-effort public-data assessment. CXMT capacity figures are estimates based on public reporting. Kirin X90 NPU TOPS is inferred from Huawei's "200% improvement" claim relative to an unknown baseline.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>hardware</category>
      <category>windows</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>750,000 Chips, 140 Trillion Tokens: The Math Behind DeepSeek's Permanent Price Cut</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Sat, 23 May 2026 05:11:10 +0000</pubDate>
      <link>https://dev.to/lanternproton/75000-chips-140-trillion-tokens-the-math-behind-deepseeks-permanent-price-cut-4890</link>
      <guid>https://dev.to/lanternproton/75000-chips-140-trillion-tokens-the-math-behind-deepseeks-permanent-price-cut-4890</guid>
      <description>&lt;p&gt;DeepSeek made its V4-Pro 75% price cut permanent on May 22. The conventional read: "they got cheaper hardware." The real story is more interesting — and it's about a gap that's not closing fast enough.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Happened
&lt;/h2&gt;

&lt;p&gt;On May 22, 2026, DeepSeek announced that the 75% discount on its V4-Pro API would become permanent. The new pricing:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Before&lt;/th&gt;
&lt;th&gt;After&lt;/th&gt;
&lt;th&gt;Cut&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Input (cache miss)&lt;/td&gt;
&lt;td&gt;¥12 / 1M tokens&lt;/td&gt;
&lt;td&gt;¥3 / 1M tokens&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;75%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Output&lt;/td&gt;
&lt;td&gt;¥24 / 1M tokens&lt;/td&gt;
&lt;td&gt;¥6 / 1M tokens&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;75%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Input (cache hit)&lt;/td&gt;
&lt;td&gt;¥0.1 / 1M tokens&lt;/td&gt;
&lt;td&gt;¥0.025 / 1M tokens&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;75%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;At current exchange rates, that's roughly $0.44/M input and $0.87/M output — making V4-Pro one of the cheapest frontier-class models on the market, on par with DeepSeek's own V4-Flash but with significantly more capability.&lt;/p&gt;

&lt;p&gt;The move came exactly four weeks after V4's launch on April 24, and coincided with growing user frustration over rate limits at Google Gemini and Anthropic Claude.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Standard Narrative
&lt;/h2&gt;

&lt;p&gt;The surface-level story has three parts:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Architectural efficiency.&lt;/strong&gt; V4 uses a Mixture-of-Experts architecture with 1.6 trillion parameters, but only activates a fraction per token. This gives it a structural cost advantage over dense models of comparable capability — roughly 30% of the gap.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Supply chain scaling.&lt;/strong&gt; Huawei's Ascend 950PR entered mass production in April 2026. Huawei plans to ship ~750,000 units through the year — a 2.5x increase over 2025's 910C output. DeepSeek specifically optimized V4 for the Ascend architecture. More chips → lower unit cost → lower API pricing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Competitive positioning.&lt;/strong&gt; Western AI providers (Google, Anthropic) have been quietly tightening rate limits as demand overwhelms their GPU supply. DeepSeek is exploiting the backlash, offering unlimited usage at a fraction of the cost to capture disgruntled developers.&lt;/p&gt;

&lt;p&gt;All three are true. But none of them fully explains the magnitude of the cut — or why it's &lt;em&gt;permanent&lt;/em&gt; rather than promotional.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Math That Changes Everything
&lt;/h2&gt;

&lt;p&gt;Let's check the numbers.&lt;/p&gt;

&lt;h3&gt;
  
  
  Demand Side
&lt;/h3&gt;

&lt;p&gt;China's daily token consumption hit &lt;strong&gt;140 trillion&lt;/strong&gt; in March 2026, according to the National Data Administration. The growth trajectory:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Early 2024: 0.1 trillion/day&lt;/li&gt;
&lt;li&gt;End of 2025: 100 trillion/day&lt;/li&gt;
&lt;li&gt;March 2026: 140 trillion/day&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That's a &lt;strong&gt;1,000x increase in two years&lt;/strong&gt;, and a &lt;strong&gt;40% jump in just the last quarter&lt;/strong&gt; — implying ~13% month-over-month growth.&lt;/p&gt;

&lt;h3&gt;
  
  
  Supply Side
&lt;/h3&gt;

&lt;p&gt;Huawei's mass-produced chip for 2026 is the &lt;strong&gt;Ascend 950PR&lt;/strong&gt; (Prefill-optimized, 1 PFLOPS FP8), with the higher-end &lt;strong&gt;950DT&lt;/strong&gt; (2 PFLOPS FP8) coming in Q4. The numbers:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Chip&lt;/th&gt;
&lt;th&gt;FP8&lt;/th&gt;
&lt;th&gt;Memory&lt;/th&gt;
&lt;th&gt;Bandwidth&lt;/th&gt;
&lt;th&gt;Inference Throughput (est.)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;950PR&lt;/td&gt;
&lt;td&gt;1 PFLOPS&lt;/td&gt;
&lt;td&gt;128GB HBM&lt;/td&gt;
&lt;td&gt;1.6 TB/s&lt;/td&gt;
&lt;td&gt;~1,200 tokens/sec&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;950DT&lt;/td&gt;
&lt;td&gt;2 PFLOPS&lt;/td&gt;
&lt;td&gt;144GB HBM&lt;/td&gt;
&lt;td&gt;4 TB/s&lt;/td&gt;
&lt;td&gt;~2,400 tokens/sec&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;(Throughput derived from Huawei's published Atlas 950 SuperNode benchmark: 19.6M tokens/sec across 8,192 cards.)&lt;/p&gt;

&lt;p&gt;Now the arithmetic:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Item&lt;/th&gt;
&lt;th&gt;Value&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Total chips (2026 target)&lt;/td&gt;
&lt;td&gt;750,000 (70% PR + 30% DT)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Raw daily throughput&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;85.7 trillion tokens/day&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Inference-allocated (60%)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;51.4 trillion tokens/day&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;vs Current demand (140T)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;37% coverage&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;vs Demand in 6 months (~291T)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;18% coverage&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Even in the most optimistic scenario — every single chip dedicated to inference at 100% utilization:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Scenario&lt;/th&gt;
&lt;th&gt;vs Current&lt;/th&gt;
&lt;th&gt;vs +6 months&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;100% inference, 100% utilization&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;61% coverage&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;29% coverage&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The conclusion is stark: &lt;strong&gt;750,000 Ascend 950 chips can't cover today's demand — let alone the demand in six months.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  So Why Cut Prices?
&lt;/h2&gt;

&lt;p&gt;If supply is still a fraction of demand, permanent price cuts don't make sense in a normal market. But this is not a normal market.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Real Logic: Pre-Commitment, Not Surplus
&lt;/h3&gt;

&lt;p&gt;DeepSeek is not cutting prices because it has spare compute. It's cutting prices to &lt;strong&gt;lock in routing commitments before the hardware arrives&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Here's the timeline:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;April 24:  V4 launched, optimized for Ascend
April 24+: ByteDance orders 350,000 Ascend 950 chips (~¥40B)
May 4:     Ascend 950PR mass production confirmed
May 22:    DeepSeek makes V4-Pro 75% cut permanent
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The critical insight: DeepSeek's price cut is &lt;strong&gt;not a cost pass-through&lt;/strong&gt;. It's a &lt;strong&gt;market share pre-commitment&lt;/strong&gt; — using the promise of future Ascend supply to grab developer mindshare now, before Western competitors can resolve their own capacity issues.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Numbers Behind the Strategy
&lt;/h3&gt;

&lt;p&gt;Western providers are capacity-constrained:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Provider&lt;/th&gt;
&lt;th&gt;Constraint&lt;/th&gt;
&lt;th&gt;Signal&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Google Gemini&lt;/td&gt;
&lt;td&gt;TSMC CoWoS capacity&lt;/td&gt;
&lt;td&gt;Rate limits tightened, user backlash&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Anthropic Claude&lt;/td&gt;
&lt;td&gt;H100/B200 availability&lt;/td&gt;
&lt;td&gt;API throttling, compute-use monitoring&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;OpenAI&lt;/td&gt;
&lt;td&gt;Inference cluster rollout&lt;/td&gt;
&lt;td&gt;Delayed GPT-5 token limits&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;DeepSeek's bet: "Spend the next 6 months building developer dependency on V4-Pro's API — by the time Ascend supply catches up in H2 2026, those developers won't switch back."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This is AWS in 2006.&lt;/strong&gt; AWS wasn't cheaper than running your own servers in 2006. But it &lt;em&gt;would&lt;/em&gt; be once scale kicked in. AWS priced for the scale it planned to have, not the scale it had. DeepSeek is doing the same.&lt;/p&gt;




&lt;h2&gt;
  
  
  What 750,000 Chips Actually Buys
&lt;/h2&gt;

&lt;p&gt;The popular framing in Chinese media is "75万颗昇腾950产能大爆发." But as the math shows, 750,000 chips isn't abundance — it's barely adequacy.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Think of it this way:&lt;/strong&gt; China's token demand is growing at roughly 0.5 trillion tokens per day &lt;em&gt;every single month&lt;/em&gt; (the monthly increment itself is larger than the entire market 18 months ago). By year-end, demand will be 300-400+ trillion. Against that, 750K chips at the 950PR/DT mix buy roughly 50-85T/day of inference capacity.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Timeframe&lt;/th&gt;
&lt;th&gt;Demand (est.)&lt;/th&gt;
&lt;th&gt;Inference Supply&lt;/th&gt;
&lt;th&gt;Gap&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;March 2026&lt;/td&gt;
&lt;td&gt;140T&lt;/td&gt;
&lt;td&gt;~50T&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;90T&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;June 2026&lt;/td&gt;
&lt;td&gt;~200T&lt;/td&gt;
&lt;td&gt;~50T&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;150T&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;September 2026&lt;/td&gt;
&lt;td&gt;~290T&lt;/td&gt;
&lt;td&gt;~55T (DT ramp)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;235T&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;December 2026&lt;/td&gt;
&lt;td&gt;~420T&lt;/td&gt;
&lt;td&gt;~65T&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;355T&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The gap is &lt;em&gt;growing&lt;/em&gt;, not shrinking. Even with 75万 chips fully deployed, the supply-demand deficit more than triples over nine months.&lt;/p&gt;

&lt;p&gt;This means DeepSeek's price cut isn't a sign of market saturation. It's a sign of exactly the opposite: &lt;strong&gt;a market so unsaturated that the winner gets to define the default API for an entire generation of developers, if they can lock them in before the hardware arrives.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Three Counter-Arguments (And Why They're Weak)
&lt;/h2&gt;

&lt;h3&gt;
  
  
  "But cache hits reduce the effective compute needed"
&lt;/h3&gt;

&lt;p&gt;True — cache-hit tokens cost ~1/100th of miss tokens. And DeepSeek's cache hit rates can be high for workloads with stable system prompts. But cache hits are mostly in the &lt;em&gt;input&lt;/em&gt; direction. Output tokens — the expensive ones — still need full compute. And as agentic workloads grow (multi-turn, chain-of-thought), output-to-input ratios increase, making cache less effective.&lt;/p&gt;

&lt;h3&gt;
  
  
  "But not all 140T tokens need 950-class inference"
&lt;/h3&gt;

&lt;p&gt;Also true. Many tokens are generated by smaller models (Flash variants, Qwen, etc.) that don't need 950-level compute. But the &lt;em&gt;growth&lt;/em&gt; is in the frontier-class tokens — longer context, more complex reasoning, higher quality requirements. That's exactly where 950-class chips are needed.&lt;/p&gt;

&lt;h3&gt;
  
  
  "But they can still buy H20 / smuggled H100"
&lt;/h3&gt;

&lt;p&gt;H20 is less capable than 950PR per chip (the US-designed it to be worse). And the CHIPS Act + export controls have made H100 procurement increasingly difficult. Relying on smuggled hardware is not a supply chain strategy.&lt;/p&gt;




&lt;h2&gt;
  
  
  What This Means
&lt;/h2&gt;

&lt;h3&gt;
  
  
  For Developers
&lt;/h3&gt;

&lt;p&gt;Your inference costs are likely going &lt;strong&gt;down&lt;/strong&gt; over the next 12 months, not up — even though demand is exploding. That's unprecedented in any computing market. The driver isn't efficiency gains or manufacturing scale. It's a &lt;strong&gt;strategic subsidy&lt;/strong&gt; by Chinese AI firms betting that locking in your API calls today is worth negative margins for a year.&lt;/p&gt;

&lt;p&gt;Take the subsidy. But don't assume today's prices reflect tomorrow's costs — they reflect tomorrow's &lt;em&gt;hopes&lt;/em&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  For the Industry
&lt;/h3&gt;

&lt;p&gt;The AI API market has entered a phase that looks like price war but functions like &lt;strong&gt;infrastructure land-grab&lt;/strong&gt;. The playbook is AWS 2006, DoorDash 2019, Uber 2015: lose money on every transaction to own the default routing.&lt;/p&gt;

&lt;p&gt;When the hardware &lt;em&gt;does&lt;/em&gt; catch up — when Ascend 960 (2027) or 970 (2028) ships with 3-5x the throughput — the providers with the largest captive developer bases will convert negative margins to positive ones. Everyone else will be competing on price against incumbents they can't dislodge.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Bottom Line
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;DeepSeek's permanent price cut is not evidence that Chinese AI compute supply has caught up with demand. The math shows it hasn't — and won't for at least 12-18 months. It's evidence that DeepSeek is playing the long game: use today's negative margins to own tomorrow's default inference route, and trust that Huawei's future chips will eventually close a gap that's currently 3-5x wider than headlines suggest.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The 75% cut isn't a cost breakthrough. It's a bet that developer lock-in is worth more than current margins — and that the 75万 Ascend 950 chips shipping this year are just the beginning.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Numbers sourced from: National Data Administration (China daily token data, March 2026), Huawei Connect 2025 (Ascend 950 specs and roadmap), SCMP/DW (ByteDance order volume), DeepSeek official pricing page (May 2026). Throughput calculations based on published Atlas 950 SuperNode benchmarks. Growth projections assume continuation of 40%/quarter rate per published data.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>hardware</category>
      <category>china</category>
      <category>cloud</category>
    </item>
    <item>
      <title>From the Renaissance to the Quantum Dawn: AI, Computation, and the Next Paradigm Shift</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Sat, 23 May 2026 04:36:34 +0000</pubDate>
      <link>https://dev.to/lanternproton/from-the-renaissance-to-the-quantum-dawn-ai-computation-and-the-next-paradigm-shift-5epm</link>
      <guid>https://dev.to/lanternproton/from-the-renaissance-to-the-quantum-dawn-ai-computation-and-the-next-paradigm-shift-5epm</guid>
      <description>&lt;p&gt;Five hundred years ago, Florentine craftsmen began using linear perspective to represent three-dimensional space on a two-dimensional canvas. That movement — the Renaissance — was humanity's first systematic liberation of its own cognitive capacity. We were no longer footnotes to a divine plan. We became, in the famous formulation, "the measure of all things."&lt;/p&gt;

&lt;p&gt;Today, standing in the early summer of 2026, we are living through another, more radical renaissance: AI is releasing creativity from the tip of the elite's pen into every prompt box, while quantum computing is trying to lay a new runway for this digital explosion at the very edge of classical physics.&lt;/p&gt;




&lt;h2&gt;
  
  
  1. The Renaissance Echo: From the "Discovery of Man" to the Democratization of Intelligence
&lt;/h2&gt;

&lt;p&gt;The Renaissance achieved two things: a revolution in the &lt;strong&gt;carrier of knowledge&lt;/strong&gt; (Gutenberg's printing press) and a reversal of the &lt;strong&gt;subject of cognition&lt;/strong&gt; (from God to Man). When the Bible no longer required a priest's oral transmission, when perspective allowed ordinary people to paint convincing cathedral ceilings, humanity realized for the first time: individual reason is enough to move the world.&lt;/p&gt;

&lt;p&gt;The AI wave today is replaying this exact scene.&lt;/p&gt;

&lt;p&gt;Open-source models (Llama, Qwen), AIGC platforms, no-code tools — they let people who don't know how to code command machines to do work that once required a whole team. "Prompt" has become the new Latin. Everyone can use this language to create stories, images, and even code.&lt;/p&gt;

&lt;p&gt;AI has driven quality and efficiency gains across every field. It hasn't made humans redundant — it has freed us from repetitive, low-level mental labor to face more complex, higher-dimensional problems.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. The Computation Explosion: Classical Computing's Ceiling and the Platform Paradox
&lt;/h2&gt;

&lt;p&gt;But liberation comes at a cost: the inflation of desire. When we can generate a million stories effortlessly, we no longer want "more" — we want "better." Longer contexts, more realistic voices, more complex logical reasoning. This pursuit of &lt;em&gt;quality&lt;/em&gt; is consuming computational resources at an exponential rate.&lt;/p&gt;

&lt;p&gt;The data explosion that was once compressed by platforms has been replaced by an explosion in demand for compute, driven by AI itself.&lt;/p&gt;

&lt;p&gt;The numbers are staggering. China's daily token calls grew over 1,000x in two years, surpassing 140 trillion by March 2026. Training a trillion-parameter model requires 15,000 high-end GPUs running for 45 days, consuming 3 million kilowatt-hours. And classical computing — CPU, GPU, TPU — is hitting four walls simultaneously:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Power wall&lt;/strong&gt;: NVIDIA GB300 draws 140kW per rack. A supercomputing center is a small power plant.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cost wall&lt;/strong&gt;: High-end GPUs are scarce and expensive.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Physics wall&lt;/strong&gt;: Process nodes are approaching atomic limits. Moore's Law is limping.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data wall&lt;/strong&gt;: High-quality text data is running out.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It's as if the medieval scriptorium had just invented the printing press — only to discover there wasn't nearly enough paper or ink. What we lack isn't creativity. It's the &lt;strong&gt;computational fuel&lt;/strong&gt; to execute it.&lt;/p&gt;




&lt;h2&gt;
  
  
  3. The Quantum Dawn: The Next Computing Paradigm
&lt;/h2&gt;

&lt;p&gt;It is against this anxiety that quantum computing has walked out of physics labs and into the视野 of strategists. It's no longer a science-fiction gimmick — it's an engineering parallel universe taking shape.&lt;/p&gt;

&lt;p&gt;A classical bit is 0 or 1. A quantum bit can be both simultaneously. This means that when you face a problem requiring you to explore every possibility — predicting all protein folds, simulating a complete brain neural network, finding the optimal logistics route — a quantum computer can finish in seconds what would take a classical computer tens of thousands of years.&lt;/p&gt;

&lt;p&gt;The urgency of quantum computing is growing rapidly.&lt;/p&gt;

&lt;p&gt;In 2025-2026, global investment in quantum has visibly accelerated:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Both China and the US have designated quantum AI as a next-generation strategic priority&lt;/li&gt;
&lt;li&gt;Companies like iFlytek are betting on quantum computing as the breakthrough of the next decade&lt;/li&gt;
&lt;li&gt;Industry consensus expects practical "classical-quantum hybrid computing" by ~2030, bringing exponential acceleration to drug discovery, financial risk management, and climate simulation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Of course, today's quantum computers are still like early Renaissance perspective — the principle is right, but the operation is clumsy, error rates are high, and extreme低温 environments are required. But what they solve is not just "speed" — it's &lt;em&gt;complexity explosion itself&lt;/em&gt;. If AI is a heuristic pruning of the problem space, quantum computing is a parallel read of the entire search space.&lt;/p&gt;




&lt;h2&gt;
  
  
  4. Where the Human Fits: From Tool User to Meaning Giver
&lt;/h2&gt;

&lt;p&gt;The more powerful the technology, the sharper a philosophical question becomes: when machines can think, create, and calculate — what is left for humans?&lt;/p&gt;

&lt;p&gt;Nietzsche once prophesied the Übermensch — a new species capable of creating its own values in a meaningless world. In the age of AI and quantum computing, this image is being reshaped. The Übermensch is no longer the lone hero, but the &lt;em&gt;meta-conductor&lt;/em&gt; who can orchestrate these super-tools.&lt;/p&gt;

&lt;p&gt;The user of this essay — who builds local LLM rigs, experiments with GPUs, obsesses over thermal data — is doing one thing at a deep level: &lt;strong&gt;active dimensionality reduction&lt;/strong&gt;. They take the high-dimensional problem of "create a bedtime story" and compress it into a lower-dimensional, executable workflow: pick hardware, tune the model, batch-generate. The real creativity isn't in the details that got compressed away — it's in the choice of &lt;em&gt;what to compress and what to preserve&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;This is the deeper meaning of "reducing problem difficulty": not escaping complexity, but using new tools to restructure the problem so it becomes tractable. When quantum computing matures, this dimensionality-reduction capability will leap again — we may even simulate an entire story universe's evolution in real time, then pick the most moving timeline.&lt;/p&gt;




&lt;h2&gt;
  
  
  5. Epilogue: Walking the Narrow Path of the Post-Renaissance
&lt;/h2&gt;

&lt;p&gt;From the dome of Florence to the network of silicon neurons, humanity keeps repeating the same pattern: liberate ourselves with one tool, then let the liberated desire drive us to seek a more powerful one.&lt;/p&gt;

&lt;p&gt;The Renaissance liberated the &lt;strong&gt;human eye&lt;/strong&gt;. The Industrial Revolution liberated &lt;strong&gt;human muscle&lt;/strong&gt;. The Information Revolution liberated the &lt;strong&gt;human brain&lt;/strong&gt;. The AI revolution is liberating &lt;strong&gt;human intelligence&lt;/strong&gt;. And quantum computing may be the ultimate liberation of &lt;strong&gt;human computation&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;But no matter how powerful the tools become, the original question awakened by the Renaissance remains: &lt;strong&gt;by what measure shall humanity judge itself?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When we use AI to generate a bedtime story, we are not competing with the machine. We are in dialogue with our past selves — seeing if this time we can craft a sentence that truly touches someone. When we hope quantum computers will solve protein folding, it's not to replace doctors — it's to free humanity from certain diseases.&lt;/p&gt;

&lt;p&gt;Tools can reduce the &lt;em&gt;difficulty&lt;/em&gt; of problems. They cannot reduce &lt;em&gt;meaning&lt;/em&gt;. Meaning can only be赋予 by the person standing behind the tool — carrying their own life experience and emotional temperature.&lt;/p&gt;

&lt;p&gt;That, perhaps, is the most important thing to remember in this long journey that began with the Renaissance.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Written on a night after tuning a local LLM rig's thermals, falling asleep to an AI-generated Arabic bedtime story.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;May 2026&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>history</category>
      <category>quantum</category>
      <category>philosophy</category>
    </item>
    <item>
      <title>How Codex CLI helped me ship 3 releases in 48 hours — and what it got wrong</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Sat, 23 May 2026 00:08:36 +0000</pubDate>
      <link>https://dev.to/lanternproton/how-codex-cli-helped-me-ship-3-releases-in-48-hours-and-what-it-got-wrong-461i</link>
      <guid>https://dev.to/lanternproton/how-codex-cli-helped-me-ship-3-releases-in-48-hours-and-what-it-got-wrong-461i</guid>
      <description>&lt;p&gt;I maintain three open-source 3D printing tools: SupportSage (AI-optimized support structures), Printsight (photo-based quality inspection), and FilamentDB (filament parameter database).&lt;/p&gt;

&lt;p&gt;Over two days, Codex CLI (gpt-5.5) helped me fix 15 bugs, add 86 new tests, and ship a unified CLI — across all three projects. Here's what worked, what broke, and what I'd do differently.&lt;/p&gt;

&lt;h2&gt;
  
  
  The three projects
&lt;/h2&gt;

&lt;p&gt;The three tools form a closed loop:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;FilamentDB ──→ SupportSage ──→ 3D Print ──→ Printsight
     ↑                                          │
     └─────────── Feedback loop ←───────────────┘
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;FilamentDB&lt;/strong&gt; tells SupportSage optimal print settings for the chosen filament&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SupportSage&lt;/strong&gt; generates optimized support structures&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Printsight&lt;/strong&gt; inspects the result and feeds quality data back into the learning engine&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each was independently installable via pip. The goal was to unify them under one CLI and fix accumulated bugs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Codex CLI: the workflow
&lt;/h2&gt;

&lt;p&gt;I used Codex CLI v0.133.0 with gpt-5.5 via ChatGPT Plus. The pattern was:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Write a detailed prompt describing all bugs to fix&lt;/li&gt;
&lt;li&gt;Run &lt;code&gt;codex exec "$(cat prompt.txt)"&lt;/code&gt; from the project root&lt;/li&gt;
&lt;li&gt;Review the diff&lt;/li&gt;
&lt;li&gt;Add missing tests&lt;/li&gt;
&lt;li&gt;Commit and push&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This worked well for three projects. The prompts included exact file paths, line numbers, expected behavior, and verification commands.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Codex got right
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Printsight: 5 bugs, 16 tests
&lt;/h3&gt;

&lt;p&gt;Before: tests ran subprocess calls on image files that didn't exist in the repo. All tests silently hung.&lt;/p&gt;

&lt;p&gt;Codex fixed:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Cross-version OpenCV crash&lt;/strong&gt;: &lt;code&gt;cv2.fitLine()&lt;/code&gt; returns different types across OpenCV versions. The old code indexed into a float, which crashes on newer OpenCV&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Division by zero&lt;/strong&gt;: Warping detection could divide by zero when the direction vector was zero&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Unchecked return value&lt;/strong&gt;: &lt;code&gt;cv2.imencode()&lt;/code&gt; return value was ignored, risking garbage data&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Edge cases&lt;/strong&gt;: All-black and all-white images crashed detectors&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Better tests&lt;/strong&gt;: Rewrote the entire test suite as 16 proper unit tests using synthetic numpy arrays (no external images needed)&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  FilamentDB: 4 bugs, 42 tests
&lt;/h3&gt;

&lt;p&gt;Before: zero tests.&lt;/p&gt;

&lt;p&gt;Codex fixed:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Corrupted JSON crash&lt;/strong&gt;: &lt;code&gt;json.load()&lt;/code&gt; with no try/except — a corrupted database file would crash the CLI&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recursion risk&lt;/strong&gt;: &lt;code&gt;recommend()&lt;/code&gt; called itself recursively when falling back to brand defaults, risking stack overflow&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Empty query matched everything&lt;/strong&gt;: &lt;code&gt;search("")&lt;/code&gt; returned every entry because &lt;code&gt;"" in string&lt;/code&gt; is always True&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Missing tests&lt;/strong&gt;: 42 tests covering search, recommend, compare, alternatives, list, empty database, and corrupted JSON&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  SupportSage: 6 bugs, 84 tests
&lt;/h3&gt;

&lt;p&gt;Before: 77 tests.&lt;/p&gt;

&lt;p&gt;Codex fixed:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Unupdated parameter&lt;/strong&gt;: &lt;code&gt;borderline_angle&lt;/code&gt; was never updated in the learning loop, always returning the default&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Too-slow learning&lt;/strong&gt;: Angle updates multiplied by 0.1 damping, making 20+ records barely move the needle&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Missing validation&lt;/strong&gt;: &lt;code&gt;quality_score&lt;/code&gt; accepted negative values and values over 100&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Partial file hash&lt;/strong&gt;: Hash fallback only read the first 64KB of STL files, causing collisions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Fragile JSON loading&lt;/strong&gt;: Printer/material profile loading crashed on corrupted JSON&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ignored quality score&lt;/strong&gt;: Calibration updates ignored print quality entirely&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Everything above worked as expected: Codex found the bugs, wrote the fixes, and I approved them.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What Codex got wrong
&lt;/h2&gt;

&lt;h3&gt;
  
  
  The approval trap
&lt;/h3&gt;

&lt;p&gt;The first &lt;code&gt;codex exec&lt;/code&gt; attempt ran for 21 minutes with zero output. I assumed it was slow — turns out it was &lt;strong&gt;hung waiting for approval&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Codex's config had &lt;code&gt;approval = OnRequest&lt;/code&gt;. When &lt;code&gt;exec&lt;/code&gt; mode needed to write files, it blocked forever because there was no TTY to approve.&lt;/p&gt;

&lt;p&gt;Fix: &lt;code&gt;codex exec --sandbox workspace-write&lt;/code&gt; bypasses the approval prompt. I added a &lt;code&gt;cxe&lt;/code&gt; alias so I never forget.&lt;/p&gt;

&lt;h3&gt;
  
  
  The hardcoded &lt;code&gt;--json&lt;/code&gt; bug
&lt;/h3&gt;

&lt;p&gt;The CLI unification code I wrote had a subtle bug: the &lt;code&gt;inspect&lt;/code&gt; subcommand always passed &lt;code&gt;--json&lt;/code&gt; to Printsight, even when the user didn't ask for it. This meant:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The wrapper's &lt;code&gt;--json&lt;/code&gt; flag was meaningless — JSON output was always on&lt;/li&gt;
&lt;li&gt;Default output was JSON, not human-readable&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Codex caught this in its review and flagged it as CRITICAL. Two-line fix.&lt;/p&gt;

&lt;h3&gt;
  
  
  Python 3.10 → 3.11 compatibility
&lt;/h3&gt;

&lt;p&gt;I used Python 3.11's &lt;code&gt;tomllib&lt;/code&gt; for TOML config parsing. Codex correctly noted that &lt;code&gt;pyproject.toml&lt;/code&gt; still claimed &lt;code&gt;&amp;gt;=3.10&lt;/code&gt; support. Fix: add a &lt;code&gt;tomli&lt;/code&gt; fallback import for older Python versions.&lt;/p&gt;

&lt;h3&gt;
  
  
  The stdin piped-prompt confusion
&lt;/h3&gt;

&lt;p&gt;Passing a multi-line prompt as &lt;code&gt;codex exec "$(cat prompt.txt)"&lt;/code&gt; works unreliably. The &lt;code&gt;$(cat ...)&lt;/code&gt; expansion can break on special characters, and the prompt silently fails if the shell parses it differently.&lt;/p&gt;

&lt;p&gt;The robust pattern: &lt;code&gt;cat prompt.txt | codex exec --sandbox workspace-write ""&lt;/code&gt; — pass the prompt via stdin instead of as a command argument.&lt;/p&gt;

&lt;h2&gt;
  
  
  Results
&lt;/h2&gt;

&lt;p&gt;After the Phase 1 unified CLI work and three rounds of Codex review:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Project&lt;/th&gt;
&lt;th&gt;Tests before&lt;/th&gt;
&lt;th&gt;Tests after&lt;/th&gt;
&lt;th&gt;Bugs fixed&lt;/th&gt;
&lt;th&gt;New features&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;SupportSage&lt;/td&gt;
&lt;td&gt;77&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;131&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;CLI unification, shared config&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Printsight&lt;/td&gt;
&lt;td&gt;0 (broken)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;16&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;FilamentDB&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;42&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Total&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;77&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;189&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;15&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;CLI, config, roadmap&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;New CLI commands:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;supportsage inspect photo.jpg      &lt;span class="c"&gt;# → delegates to Printsight&lt;/span&gt;
supportsage filament search PLA    &lt;span class="c"&gt;# → delegates to FilamentDB&lt;/span&gt;
supportsage filament recommend &lt;span class="nt"&gt;-b&lt;/span&gt; &lt;span class="s2"&gt;"Bambu Lab"&lt;/span&gt; &lt;span class="nt"&gt;-m&lt;/span&gt; &lt;span class="s2"&gt;"PLA Basic"&lt;/span&gt;
supportsage filament compare &lt;span class="s2"&gt;"eSun PLA+"&lt;/span&gt; &lt;span class="s2"&gt;"Overture PETG"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Shared config at &lt;code&gt;~/.supportsage/config.toml&lt;/code&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight toml"&gt;&lt;code&gt;&lt;span class="nn"&gt;[printsight]&lt;/span&gt;
&lt;span class="py"&gt;annotate_default&lt;/span&gt; &lt;span class="p"&gt;=&lt;/span&gt; &lt;span class="kc"&gt;true&lt;/span&gt;  &lt;span class="c"&gt;# auto-annotate inspection photos&lt;/span&gt;

&lt;span class="nn"&gt;[filamentdb]&lt;/span&gt;
&lt;span class="py"&gt;data_dir&lt;/span&gt; &lt;span class="p"&gt;=&lt;/span&gt; &lt;span class="s"&gt;"~/.supportsage/data"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  What I'd do differently
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Test Codex exec with a simple prompt first&lt;/strong&gt; — before throwing a 3,700-character prompt at it. A two-word test would have revealed the approval hang in seconds.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Pipe prompts via stdin&lt;/strong&gt; — &lt;code&gt;cat prompt.txt | codex exec ... ""&lt;/code&gt; avoids shell escaping issues entirely.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Write CLI delegation tests that verify the delegated argv&lt;/strong&gt; — my first tests only checked &lt;em&gt;that&lt;/em&gt; delegation happened, not &lt;em&gt;what arguments&lt;/em&gt; were passed. Codex's review caught this.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Don't guess Codex can write everything&lt;/strong&gt; — for Phase 1, I ended up writing the code myself after Codex hung. The bug-fix tasks worked; the feature-implementation task didn't.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Releases
&lt;/h2&gt;

&lt;p&gt;All three projects got patch releases with the fixes, plus SupportSage v0.7.0 with the unified CLI:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://github.com/bossman-lab/supportsage/releases/tag/v0.7.0" rel="noopener noreferrer"&gt;SupportSage v0.7.0&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/bossman-lab/printsight/releases/tag/v0.3.0" rel="noopener noreferrer"&gt;Printsight v0.3.0&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/bossman-lab/filamentdb/releases/tag/v0.2.0" rel="noopener noreferrer"&gt;FilamentDB v0.2.0&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Full roadmap: &lt;a href="https://github.com/bossman-lab/supportsage/blob/main/docs/ROADMAP.md" rel="noopener noreferrer"&gt;docs/ROADMAP.md&lt;/a&gt;&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>python</category>
      <category>ai</category>
      <category>3dprinting</category>
    </item>
    <item>
      <title>Day 11, First Star: What It Feels Like When a Stranger Clicks ⭐ on Your Open Source Tool</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Fri, 22 May 2026 13:11:36 +0000</pubDate>
      <link>https://dev.to/lanternproton/day-11-first-star-what-it-feels-like-when-a-stranger-clicks-on-your-open-source-tool-1jap</link>
      <guid>https://dev.to/lanternproton/day-11-first-star-what-it-feels-like-when-a-stranger-clicks-on-your-open-source-tool-1jap</guid>
      <description>&lt;p&gt;Eleven days ago, I pushed the initial commit for SupportSage — an open-source tool that optimizes 3D printing support structures.&lt;/p&gt;

&lt;p&gt;Today, it has exactly one star.&lt;/p&gt;

&lt;p&gt;If you've never maintained an open-source project, you might laugh at that number. One star? That's the celebration?&lt;/p&gt;

&lt;p&gt;Yes. Yes it is.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why One Star Matters More Than a Thousand Page Views
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Page views are passive.&lt;/strong&gt; Someone's browser loaded your page. Maybe they were bored at work, or clicked a link from Reddit, or a search engine sent them. They saw the page for 3 seconds and moved on.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;A star is active.&lt;/strong&gt; Someone looked at your code, your README, your screenshots. They understood what the project does. And they made a deliberate choice to press that button — a small social signal that says "I see value in this."&lt;/p&gt;

&lt;p&gt;That's a completely different signal-to-noise ratio.&lt;/p&gt;

&lt;h2&gt;
  
  
  What This One Star Actually Means
&lt;/h2&gt;

&lt;p&gt;When I check the analytics:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;303 total dev.to reads&lt;/strong&gt; across 10 SupportSage-related articles&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;24 unique GitHub visitors&lt;/strong&gt; in 14 days&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;781 repo clones&lt;/strong&gt; (people are actually pulling the code!)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;1 star&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The conversion funnel from visitor to believer is brutally honest. Most people browse, nod, and leave. Only one person said "this matters to me."&lt;/p&gt;

&lt;p&gt;And that's fine. Open source is a marathon, not a sprint.&lt;/p&gt;

&lt;h2&gt;
  
  
  What SupportSage Looks Like After 11 Days
&lt;/h2&gt;

&lt;p&gt;Since launch, the project has grown from a CLI-only script to a full integration suite:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Interface&lt;/th&gt;
&lt;th&gt;Status&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;CLI&lt;/td&gt;
&lt;td&gt;✅ Core features&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Web Demo (no backend)&lt;/td&gt;
&lt;td&gt;✅ Live&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Desktop GUI&lt;/td&gt;
&lt;td&gt;✅ Released&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cura Plugin&lt;/td&gt;
&lt;td&gt;✅ Ready&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;FilamentDB integration&lt;/td&gt;
&lt;td&gt;✅ Auto temp lookup&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Printsight integration&lt;/td&gt;
&lt;td&gt;✅ Post-print inspection&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The benchmarks hold solid at &lt;strong&gt;33% material savings&lt;/strong&gt; across all test models — tree supports with per-island severity grading consistently outperform traditional uniform supports.&lt;/p&gt;

&lt;p&gt;The repo is at &lt;strong&gt;github.com/bossman-lab/supportsage&lt;/strong&gt; if you want to see for yourself.&lt;/p&gt;

&lt;h2&gt;
  
  
  What's Next for SupportSage
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;v0.3.0&lt;/strong&gt; — Multi-material support (dissolvable interface + standard body)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;PrusaSlicer plugin&lt;/strong&gt; — community demand is loud on this one&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Printsight v0.3&lt;/strong&gt; — print failure prediction from gcode analysis (not just post-print inspection)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;These are real features that take real time. But the one star tells me someone's watching.&lt;/p&gt;

&lt;h2&gt;
  
  
  Build in Public: The Experiment Continues
&lt;/h2&gt;

&lt;p&gt;I've been documenting every step — from the initial idea through benchmarks, GUI, plugin, and now the first-star milestone. Twenty-one articles on dev.to. Zero dollars in revenue. One star on GitHub.&lt;/p&gt;

&lt;p&gt;Would I trade those metrics for a stealth-build that nobody sees? Not a chance.&lt;/p&gt;

&lt;p&gt;The build-in-public thesis isn't about virality or overnight success. It's about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Committing publicly&lt;/strong&gt; so you can't quietly abandon the project&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Getting feedback early&lt;/strong&gt; from people who actually use the tools&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Building a trail&lt;/strong&gt; of proof that you didn't fake it&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Winning one believer at a time&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Today, I have one believer. That's one more than day one.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;SupportSage is open source. MIT license. Contributions welcome.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Repo: &lt;a href="https://github.com/bossman-lab/supportsage" rel="noopener noreferrer"&gt;github.com/bossman-lab/supportsage&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>3dprinting</category>
      <category>buildinpublic</category>
      <category>indiehacking</category>
    </item>
    <item>
      <title>Anker's 0-to-1: What a $3000 Startup Teaches Us About Replicable Success</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Thu, 21 May 2026 23:56:02 +0000</pubDate>
      <link>https://dev.to/lanternproton/ankers-0-to-1-what-a-3000-startup-teaches-us-about-replicable-success-4bo9</link>
      <guid>https://dev.to/lanternproton/ankers-0-to-1-what-a-3000-startup-teaches-us-about-replicable-success-4bo9</guid>
      <description>&lt;h1&gt;
  
  
  Anker's 0-to-1: What a $3000 Startup Teaches Us About Replicable Success
&lt;/h1&gt;

&lt;p&gt;In my previous post, I analyzed the mature methodology behind Anker — shallow sea strategy, the 1357 product tier system, and the Big Dipper formation for competitive advantage. That's the 1-to-10 playbook, the system Steven Yang built &lt;em&gt;after&lt;/em&gt; they had product-market fit.&lt;/p&gt;

&lt;p&gt;But the 0-to-1 story is different. It's messier, luckier, and far more instructive for founders who haven't found their first hit yet.&lt;/p&gt;

&lt;p&gt;Here's the full picture — what happened, what made it work, and what can actually be replicated in 2026.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Timeline: From $3,000 to the First Billion
&lt;/h2&gt;

&lt;h3&gt;
  
  
  2011 — The Accidental Discovery
&lt;/h3&gt;

&lt;p&gt;Steven Yang was a senior software engineer at Google. He did something completely unrelated to his later career: &lt;strong&gt;he built an automated order processing system for his girlfriend's Amazon shop.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What he discovered changed everything: Amazon's marketplace could put a completely unknown brand in front of global consumers.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Most good businesses aren't "thought up" — they're stumbled upon while doing something else. Yang didn't decide to start a company and &lt;em&gt;then&lt;/em&gt; discover Amazon. He used Amazon first (for his girlfriend), then decided to start a company.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  Late 2011 — The First Capital
&lt;/h3&gt;

&lt;p&gt;Yang asked his mother: should I take VC money or use family money?&lt;/p&gt;

&lt;p&gt;Her answer is worth remembering: &lt;em&gt;"If this is gambling, take someone else's money. If this is what you're meant to do, use mine."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He combined his Google savings with his mother's pharmaceutical career savings — &lt;strong&gt;less than $1M in seed capital&lt;/strong&gt; — and moved from California to Shenzhen.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The source of your 0-to-1 capital shapes the company's genetics. Money you're afraid to lose changes your decision quality — usually for the better.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  The First Product: Laptop Batteries
&lt;/h3&gt;

&lt;p&gt;The market landscape:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Option&lt;/th&gt;
&lt;th&gt;Price&lt;/th&gt;
&lt;th&gt;Quality&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;OEM laptop battery&lt;/td&gt;
&lt;td&gt;$100&lt;/td&gt;
&lt;td&gt;Good&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cheap knockoff&lt;/td&gt;
&lt;td&gt;$20&lt;/td&gt;
&lt;td&gt;Terrible&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Anker's slot&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;$30-40&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Near-OEM quality&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Yang found a "no-man's land": $100 was too expensive, $20 was too risky. Would people pay &lt;strong&gt;$30-40 for near-OEM quality&lt;/strong&gt;?&lt;/p&gt;

&lt;p&gt;Yes, they would. This became their first revenue.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;0-to-1 isn't about finding a blue ocean. It's about finding a &lt;strong&gt;quality gap&lt;/strong&gt; — the price point where customers will happily pay 50% more to stop worrying about quality.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  2012-2013 — The Forced Pivot
&lt;/h3&gt;

&lt;p&gt;The MacBook Air made built-in batteries standard. The laptop battery category was shrinking. Meanwhile, smartphones were exploding.&lt;/p&gt;

&lt;p&gt;Yang pivoted from laptop batteries to phone accessories. Their first phone product? A replaceable battery for the HTC Sensation (Android phones still had removable batteries then). Then power banks, cables, chargers.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Your first category probably won't be your final category. The key is building &lt;strong&gt;capability&lt;/strong&gt; (manufacturing + supply chain + distribution) that can transfer to a bigger market.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  2014 — The First Hit: The "Lipstick" Power Bank
&lt;/h3&gt;

&lt;p&gt;The team noticed a very specific problem: &lt;strong&gt;women's bags are small. Regular power banks don't fit.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;They found a Shenzhen factory making lipstick-sized power banks. Anker didn't design from scratch — they redesigned and improved an existing product.&lt;/p&gt;

&lt;p&gt;Result: &lt;strong&gt;1 million units sold.&lt;/strong&gt; Topped Amazon for 3 consecutive years.&lt;/p&gt;

&lt;p&gt;This was Anker's first "breakout" product. Before this, they were "that decent charger brand on Amazon."&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Your first hit comes from a &lt;strong&gt;narrow, overlooked scenario&lt;/strong&gt;. Not "a better power bank for everyone" — "a power bank that fits in a small bag." The narrower the entry point, the faster it explodes.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Anker's Three Hidden Weapons
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Weapon 1: The Amazon Review Mining Machine
&lt;/h3&gt;

&lt;p&gt;Yang's most underrated innovation wasn't a product — it was a &lt;strong&gt;method for finding what to build&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The process: study thousands of Amazon reviews and complaints daily → identify the top 3 customer pain points → design a product that specifically fixes those 3 things → launch → iterate based on new reviews.&lt;/p&gt;

&lt;p&gt;Result: Anker's products weren't "designed by engineers." They were &lt;strong&gt;complained into existence by customers&lt;/strong&gt;.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;At the 0-to-1 stage, you don't need genius insights. You need a systematic way to let customers tell you what to build.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  Weapon 2: The Happy Card
&lt;/h3&gt;

&lt;p&gt;Every Anker product included a card: &lt;em&gt;"Contact us directly if there's an issue. Don't leave a negative review."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Results: complaints handled privately instead of publicly on Amazon, direct customer relationships established, consistent 4-star ratings — not the best, but the most reliable.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The most valuable early-stage asset isn't traffic — it's ratings and word-of-mouth. A "talk to us first" card in every package is worth more than a paid ad campaign.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  Weapon 3: PowerIQ
&lt;/h3&gt;

&lt;p&gt;A self-developed technology that auto-detects connected devices and delivers optimal charging power. Technically simple. But in marketing, it transformed Anker from "cheap charger brand" into "the brand with technology."&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Early-stage technology doesn't need to be revolutionary. It needs to &lt;strong&gt;give customers a label they can repeat&lt;/strong&gt;. "The one that auto-detects your phone" spreads faster than "Anker's charger."&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  What You Can't Replicate: Three Windows That Closed
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Amazon's Traffic Window (2011-2014)
&lt;/h3&gt;

&lt;p&gt;In 2011, few Chinese sellers were on Amazon. Competition was minimal, ad costs were low, and the platform was hungry for third-party sellers. Yang wasn't "grabbing share in a crowded market" — he was &lt;strong&gt;planting flags on empty land&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Today: thousands of competitors per category, CPC costs of $1-5, Amazon pushing its own Basics line, and 4.5-star products everywhere.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. The Shenzhen Manufacturing + Three-Way Arbitrage
&lt;/h3&gt;

&lt;p&gt;In 2011, Yang was one of the few people who simultaneously understood three things:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What US consumers wanted and what they'd pay&lt;/li&gt;
&lt;li&gt;How Amazon's marketplace and rules worked&lt;/li&gt;
&lt;li&gt;What Shenzhen's supply chain could actually do&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Today, that three-way knowledge is table stakes for every cross-border ecommerce operator.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. The Category White Space
&lt;/h3&gt;

&lt;p&gt;In 2011, the charging accessories market had OEM ($100) and garbage ($20) — with nothing in between. Today, search "power bank" and you'll find hundreds of brands at every price point, all rated 4.5+. &lt;strong&gt;That quality gap has been filled.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  What You CAN Replicate: Four Patterns That Haven't Aged
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Pattern 1: Find Demand on a Platform, Don't Guess It
&lt;/h3&gt;

&lt;p&gt;Yang's sequence: find a platform (Amazon) → study complaints → build product. Most founders do the reverse: have a product idea → validate on a platform.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The 2026 version&lt;/strong&gt;: Find a platform where users publicly complain (Reddit subreddits, Xiaohongshu comments, GitHub Issues, Discord servers). Let their complaints tell you what to build.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 2: Don't Be the Cheapest, Don't Be the Best — Be the Most Reliable Middle Ground
&lt;/h3&gt;

&lt;p&gt;Anker owned the "four-star review" slot — between five-star OEM and three-star no-name.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The 2026 version&lt;/strong&gt;: In any category, "best value" scales better than "cheapest" or "premium." Find the price point where customers pay 50% more for "won't let me down."&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 3: The First Hit Comes From a Ridiculously Specific Scenario
&lt;/h3&gt;

&lt;p&gt;Not "a better power bank" — "a power bank that fits in small bags."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The 2026 version&lt;/strong&gt;: Don't build "an AI writing tool." Build "an AI writing tool for cross-border ecommerce listings." Don't build "project management software." Build "a progress tracker for renovation contractors." Narrow entry, wide exit.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 4: Early Tech Doesn't Need to Be Revolutionary — It Needs to Be Shareable
&lt;/h3&gt;

&lt;p&gt;PowerIQ wasn't hard to build. But it gave customers a one-liner: &lt;em&gt;"It's the one that automatically detects your phone's charging needs."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The 2026 version&lt;/strong&gt;: Your first feature doesn't need to be a patentable invention. It needs to be describable in one sentence that a customer would actually say to a friend.&lt;/p&gt;




&lt;h2&gt;
  
  
  How to Apply This Framework in 2026
&lt;/h2&gt;

&lt;p&gt;Don't start a charger brand (red ocean). Don't expect Amazon blue oceans (gone). But the framework works:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Step&lt;/th&gt;
&lt;th&gt;What to Do&lt;/th&gt;
&lt;th&gt;Anker's Equivalent&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Find a "high-density complaint" platform&lt;/td&gt;
&lt;td&gt;Amazon review mining&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Identify a quality gap ("clearly better than budget, 60% less than premium")&lt;/td&gt;
&lt;td&gt;Laptop battery positioning&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;Build a first product for a very narrow scenario&lt;/td&gt;
&lt;td&gt;Lipstick power bank&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;Create a trust mechanism (direct founder access, no-questions refund)&lt;/td&gt;
&lt;td&gt;Happy Card&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;Give the product a label customers repeat&lt;/td&gt;
&lt;td&gt;PowerIQ&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  0-to-1 vs 1-to-10: Completely Different Games
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Dimension&lt;/th&gt;
&lt;th&gt;0-to-1 Phase (2011-2014)&lt;/th&gt;
&lt;th&gt;1-to-10 Phase (2014-onward)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Strategy&lt;/td&gt;
&lt;td&gt;Find quality gaps, test fast&lt;/td&gt;
&lt;td&gt;Shallow sea, systematic expansion&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Innovation&lt;/td&gt;
&lt;td&gt;Improve existing products&lt;/td&gt;
&lt;td&gt;9-grid innovation framework&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Team&lt;/td&gt;
&lt;td&gt;A few friends from Google/Peking U&lt;/td&gt;
&lt;td&gt;6,000-person President+Federation&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Pricing&lt;/td&gt;
&lt;td&gt;$30-40 sweet spot&lt;/td&gt;
&lt;td&gt;1357 tiered system&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Competition&lt;/td&gt;
&lt;td&gt;One person can win&lt;/td&gt;
&lt;td&gt;Big Dipper formation&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Anker's mature methodology fills the LatePost interview. But &lt;strong&gt;the blood, sweat, and luck of 0-to-1 isn't in that article.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Read the companion Chinese version on our blog. This is the second in a two-part series: &lt;a href="///blog/company-as-product-community-as-company.html"&gt;Part 1: The five principles uniting Anker and community-building&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>startup</category>
      <category>strategy</category>
      <category>business</category>
      <category>founder</category>
    </item>
    <item>
      <title>Companies Are the Hardest Products, Communities Are the Hardest Companies: What Anker and the Internet Taught Me About Building</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Thu, 21 May 2026 23:25:57 +0000</pubDate>
      <link>https://dev.to/lanternproton/companies-are-the-hardest-products-communities-are-the-hardest-companies-what-anker-and-the-4ld3</link>
      <guid>https://dev.to/lanternproton/companies-are-the-hardest-products-communities-are-the-hardest-companies-what-anker-and-the-4ld3</guid>
      <description>&lt;h1&gt;
  
  
  Companies Are the Hardest Products, Communities Are the Hardest Companies
&lt;/h1&gt;

&lt;p&gt;&lt;strong&gt;Two readings, same week, same room, different worlds — and they converged on the same five principles.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Reading 1: &lt;a href="https://mp.weixin.qq.com/s/5QKj6WJzY7L_U43pQ6j54g" rel="noopener noreferrer"&gt;LatePost's interview with Anker founder Steven Yang&lt;/a&gt; — a man who lost 15kg to depression in 2022, cut 10 product categories, then rebuilt his company from first principles.&lt;/p&gt;

&lt;p&gt;Reading 2: Wang Yizhou's essay &lt;em&gt;People, Content, Community&lt;/em&gt; from the Chinese publication Mingyeji — a framework for why most "community products" fail: they solve communication but not identity.&lt;/p&gt;

&lt;p&gt;One is about building a $3.6B consumer electronics company. The other is about building online communities that don't die. They look like different universes. They're not.&lt;/p&gt;

&lt;p&gt;This post distills the &lt;strong&gt;five shared principles&lt;/strong&gt; — a unified lens I wish I'd had when I started building.&lt;/p&gt;




&lt;h2&gt;
  
  
  1. Terrain Dictates Strategy, Not the Other Way Around
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Anker: Shallow Waters&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Yang's most quoted line: &lt;em&gt;"Even with $5-10 billion in chips, don't sit at a table where the blind is $500 million."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Anker categorizes all markets into two terrains:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Type&lt;/th&gt;
&lt;th&gt;Size&lt;/th&gt;
&lt;th&gt;Anker's stance&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Shallow sea&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;≤ $50B/year&lt;/td&gt;
&lt;td&gt;Fights here&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Deep sea&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$100B+ (phones, TVs, PCs)&lt;/td&gt;
&lt;td&gt;Won't touch&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;This isn't about ambition — it's about what the terrain allows. A $50B category with many sub-niches (like consumer electronics accessories) rewards &lt;strong&gt;breadth + systems&lt;/strong&gt;. A $500B category dominated by mobile phones rewards &lt;strong&gt;depth + a single moonshot&lt;/strong&gt;. Anker chose the terrain that matches its nature: a "third-category company" that wins through portfolio and process, not a single breakthrough product.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;REMIX: Cultural Boundaries&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The community framework makes the same point. A real community needs five elements:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Element&lt;/th&gt;
&lt;th&gt;Meaning&lt;/th&gt;
&lt;th&gt;Counterexample&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Role&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;A recognized identity&lt;/td&gt;
&lt;td&gt;Everyone is just a "user"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Action&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;A specific way to contribute&lt;/td&gt;
&lt;td&gt;Everyone can only like/comment&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Status/incentive&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Reputation, influence, money&lt;/td&gt;
&lt;td&gt;Pure volunteerism burns out&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Hierarchy&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Levels + sequence + mobility&lt;/td&gt;
&lt;td&gt;Flat = no growth feeling&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Cultural boundary&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Anti-scale exclusivity&lt;/td&gt;
&lt;td&gt;Anyone can join = no identity&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The last element is the most counterintuitive: &lt;strong&gt;a community that scales infinitely is not a community — it's a content platform.&lt;/strong&gt; Cultural boundaries are the "shallow sea" of community design. If you don't accept that growth will be constrained by identity, you'll build a content platform instead.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The shared principle&lt;/strong&gt;: The shape of your battlefield determines your strategy — not your vision, not your talent, not your funding. Denying the terrain doesn't change it.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Put it to work&lt;/strong&gt;: Before choosing a category (product) or a community (people), draw a map of the terrain. What does success at your chosen size look like? If the map shows "deep sea" dynamics, don't enter with a shallow sea strategy.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. Don't Design for 'Everyone' — Design a Ladder
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Anker: The 1357 System&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Anker's products aren't a flat list. They're a &lt;strong&gt;gradated hierarchy&lt;/strong&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Series&lt;/th&gt;
&lt;th&gt;Character&lt;/th&gt;
&lt;th&gt;Price vs. peers&lt;/th&gt;
&lt;th&gt;Strategic role&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;1系&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Cheapest&lt;/td&gt;
&lt;td&gt;Lowest&lt;/td&gt;
&lt;td&gt;Funnel entry&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;3系&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Value&lt;/td&gt;
&lt;td&gt;Low-mid&lt;/td&gt;
&lt;td&gt;Volume base&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;5系&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Reliable, quality&lt;/td&gt;
&lt;td&gt;+50-100%&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Core (first 10 years)&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;7系&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Best, most expensive&lt;/td&gt;
&lt;td&gt;Premium&lt;/td&gt;
&lt;td&gt;Future&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Target: 7系 grows from 3% (2020) → 15% (2025) → &lt;strong&gt;30%&lt;/strong&gt; of revenue by 2030.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"How do you judge a world-class company? Look at whether 7系 products are at least 30% of revenue."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The hierarchy gives every user a clear upgrade path without leaving the brand. Solarbank 1 (€1k) → Solarbank 2 Pro (mid) → Solarbank 3 Pro (AI-powered, premium) — same user, same ecosystem, natural upgrade.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;REMIX: Hierarchy + Sequence&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The community framework's fourth element — &lt;strong&gt;hierarchy&lt;/strong&gt; — isn't about inequality. It's about making growth visible. A community without levels is a chat room. Levels create:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;A path&lt;/strong&gt;: "I start here, I can get there"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recognition&lt;/strong&gt;: Your effort maps to a visible status&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Retention&lt;/strong&gt;: Leaving means losing your accumulated standing&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The framework calls this "a sequence" — not just tiers, but a &lt;strong&gt;progression&lt;/strong&gt; with clear advancement criteria.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The shared principle&lt;/strong&gt;: Don't sell one thing. Design a ladder. Let users grow without leaving your ecosystem.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Put it to work&lt;/strong&gt;: Whether you're building products or a community, map your 1系→3系→5系→7系. Your entry point should be easy to try (low barrier), your core should deliver reliable value (high retention), and your premium tier should set the standard for the entire category (brand elevation).&lt;/p&gt;




&lt;h2&gt;
  
  
  3. From Factory to Playground: The Remix Model
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Anker: The Creator's Paradise&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Yang's revised mission for Anker: &lt;em&gt;"Build a creator's paradise that inspires each other. Create a brand that global consumers love."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He contrasts two organization models:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model&lt;/th&gt;
&lt;th&gt;Used by&lt;/th&gt;
&lt;th&gt;Logic&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;King + Knights&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Apple, Tesla&lt;/td&gt;
&lt;td&gt;Single ultra-category, concentric talent&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;President + Federation&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Anker&lt;/td&gt;
&lt;td&gt;Multiple mid-categories, distributed leadership&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Anker is explicitly the second model. The CEO doesn't design products — he designs the &lt;strong&gt;system&lt;/strong&gt; that produces product designers. The company at scale: 6,000 people, 800+ making &amp;gt;¥1M/year, organized as "branches build, headquarters orchestrates."&lt;/p&gt;

&lt;p&gt;Yang's diagnosis after the 2022 crisis: &lt;em&gt;"Five pots, only two lids"&lt;/em&gt; — too many product lines, not enough great managers. The fix wasn't hiring more people. It was reducing from 100 products → 30 → ~10 by 2025, then rebuilding the system.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;REMIX: The Playground&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The community framework's core metaphor: &lt;strong&gt;product as playground, not factory&lt;/strong&gt;.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Dimension&lt;/th&gt;
&lt;th&gt;Factory (traditional platform)&lt;/th&gt;
&lt;th&gt;Playground (AI-era community)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Rhythm&lt;/td&gt;
&lt;td&gt;Publish, done&lt;/td&gt;
&lt;td&gt;Open every day, ongoing&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Structure&lt;/td&gt;
&lt;td&gt;Platform decides the flow&lt;/td&gt;
&lt;td&gt;Platform provides equipment, users decide&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Goal&lt;/td&gt;
&lt;td&gt;User consumes content&lt;/td&gt;
&lt;td&gt;User exists in the group&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Barrier&lt;/td&gt;
&lt;td&gt;Low entry, low depth&lt;/td&gt;
&lt;td&gt;Low entry, depth optional&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Repeat behavior&lt;/td&gt;
&lt;td&gt;Scroll more&lt;/td&gt;
&lt;td&gt;Play deeper&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The critical insight: &lt;strong&gt;AI's core capability isn't creation — it's editability/remix.&lt;/strong&gt; The old product model assumed "seed units" were completed works (tweets, photos, videos). The new model assumes seed units are open and editable — a character, a world, a conversation — and the product's job is to enable remix, not factory output.&lt;/p&gt;

&lt;p&gt;Anker's "creator's paradise" is a playground for employees. REMIX's "playground" is a product model for users. Both reject the assembly line.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The shared principle&lt;/strong&gt;: Stop building factories that output standardized units. Start building playgrounds where participants create value through interaction, remix, and evolution over time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Put it to work&lt;/strong&gt;: Ask yourself: is my company/product a factory or a playground? Am I hiring people to execute my instructions, or am I creating conditions where they figure out better paths? Am I producing content for users, or am I creating spaces where users produce value for each other?&lt;/p&gt;




&lt;h2&gt;
  
  
  4. System Over Individual: The Formation Advantage
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Anker: The Big Dipper Formation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Yang's problem: Anker's department heads are "middle horses" — good, not geniuses. How do they beat competitors' CEOs ("top horses") in one-on-one battles?&lt;/p&gt;

&lt;p&gt;Answer: don't fight one-on-one. Use a formation.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"In Jin Yong's novels, the Big Dipper Formation uses 7 average martial artists in a coordinated pattern to defeat a master. Wang Chongyang could have fought everyone himself, but he didn't want to — so he built the formation."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Anker's formation has four layers of &lt;strong&gt;compounding advantages&lt;/strong&gt;:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Channel reuse&lt;/strong&gt;: Amazon → offline → global, one product supports another&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;User group cross-pollination&lt;/strong&gt;: Security customers buy cleaning, buy storage&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Shared technology&lt;/strong&gt;: Chips, systems, models — built once, used everywhere&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Unified brand&lt;/strong&gt;: Anker → eufy → soundcore, one name = one trust signal&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Result: the department head isn't alone. They bring the entire formation to every fight.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;REMIX: Collaborative IP via Open Source Mechanics&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The community framework maps this exactly:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Open source mechanic&lt;/th&gt;
&lt;th&gt;Mapped to collaborative IP&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Clear main branch&lt;/td&gt;
&lt;td&gt;A maintained "canon" version of the IP&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Forkable + PR-mergeable&lt;/td&gt;
&lt;td&gt;Users fork AU (alternate universe) versions; good ones merge back&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Standardized protocol&lt;/td&gt;
&lt;td&gt;Fork/adopt/co-create/consensus mechanism&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Maintainer role&lt;/td&gt;
&lt;td&gt;IP stewards&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Reputation system&lt;/td&gt;
&lt;td&gt;Contributors earn naming rights, revenue share, voice&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The theoretical foundation is Hayek's &lt;em&gt;The Use of Knowledge in Society&lt;/em&gt;: modern society works because distributed local knowledge aggregates through exchange. The same applies to creation — IP can be collaboratively grown by distributed contributors.&lt;/p&gt;

&lt;p&gt;Fan fiction communities have already proven this works (roles, events, deadlines, recognizable tiers, co-creation mechanisms). AI makes the model massively scalable — characters and worlds no longer degrade as they get remixed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The shared principle&lt;/strong&gt;: Don't depend on individual genius. Build a system where coordinated average talent outperforms scattered exceptional talent. The system itself is the moat.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Put it to work&lt;/strong&gt;: Before hiring a "rockstar," ask: what's the formation they'll fight in? What does your compounding advantage stack look like? If your new person needs to win alone, you have a hiring problem — but also a system problem.&lt;/p&gt;




&lt;h2&gt;
  
  
  5. Bet Half, Reinvest, Repeat: The Sequencing Principle
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Anker: Aggressive Conservatism&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"We're aggressive conservatives. Conservative: out of ¥10, we bet at most ¥5-6. Aggressive: we always place that ¥5-6."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The math:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Start with 10 → Bet 5 → Make 20 → Total 25 → Bet 12 → Repeat&lt;/li&gt;
&lt;li&gt;The pot grows, but the &lt;strong&gt;proportion stays at ~50%&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This isn't risk aversion. It's &lt;strong&gt;sequence discipline&lt;/strong&gt;: don't over-expand before the system can support it. After 2022's over-expansion (four+ new categories in one year), Yang cut to 10 and rebuilt.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"From 100 products to 30 to ~10... everyone thinks I'm constantly restructuring. Yes."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The sequencing principle also governs &lt;strong&gt;who gets paid first&lt;/strong&gt;:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"In consumer electronics, excess value is created by workers. Workers should get the majority."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Before&lt;/th&gt;
&lt;th&gt;Now&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Employee comp as % of revenue&lt;/td&gt;
&lt;td&gt;~8%&lt;/td&gt;
&lt;td&gt;13%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Employees making &amp;gt;¥1M&lt;/td&gt;
&lt;td&gt;500 (2024)&lt;/td&gt;
&lt;td&gt;800+ (2025)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Gross margin&lt;/td&gt;
&lt;td&gt;Baseline&lt;/td&gt;
&lt;td&gt;+8pp over 5 years&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Compensation structure: &lt;strong&gt;distribute opportunity first&lt;/strong&gt; (values-based), &lt;strong&gt;distribute returns after&lt;/strong&gt; (results-based).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;REMIX: Sequence in Community&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The playground model has a natural sequencing logic:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Catch a crowd (people who want to remix)
  → Form a community (roles + cultural circle)
  → Build mechanics (fork / adopt / co-create)
  → Let collaboration become assets (IP + collective memory)
  → Product grows into a continuously operating playground
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each step depends on the one before. You can't build mechanics without a community. You can't have a community without a crowd that wants to remix. Skip a step and the system collapses.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The shared principle&lt;/strong&gt;: Growth isn't about how fast you can go. It's about what you can afford to sustain. Sequence your expansion so each step is funded by the last. Distribute opportunity before returns.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Put it to work&lt;/strong&gt;: What's your current "pot" (resources, talent, trust)? Never bet more than half. On the next win, reinvest half of the gain. Repeat. And ask yourself: am I distributing opportunity to the right people first, or am I trying to buy my way out of a system problem?&lt;/p&gt;




&lt;h2&gt;
  
  
  The Unified Chart
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Principle&lt;/th&gt;
&lt;th&gt;Anker&lt;/th&gt;
&lt;th&gt;REMIX&lt;/th&gt;
&lt;th&gt;Get to "yes" question&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Terrain over ambition&lt;/td&gt;
&lt;td&gt;Shallow sea (≤$50B) vs. deep sea&lt;/td&gt;
&lt;td&gt;Cultural boundary vs. mass platform&lt;/td&gt;
&lt;td&gt;"Does the shape of my battlefield allow my kind of win?"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ladder over flat&lt;/td&gt;
&lt;td&gt;1357 product hierarchy&lt;/td&gt;
&lt;td&gt;Role + hierarchy + sequence&lt;/td&gt;
&lt;td&gt;"Do users have a visible path to grow within my system?"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Playground over factory&lt;/td&gt;
&lt;td&gt;Creator's paradise organization&lt;/td&gt;
&lt;td&gt;Open-seed-unit product model&lt;/td&gt;
&lt;td&gt;"Am I building a system that enables, or an assembly line that standardizes?"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Formation over individual&lt;/td&gt;
&lt;td&gt;4-layer compounding advantage&lt;/td&gt;
&lt;td&gt;Forkable IP + maintainer + reputation&lt;/td&gt;
&lt;td&gt;"Can 7 average people in formation beat 1 genius alone?"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Sequence over speed&lt;/td&gt;
&lt;td&gt;Aggressive conservatism (bet 50%)&lt;/td&gt;
&lt;td&gt;Step-by-step community building&lt;/td&gt;
&lt;td&gt;"Is my next expansion funded by the last win?"&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  What This Means for a Founder With 3-24 Months of Runway
&lt;/h2&gt;

&lt;p&gt;This isn't theory. Here's the real checklist:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Month 1: Map your terrain&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Is your category shallow sea or deep sea?&lt;/li&gt;
&lt;li&gt;If deep sea: can you enter through a niche that the dominant player ignores for 3+ years?&lt;/li&gt;
&lt;li&gt;Does your potential community have enough cultural identity to survive growth?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Month 2: Design the ladder&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What is your 1系 (entry, easy to try)?&lt;/li&gt;
&lt;li&gt;What is your 5系 (core, reliable value)?&lt;/li&gt;
&lt;li&gt;What is your 7系 (aspirational, sets the standard)?&lt;/li&gt;
&lt;li&gt;Same question for community: what's the entry role vs. the top role?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Month 3-6: Build the playground, not the factory&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Are you producing things for users, or are you building a space where users produce things for each other?&lt;/li&gt;
&lt;li&gt;What's the "seed unit" — is it completed (factory) or editable (playground)?&lt;/li&gt;
&lt;li&gt;For Anker: the seed unit is a product category operated by a semi-autonomous team. For REMIX: the seed unit is a character/world that can be forked. What's yours?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Month 6-12: Install your formation&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What 3-4 compounding layers can you build that each competitor would need to build from scratch?&lt;/li&gt;
&lt;li&gt;Can your "middle horse" department heads access the same resources that your CEO would?&lt;/li&gt;
&lt;li&gt;For community: can a new contributor become a maintainer through a clear path?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Month 12-24: Sequence, don't sprint&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What is your ¥10? Bet 5. Don't bet 8.&lt;/li&gt;
&lt;li&gt;Did your last win pay for your next expansion?&lt;/li&gt;
&lt;li&gt;Are your most valuable people getting more opportunity, not just more money?&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Closing
&lt;/h2&gt;

&lt;p&gt;Yang's ultimate aspiration, quoting Laozi: &lt;em&gt;"The highest ruler is one whose existence is barely known."&lt;/em&gt; He places himself at the third level (feared) but wants to reach the first (barely known — the system runs itself).&lt;/p&gt;

&lt;p&gt;The community framework says the same thing differently: the best products aren't destinations you visit — they're playgrounds you live in. The creator becomes invisible; the creation becomes self-sustaining.&lt;/p&gt;

&lt;p&gt;A company is the hardest product. A community is the hardest company. But the principles that govern them — terrain, ladder, playground, formation, sequence — are the same.&lt;/p&gt;

&lt;p&gt;The question isn't what you build. It's what system you set in motion.&lt;/p&gt;

</description>
      <category>startup</category>
      <category>product</category>
      <category>community</category>
      <category>strategy</category>
    </item>
    <item>
      <title>Four Routes to EUV: LPP, LDP, Gas Cluster, and the Soft X-Ray Dark Horse</title>
      <dc:creator>keeper</dc:creator>
      <pubDate>Thu, 21 May 2026 14:55:23 +0000</pubDate>
      <link>https://dev.to/lanternproton/three-routes-to-euv-lpp-ldp-and-the-russian-gas-cluster-that-runs-on-a-vacuum-pump-1kkp</link>
      <guid>https://dev.to/lanternproton/three-routes-to-euv-lpp-ldp-and-the-russian-gas-cluster-that-runs-on-a-vacuum-pump-1kkp</guid>
      <description>&lt;p&gt;Today there are three actively competing approaches to generating extreme ultraviolet (EUV) light for chip lithography. And there's a fourth one — soft X-ray — that's too early to call a competitor, but worth watching.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;ASML's LPP dominates&lt;/strong&gt; — it's what powers every 7nm-and-below fab from TSMC, Samsung, and Intel. But it costs $200M per machine and took $20B+ in R&amp;amp;D.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;China's LDP route&lt;/strong&gt; trades peak performance for accessibility — simpler hardware, lower cost, but stuck on electrode erosion.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Russia's gas cluster approach&lt;/strong&gt; targets a fundamentally different wavelength (6.7nm vs 13.5nm) and produces no debris because its "target" is a gas.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;And then there's soft X-ray / B-EUV&lt;/strong&gt; — 6.5–6.7nm, 185–190 eV photon energy. Still in the lab. No production light source exists. No mirrors exist for this wavelength. But in 2025, Johns Hopkins published a resist breakthrough in Nature, and a startup called Substrate raised $100M at a $1B valuation with a X-ray lithography story.&lt;/p&gt;

&lt;p&gt;Let's break down all four.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Four Routes at a Glance
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Route&lt;/th&gt;
&lt;th&gt;Representative&lt;/th&gt;
&lt;th&gt;Wavelength&lt;/th&gt;
&lt;th&gt;Source&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;LPP&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;ASML (Netherlands)&lt;/td&gt;
&lt;td&gt;13.5nm&lt;/td&gt;
&lt;td&gt;CO₂ laser→tin droplet plasma&lt;/td&gt;
&lt;td&gt;Tin (Sn) droplets&lt;/td&gt;
&lt;td&gt;250-600W (production)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;LDP&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Multiple Chinese institutes&lt;/td&gt;
&lt;td&gt;13.5nm&lt;/td&gt;
&lt;td&gt;Pulsed discharge→tin vapor plasma&lt;/td&gt;
&lt;td&gt;Tin electrodes/vapor&lt;/td&gt;
&lt;td&gt;~100W (lab)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Gas Cluster&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Russian Academy of Sciences&lt;/td&gt;
&lt;td&gt;6.7nm (target)&lt;/td&gt;
&lt;td&gt;Yb femtosecond laser→gas cluster plasma&lt;/td&gt;
&lt;td&gt;Li-Xe nanoclusters&lt;/td&gt;
&lt;td&gt;150-200W (theoretical)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Soft X-ray / B-EUV&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Johns Hopkins / Substrate (lab)&lt;/td&gt;
&lt;td&gt;6.5–6.7nm&lt;/td&gt;
&lt;td&gt;Gd LPP (experimental)&lt;/td&gt;
&lt;td&gt;Zinc-based resists&lt;/td&gt;
&lt;td&gt;No production source&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  How Each Works
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;LPP (Laser Produced Plasma):&lt;/strong&gt; A CO₂ laser (10.6μm) hits a stream of 30μm tin droplets at 50,000 times per second. Each impact vaporizes the droplet into 500,000°C plasma that emits 13.5nm light. ASML has spent $20B+ perfecting this.&lt;/p&gt;

&lt;p&gt;The three fundamental problems:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Tin debris&lt;/strong&gt; — fragments coat the collector mirror. ASML uses hydrogen buffers + self-cleaning coatings. It works, but adds enormous complexity.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;CO₂ laser efficiency&lt;/strong&gt; — CO₂→EUV conversion is ~5-6%.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Plasma absorption&lt;/strong&gt; — tin plasma reabsorbs some 13.5nm radiation, limiting usable plasma size.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;LDP (Laser Discharge Plasma):&lt;/strong&gt; A pulsed electrical discharge runs through tin vapor to create plasma. Simpler and cheaper.&lt;/p&gt;

&lt;p&gt;The single bottleneck: &lt;strong&gt;electrode erosion&lt;/strong&gt;. Every discharge eats the electrodes. Lifetimes range from thousands to millions of shots. In a 24/7 fab, that's uptime. Research directions include rotating electrodes (like X-ray tubes), liquid tin streams, and cryogenic plasma maintenance.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Gas Cluster (Russian Academy of Sciences, May 2026):&lt;/strong&gt; Lithium vapor mixed with xenon gas forms Li-Xe nanoclusters. A Yb femtosecond laser (1030nm) excites these clusters to emit 6.7nm EUV light.&lt;/p&gt;

&lt;p&gt;The crucial difference: everything is a gas. When the cluster explodes, the fragments are also gas — pumped out by vacuum pumps. &lt;strong&gt;No debris, no mirror contamination.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Three bottlenecks:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Multilayer mirrors for 6.7nm&lt;/strong&gt; — Mo/Be or Mo/Y coatings need Å-level precision. Thinner, tighter, harder than 13.5nm Mo/Si stacks.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Gas jet stability&lt;/strong&gt; — laser-to-cluster coupling depends on cluster size distribution. Complex fluid dynamics.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ion damage&lt;/strong&gt; — 6.7nm photons carry more energy. Different damage mechanisms, zero long-term data.&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Timeline Reality Check
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Milestone&lt;/th&gt;
&lt;th&gt;ASML LPP&lt;/th&gt;
&lt;th&gt;China LDP&lt;/th&gt;
&lt;th&gt;Russia Gas Cluster&lt;/th&gt;
&lt;th&gt;Soft X-ray / B-EUV&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Proof of concept&lt;/td&gt;
&lt;td&gt;2000s&lt;/td&gt;
&lt;td&gt;2010s&lt;/td&gt;
&lt;td&gt;2010s-present&lt;/td&gt;
&lt;td&gt;~2025 (JHU resist)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Lab prototype&lt;/td&gt;
&lt;td&gt;2012&lt;/td&gt;
&lt;td&gt;~2023-2025&lt;/td&gt;
&lt;td&gt;~2026 (announced)&lt;/td&gt;
&lt;td&gt;None (needs source+optics)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Fab prototype&lt;/td&gt;
&lt;td&gt;2016&lt;/td&gt;
&lt;td&gt;~2027-2029&lt;/td&gt;
&lt;td&gt;350nm node is real&lt;/td&gt;
&lt;td&gt;No roadmap&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Mass production&lt;/td&gt;
&lt;td&gt;2018 (hundreds delivered)&lt;/td&gt;
&lt;td&gt;~2030+&lt;/td&gt;
&lt;td&gt;6.7nm EUV: 10-15 years&lt;/td&gt;
&lt;td&gt;15+ years, if ever&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  The Fourth Route: Soft X-ray / Beyond EUV
&lt;/h2&gt;

&lt;p&gt;Soft X-ray (B-EUV) is not competing with the three routes above. It's on a different wavelength track entirely.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The physics argument is compelling.&lt;/strong&gt; Halve the wavelength, double the resolution — even with moderate NA (0.3-0.5) you could reach &amp;lt;5nm features, comparable to what Hyper-NA EUV (0.75+ NA, estimated $1B per tool) promises at 13.5nm. The catch: everything needed to make it work doesn't exist yet.&lt;/p&gt;

&lt;h3&gt;
  
  
  Current Progress
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Johns Hopkins University (Nature Chemical Engineering, 2025):&lt;/strong&gt; Solved one of the four critical bottlenecks — &lt;strong&gt;resist chemistry&lt;/strong&gt;. Zinc absorbs 6.5-6.7nm soft X-rays efficiently and emits electrons that trigger chemical reactions in imidazoles. They developed Chemical Liquid Deposition (CLD) to apply zinc-imidazolate frameworks (aZIF) at 1nm/sec onto silicon wafers and successfully created fine patterns.&lt;/p&gt;

&lt;p&gt;The team's own assessment: "Years from building even an experimental prototype."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Substrate (California startup, October 2025):&lt;/strong&gt; $100M funding, $1B valuation. Claims X-ray lithography (XRL) using synchrotron sources can reach sub-2nm nodes. Published 12-13nm CD images of random logic contact arrays. &lt;strong&gt;Has disclosed zero details&lt;/strong&gt; on source type, optics, mask design, or 300mm wafer throughput.&lt;/p&gt;

&lt;h3&gt;
  
  
  Technology Readiness Comparison
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Dimension&lt;/th&gt;
&lt;th&gt;LPP&lt;/th&gt;
&lt;th&gt;LDP&lt;/th&gt;
&lt;th&gt;Gas Cluster&lt;/th&gt;
&lt;th&gt;Soft X-ray&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Light source&lt;/td&gt;
&lt;td&gt;✅ Production&lt;/td&gt;
&lt;td&gt;🔧 Lab&lt;/td&gt;
&lt;td&gt;🔧 Proof of concept&lt;/td&gt;
&lt;td&gt;❌ None available&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Optics&lt;/td&gt;
&lt;td&gt;✅ Production&lt;/td&gt;
&lt;td&gt;⚠️ Basic&lt;/td&gt;
&lt;td&gt;🔧 Multilayer needed&lt;/td&gt;
&lt;td&gt;❌ From zero&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Resist&lt;/td&gt;
&lt;td&gt;✅ Mature&lt;/td&gt;
&lt;td&gt;✅ Reusable&lt;/td&gt;
&lt;td&gt;✅ Reusable&lt;/td&gt;
&lt;td&gt;🔧 JHU initial breakthrough&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Supply chain&lt;/td&gt;
&lt;td&gt;✅ Complete&lt;/td&gt;
&lt;td&gt;⚠️ Partial gaps&lt;/td&gt;
&lt;td&gt;❌ Nearly zero&lt;/td&gt;
&lt;td&gt;❌ Entirely non-existent&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;First tool cost&lt;/td&gt;
&lt;td&gt;~$200M&lt;/td&gt;
&lt;td&gt;~$10M+&lt;/td&gt;
&lt;td&gt;~$10-30M (est.)&lt;/td&gt;
&lt;td&gt;Unknown (needs accelerator)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  Critical Obstacles
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Light source&lt;/strong&gt; — No production-capable 6.5-6.7nm source exists. Gadolinium LPP is experimental, far below required power.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Optics&lt;/strong&gt; — 6.5nm light is absorbed by &lt;strong&gt;almost every material&lt;/strong&gt;. Entirely new multilayer mirror designs needed. None exist.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Zero supply chain&lt;/strong&gt; — Masks, pellicles, metrology, source — everything must be invented from scratch. No company has the incentive to invest ASML-level capital at this stage.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Substrate's XRL approach&lt;/strong&gt; — Proximity printing (1:1 mask-to-wafer, no reduction). Synchrotrons are 30m to &amp;gt;1km in circumference, costing $100M+.&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  The Most Realistic Role
&lt;/h3&gt;

&lt;p&gt;Soft X-ray / B-EUV is not an "EUV replacement." Its most plausible role: &lt;strong&gt;a wavelength-based alternative path when Hyper-NA EUV hits its cost ceiling (estimated $1B/tool)&lt;/strong&gt; — assuming 6.7nm source and optics technology mature by then.&lt;/p&gt;

&lt;h2&gt;
  
  
  What This Actually Means
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Four routes, four time windows, not zero-sum.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;13.5nm (LPP + LDP) → &lt;strong&gt;the present.&lt;/strong&gt; Covers advanced and mature nodes.&lt;/li&gt;
&lt;li&gt;6.7nm gas cluster → &lt;strong&gt;next generation&lt;/strong&gt; (10-15 years). Advantage in efficiency and wavelength differentiation.&lt;/li&gt;
&lt;li&gt;6.5-6.7nm soft X-ray → &lt;strong&gt;long-term&lt;/strong&gt; (15+ years). A wavelength hedge against Hyper-NA cost escalation.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;2. EUV is a cost variable, not a survival threshold.&lt;/strong&gt;&lt;br&gt;
Chiplet architectures + DUV multi-patterning cover enormous real-world demand. Advanced packaging reduces dependence on any single lithography node.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. The gas cluster route's real contribution.&lt;/strong&gt;&lt;br&gt;
Russia's scheme won't replace ASML. What it does is demonstrate that &lt;strong&gt;a third path might exist&lt;/strong&gt; — breaking the "only LPP can do EUV" assumption.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Soft X-ray is the longest bet.&lt;/strong&gt;&lt;br&gt;
Johns Hopkins cleared one obstacle (resist). Dozens remain. B-EUV's timeline is measured in decades, not years — but if Hyper-NA EUV really hits $1B/tool, the industry will be glad someone started working on alternatives.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Sources: ASML annual reports, published Chinese LDP research papers, Russian Academy of Sciences report (May 2026), Johns Hopkins / Nature Chemical Engineering (2025), Substrate public disclosures, and public industry data.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>semiconductors</category>
      <category>euv</category>
      <category>lithography</category>
      <category>hardware</category>
    </item>
  </channel>
</rss>
