<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: xulingfeng</title>
    <description>The latest articles on DEV Community by xulingfeng (@xulingfeng).</description>
    <link>https://dev.to/xulingfeng</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3941526%2F81ebae49-a110-4efa-82b2-4d873989014a.png</url>
      <title>DEV Community: xulingfeng</title>
      <link>https://dev.to/xulingfeng</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/xulingfeng"/>
    <language>en</language>
    <item>
      <title>I Wrote 10 AI Stories in 10 Days. My Keyboard Started Smoking on Day 4.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Tue, 09 Jun 2026 08:29:19 +0000</pubDate>
      <link>https://dev.to/xulingfeng/i-wrote-10-ai-stories-in-10-days-my-keyboard-started-smoking-on-day-4-28o9</link>
      <guid>https://dev.to/xulingfeng/i-wrote-10-ai-stories-in-10-days-my-keyboard-started-smoking-on-day-4-28o9</guid>
      <description>&lt;p&gt;Biggest thing I learned writing the &lt;strong&gt;&lt;a href="https://dev.to/xulingfeng/series/40565"&gt;AI, Ego &amp;amp; Regret&lt;/a&gt;&lt;/strong&gt; series: &lt;strong&gt;I argue with myself way more than I thought.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Every post goes through the same loop:&lt;/p&gt;

&lt;p&gt;10 PM: "This story's fire. Gonna blow up tomorrow."&lt;/p&gt;

&lt;p&gt;1 AM: "Wait — did I make it clear that 450ms wasn't just a random number?" → Scrolls back to check. Yes. OK. Move on.&lt;/p&gt;

&lt;p&gt;Next morning: "What was I thinking? Scrap it. Rewrite from scratch."&lt;/p&gt;




&lt;p&gt;The cover images were the worst part. One article went through 6 different backgrounds before circling back to the first one. 45 minutes I'll never get back.&lt;/p&gt;

&lt;p&gt;Then there's that one line: &lt;em&gt;"It was right about yesterday — and yesterday wasn't running anymore."&lt;/em&gt; Rewrote it 11 times. My wife walked by and said, "I thought you were writing code, not poetry."&lt;/p&gt;

&lt;p&gt;Ben Halpern hit me with a 5-reaction combo while I was eating instant noodles. Almost choked.&lt;/p&gt;

&lt;p&gt;Waking up at 3 AM. First instinct: check comments. Nothing. Go back to sleep. 5 minutes later: check again. Still nothing.&lt;/p&gt;

&lt;p&gt;Writing code? I'm normal. Writing stories? I'm the guy verifying his own made-up RabbitMQ number at 1 AM.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Would I do it again? Yeah. Probably. But I'd get a better keyboard this time.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;This coffee's about to run out — and I'm not done typing yet. If these stories made you smile, chuckle, or roll your eyes, &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee&lt;/a&gt; and keep the keys smoking ☕🔥&lt;/p&gt;

&lt;p&gt;Also — if you've got a story that's been sitting in your head, something that made you laugh, cringe, or question every life decision that led to that moment — send it over. I'll turn it into a story. Yours could be the next one.&lt;/p&gt;

&lt;p&gt;No pressure. Just a keyboard that's already warm.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>discuss</category>
      <category>career</category>
      <category>programming</category>
    </item>
    <item>
      <title>My company packaged 12 years of my experience into an AI Skill, then laid me off. When it crashed, the CTO called at 5x my salary.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Mon, 08 Jun 2026 14:00:00 +0000</pubDate>
      <link>https://dev.to/xulingfeng/my-company-packaged-12-years-of-my-experience-into-an-ai-skill-then-laid-me-off-when-it-crashed-4b3e</link>
      <guid>https://dev.to/xulingfeng/my-company-packaged-12-years-of-my-experience-into-an-ai-skill-then-laid-me-off-when-it-crashed-4b3e</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;A story about knowledge extraction, Kafka consumer rebalance, and what happens when a company discovers their AI Skill knows the past — but not the present.&lt;br&gt;
Based on a submission from a community member. If you have a similar story or something you need to get off your chest — reach out. The next one could be yours.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;strong&gt;Has your company ever extracted your experience into a system — then decided the system was good enough without you?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Have you watched an AI Skill handle 312 scenarios correctly, and wondered whether you'd be around when number 313 finally showed up?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This is that story.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 1 · Knowledge Extraction
&lt;/h2&gt;

&lt;p&gt;The conference room had no windows. Three months.&lt;/p&gt;

&lt;p&gt;Every Monday through Thursday I sat in that plastic chair — a voice recorder, a laptop, a mug on the table. Across from me sat an engineer named Caleb. His job was to ask me questions.&lt;/p&gt;

&lt;p&gt;"Why PostgreSQL over MongoDB?"&lt;/p&gt;

&lt;p&gt;"Why is the retry interval 450 milliseconds?"&lt;/p&gt;

&lt;p&gt;"How did you calculate that alert threshold?"&lt;/p&gt;

&lt;p&gt;I answered. He wrote it down. The red light on the recorder stayed on.&lt;/p&gt;

&lt;p&gt;They called it a &lt;strong&gt;"Knowledge Transfer Initiative."&lt;/strong&gt; The CTO's all-hands email was polished: &lt;em&gt;We're preserving decades of institutional knowledge for the next generation of engineers.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;In plain English: your experience is too expensive. We're packaging it into a Skill.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I didn't take it seriously at first. Every company has knowledge management. Then month three rolled around. Caleb stopped asking questions. He started validating.&lt;/p&gt;

&lt;p&gt;He pulled up an incident I'd debugged the year before — a production outage. He asked me to reproduce the diagnosis. Then he let the system do it.&lt;/p&gt;

&lt;p&gt;The system said: &lt;em&gt;Message broker latency spike detected. Retry logic at 450ms interval will amplify under queue backpressure. Recommend adaptive backoff.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I read it three times. It was right. It was my chain of reasoning — but it said it faster than I could.&lt;/p&gt;

&lt;p&gt;The CTO stood next to me, smiling. I've seen that smile too many times in Silicon Valley. It's not happiness. It's sign-off.&lt;/p&gt;

&lt;h2&gt;
  
  
  Act 2 · "96.8% Accuracy"
&lt;/h2&gt;

&lt;p&gt;The moment I knew something was wrong was the day the validation report came out.&lt;/p&gt;

&lt;p&gt;A full row of people sat across the table — CTO, HRBP, VP of Engineering, and a woman I didn't recognize with a consulting firm logo on her blazer.&lt;/p&gt;

&lt;p&gt;The projector displayed one number: &lt;strong&gt;Knowledge Retention Rate: 96.8%&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Caleb delivered his findings. "After three rounds of validation, the AI Skill achieved 96.8% diagnostic accuracy across 312 historical failure scenarios. The remaining 3.2% deviation is suboptimal recommendations due to insufficient context — correctable with guidance."&lt;/p&gt;

&lt;p&gt;Applause. The CEO turned to me and said something I'll never forget.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Mark, you created your own replacement."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Everyone laughed. I laughed too. What else was there to do?&lt;/p&gt;

&lt;p&gt;After the meeting I sat in the parking lot for a long time. My coffee had gone cold. Outside the window, that unchanging California blue — the same one it's always been. My phone buzzed. A text from my wife: &lt;em&gt;What time are you coming home?&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I typed: "Soon." Then I deleted it.&lt;/p&gt;

&lt;p&gt;I didn't know how to tell her I'd spent three months turning myself into a manual. &lt;strong&gt;And once you write the manual, you don't need the original.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Act 3 · The Severance
&lt;/h2&gt;

&lt;p&gt;The Monday after, the CTO closed the door for our one-on-one.&lt;/p&gt;

&lt;p&gt;"Mark, I won't sugarcoat it. Your position's been eliminated as part of the restructuring."&lt;/p&gt;

&lt;p&gt;He slid an envelope across the table.&lt;/p&gt;

&lt;p&gt;N+3. Three months of severance. The standard Silicon Valley &lt;em&gt;we're not firing you, we're helping you transition&lt;/em&gt; treatment.&lt;/p&gt;

&lt;p&gt;I signed without stopping. Not because I didn't care — because I'd already played out how this ended before I walked in.&lt;/p&gt;

&lt;p&gt;My last day was a Friday. It took me under forty minutes to clear my desk. One backpack. One monitor stand. Three pen caps — I still don't know how three pen caps ended up in my drawer.&lt;/p&gt;

&lt;p&gt;I sat in the car and searched three things on my phone:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;LLC registration (California, single-member, $70, 15 minutes)&lt;/li&gt;
&lt;li&gt;Professional liability insurance (industry standard for consultants, ~$1,200/year)&lt;/li&gt;
&lt;li&gt;Rate benchmarking for senior infrastructure consultants (median: $215/hour)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That night I registered a company. Mark Johnson Consulting LLC. No office, no employees, no VC pitch deck. One clause, written clear: project-based only. No AI in the delivery chain.&lt;/p&gt;

&lt;h2&gt;
  
  
  Act 4 · AI Skill Goes Live
&lt;/h2&gt;

&lt;p&gt;The first full quarter was a showpiece.&lt;/p&gt;

&lt;p&gt;The AI Skill absorbed 70% of tier-2 operations tickets. New engineers went from six months to three weeks to reach productivity. The CEO used a slick slide at the all-hands:&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Before: 12 years of Mark's experience locked in his head.&lt;br&gt;
After: 12 years of Mark's experience available as a prompt.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Nobody mentioned Mark himself. They didn't need to. He was already packaged.&lt;/p&gt;

&lt;p&gt;My wife noticed before I did. The weekend after I left, she looked at me across the kitchen and said something I wasn't expecting.&lt;/p&gt;

&lt;p&gt;"You seem different."&lt;/p&gt;

&lt;p&gt;"Different how?"&lt;/p&gt;

&lt;p&gt;"When you sit at your computer and don't talk — that tension's gone. You used to be waiting for things to break. Now you're waiting for clients."&lt;/p&gt;

&lt;p&gt;She was right. I was waiting.&lt;/p&gt;

&lt;p&gt;I saw on LinkedIn they'd hired a new Platform Lead. First thing he did was rebuild the monitoring stack. The Skill? Nobody re-ran validation after they migrated to Kafka. Why would they? The person who built it wasn't there anymore.&lt;/p&gt;
&lt;h2&gt;
  
  
  Act 5 · Crash
&lt;/h2&gt;

&lt;p&gt;It happened at 3:47 AM on a Wednesday.&lt;/p&gt;

&lt;p&gt;The distributed tracing system lit up. P99 latency on the core payment chain went from 80 milliseconds to 12 seconds. Three minutes in, the first transaction timed out and rolled back. At ten minutes, the payment queue started backing up. At twenty-three minutes, the rollbacks triggered a cascade — two downstream services began refusing requests.&lt;/p&gt;

&lt;p&gt;The on-call engineer pulled up the AI Skill. It scanned the logs, identified the pattern, and returned a diagnosis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Detected message broker latency. Applying known mitigation: activate retry queue with 450ms backoff.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That diagnosis had been correct 312 times out of 312 historical scenarios.&lt;/p&gt;

&lt;p&gt;This was number 313.&lt;/p&gt;

&lt;p&gt;The 450ms retry window was a compatibility shim I'd written five years earlier for RabbitMQ. The number wasn't random — I'd spent two weeks load-testing on a RabbitMQ cluster to find the exact gap that cleared its Erlang VM GC cycle.&lt;/p&gt;

&lt;p&gt;But that was RabbitMQ. Retired for three years. They were running Kafka now.&lt;/p&gt;

&lt;p&gt;Kafka's consumer groups use a poll-based protocol — every consumer has to keep polling the coordinator at a configured interval, or the coordinator marks it dead and triggers a rebalance. My retry worker was synchronous — process one message, pull the next. At 450ms a round, a queue of a few dozen messages stretched the poll window by multiple factors.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# What the Skill did (the 450ms retry window)
&lt;/span&gt;&lt;span class="k"&gt;def&lt;/span&gt; &lt;span class="nf"&gt;handle_queue&lt;/span&gt;&lt;span class="p"&gt;():&lt;/span&gt;
    &lt;span class="k"&gt;while&lt;/span&gt; &lt;span class="nf"&gt;queue_not_empty&lt;/span&gt;&lt;span class="p"&gt;():&lt;/span&gt;
        &lt;span class="n"&gt;msg&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;fetch_one_message&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;          &lt;span class="c1"&gt;# one at a time
&lt;/span&gt;        &lt;span class="nf"&gt;retry_with_backoff&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;450&lt;/span&gt;&lt;span class="n"&gt;ms&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;          &lt;span class="c1"&gt;# ⏱ 450ms per message
&lt;/span&gt;        &lt;span class="nf"&gt;process&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;msg&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;                       
        &lt;span class="nf"&gt;poll_consumer&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;                    &lt;span class="c1"&gt;# "still alive?" → into debt
&lt;/span&gt;
&lt;span class="c1"&gt;# What happened with 60 queued messages
&lt;/span&gt;&lt;span class="n"&gt;poll_timeout&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;5000&lt;/span&gt;                        &lt;span class="c1"&gt;# 5s before coordinator marks dead
&lt;/span&gt;&lt;span class="n"&gt;time_per_round&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;60&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="mi"&gt;450&lt;/span&gt;                  &lt;span class="c1"&gt;# 60 messages × 450ms = 27,000ms
# 27,000ms &amp;gt; 5,000ms → poll timeout missed → rebalance triggered
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Once the rebalance kicked in, the whole group started oscillating. New consumers claimed partitions. Old messages got redelivered. Latency climbed again. A sticky problem turned into a snowball.&lt;/p&gt;

&lt;p&gt;The AI executed a strategy that was 100% correct on the old architecture. &lt;strong&gt;On the current one, it was pouring gasoline on the fire.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The AI didn't fail because it was wrong. It failed because it was right about yesterday — and yesterday wasn't running anymore.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Act 6 · The Call
&lt;/h2&gt;

&lt;p&gt;My phone rang at 4:12 AM.&lt;/p&gt;

&lt;p&gt;A name I hadn't called in months — my former CTO.&lt;/p&gt;

&lt;p&gt;His voice was calm. Too calm. When an engineer calls at 4 AM and sounds that steady, something's on fire.&lt;/p&gt;

&lt;p&gt;"Mark. I need to ask you something."&lt;/p&gt;

&lt;p&gt;"Ask."&lt;/p&gt;

&lt;p&gt;He laid out the incident in three sentences. I thought he'd ask about rollback strategy. Damage containment. He didn't.&lt;/p&gt;

&lt;p&gt;"You left a note in the RabbitMQ migration docs. At the time, nobody understood it. We thought it was a stale config comment. I think I understand it now."&lt;/p&gt;

&lt;p&gt;I leaned back against the headboard. The only light in the room was the phone screen. My wife stirred beside me. Didn't wake up.&lt;/p&gt;

&lt;p&gt;The note said: &lt;em&gt;450ms matches RabbitMQ GC window. Do not reuse outside this context.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Silence on the other end. About four seconds.&lt;/p&gt;

&lt;p&gt;"I know," he said. "I'm out of time, Mark. Come back."&lt;/p&gt;

&lt;p&gt;"I'm not asking if you'll come back." His voice shifted. "I'm asking you to. I'll pause everything for the rest of the month. You call the shots. Name your price."&lt;/p&gt;

&lt;h2&gt;
  
  
  Act 7 · The Price
&lt;/h2&gt;

&lt;p&gt;"Five times my old salary."&lt;/p&gt;

&lt;p&gt;I heard his breath catch.&lt;/p&gt;

&lt;p&gt;"… Deal. But I need you on-site for two weeks. Contract. Bring your own equipment. No AI touches your delivery chain."&lt;/p&gt;

&lt;p&gt;"Done. Send the contract to my lawyer."&lt;/p&gt;

&lt;p&gt;I didn't have a lawyer. What I meant was: this runs at my pace.&lt;/p&gt;

&lt;p&gt;After I hung up, I scrolled to a name I hadn't called in nearly a year — Mike, college roommate, law school grad. Sent one message.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Got a contract. Mind glancing at it?&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He didn't reply. At 4 AM, who's awake except people in IT?&lt;/p&gt;

&lt;p&gt;I waited. Then I typed five more words. I sent them to him, but they were really for me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This time I set the price.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;You've been through this before? Your knowledge extracted, packaged into a Skill, turned into the reason you weren't needed anymore. And when the Skill hit something it didn't understand, they came back. Is there a better answer than "no"?&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;The Skill knew every past scenario. It just couldn't see that the infrastructure had changed.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The company saved six figures on headcount. They just forgot validation only works when the person who wrote it is still there to re-run it.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;When the system went down — who did they call?&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Follow for more stories about AI extraction, implicit knowledge, and what happens when companies discover their Skill only remembers the past.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;I couldn't teach the AI Skill to understand context. But the stories stay open. &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee&lt;/a&gt; ☕&lt;/p&gt;

</description>
      <category>ai</category>
      <category>career</category>
      <category>discuss</category>
      <category>programming</category>
    </item>
    <item>
      <title>Our VP Said AI Would Test Itself. Day 3 Cost $2.8M. I Had the Screenshots Ready.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Sun, 07 Jun 2026 09:19:56 +0000</pubDate>
      <link>https://dev.to/xulingfeng/our-vp-said-ai-would-test-itself-i-raised-my-hand-i-got-reassigned-day-3-cost-28m-i-had-the-555j</link>
      <guid>https://dev.to/xulingfeng/our-vp-said-ai-would-test-itself-i-raised-my-hand-i-got-reassigned-day-3-cost-28m-i-had-the-555j</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;This story is based on a submission from a community member. If you have a similar story or something you need to get off your chest — reach out. The next one could be yours.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Act 1 · The Tech Meeting
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;"Starting today — no more hand-written code."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Marcus, the new VP of Engineering, put a slide up on the big screen.&lt;/p&gt;

&lt;p&gt;Four words: &lt;strong&gt;WRITING BY HAND IS OVER.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I was sitting in the back row, against the wall. Seven years at this company. Three core modules that I'd built from scratch. Two production systems that ran the company's primary revenue stream. Now someone was telling me — don't write anymore.&lt;/p&gt;

&lt;p&gt;The room went quiet for about five seconds. Then people started whispering. Someone pulled out a phone and took a picture of the slide.&lt;/p&gt;

&lt;p&gt;Marcus added: "AI coding isn't optional — it's a mandatory development standard. We benchmarked this. AI writes code 400% faster than humans. Anyone still typing manually is wasting the company's time."&lt;/p&gt;

&lt;p&gt;I raised my hand.&lt;/p&gt;

&lt;p&gt;"Who reviews the code?"&lt;/p&gt;

&lt;p&gt;"AI reviews it."&lt;/p&gt;

&lt;p&gt;"Who writes the tests?"&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"AI tests itself."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"What if AI writes something wrong?"&lt;/p&gt;

&lt;p&gt;Marcus laughed. Not a polite laugh. The kind of laugh you give someone whose question you've already decided doesn't matter.&lt;/p&gt;

&lt;p&gt;"Let me ask you something."&lt;/p&gt;

&lt;p&gt;He paused.&lt;/p&gt;

&lt;p&gt;"Do you really think — &lt;strong&gt;you, one person — have more training data than Orion-7?&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;People started laughing. Not supportive laughter. &lt;strong&gt;Pile-on laughter.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Or do you think the world's AI companies — hundreds of billions in investment, tens of thousands of GPUs — built something that's &lt;em&gt;less&lt;/em&gt; reliable than one backend developer?"&lt;/p&gt;

&lt;p&gt;Nobody was looking at me anymore. Everyone was watching him, waiting for the kill shot.&lt;/p&gt;

&lt;p&gt;He didn't take it. He just smiled. "Starting next sprint, it's AI across the board. Anyone who has concerns — my door's open."&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 2 · The Reassignment
&lt;/h2&gt;

&lt;p&gt;I didn't go to his door.&lt;/p&gt;

&lt;p&gt;HR notified me the next day: I was being moved to the Legacy Systems team.&lt;/p&gt;

&lt;p&gt;Three people. Three projects that hadn't been touched in five years. No new requirements. No QA handoffs. No deadlines — because nobody used the software.&lt;/p&gt;

&lt;p&gt;I handed off my work and sat down at my new desk.&lt;/p&gt;

&lt;p&gt;A beat-up laptop sat there. Screen cracked in the bottom-right corner.&lt;/p&gt;

&lt;p&gt;I opened my email — three new messages. All from the repo: my write access had been revoked on all three legacy repositories. Review-only.&lt;/p&gt;

&lt;p&gt;I went to find Marcus.&lt;/p&gt;

&lt;p&gt;"About the repo permissions —"&lt;/p&gt;

&lt;p&gt;"Oh, right." He looked like he'd been waiting for me. "New system code runs under the new policy. You're not familiar with the AI workflow, so I set you up with read-only access to the legacy repos."&lt;/p&gt;

&lt;p&gt;He leaned back in his chair.&lt;/p&gt;

&lt;p&gt;"Oh, and I cancelled your Copilot license. Those three legacy projects — not worth the GPU cycles for AI anyway. Write it by hand. You don't trust AI anyway."&lt;/p&gt;

&lt;p&gt;He smiled. He knew exactly what that line would do. He was waiting to see if I'd say something.&lt;/p&gt;

&lt;p&gt;I didn't.&lt;/p&gt;

&lt;p&gt;I went back to my desk, opened the cracked laptop, connected to the internal network, and cloned the new system's main repo. Public read-only access. He'd forgotten to lock it down.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 3 · Building a Case
&lt;/h2&gt;

&lt;p&gt;The new team delivered the order module in two weeks.&lt;/p&gt;

&lt;p&gt;The CEO posted in the company-wide Slack channel:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"New order module is live. AI-generated code end-to-end. Zero-defect delivery. Let's give the team a round of applause."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The channel filled with clapping emojis. Someone @'d Marcus: "That's insane efficiency."&lt;/p&gt;

&lt;p&gt;I was in the channel. I didn't say anything.&lt;/p&gt;

&lt;p&gt;What the CEO called "live" wasn't a full cutover. It was a canary — the new module running alongside the old system. Low traffic. Simple orders. Nothing that would hit an edge case.&lt;/p&gt;

&lt;p&gt;But the code was sitting on the server.&lt;/p&gt;

&lt;p&gt;I opened a terminal in the new system's code directory and ran one line:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;pytest tests/ &lt;span class="nt"&gt;--tb&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;short &lt;span class="nt"&gt;-q&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Output:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight console"&gt;&lt;code&gt;&lt;span class="go"&gt;..............F.......FFF......
====== 8 failed, 15 passed in 2.87s ======
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;15 passed, 8 failed.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I ran another line:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="nb"&gt;grep&lt;/span&gt; &lt;span class="nt"&gt;-rn&lt;/span&gt; &lt;span class="s2"&gt;"TODO"&lt;/span&gt; src/ | &lt;span class="nb"&gt;wc&lt;/span&gt; &lt;span class="nt"&gt;-l&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;47.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One more — I hit the API:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-s&lt;/span&gt; https://new-api.xxx.com/v2/order/123456 | jq &lt;span class="s1"&gt;'.discount_detail'&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Response:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"order_id"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"ORD-2026-0614-0123456"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"subtotal"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;12500.00&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"line_items"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="nl"&gt;"sku"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"A-200"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"qty"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"price"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;800.00&lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="nl"&gt;"sku"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"B-150"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"qty"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;5&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"price"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;900.00&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"discounts"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"promo_early"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"amount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;500.00&lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"member_gold"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"amount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;1875.00&lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"volume_tier_3"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="nl"&gt;"amount"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mf"&gt;1250.00&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"discount_detail"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="kc"&gt;null&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The API docs said: &lt;strong&gt;"discount_detail: object, required. Contains breakdown of all applied discounts."&lt;/strong&gt; It returned &lt;code&gt;null&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;I stared at the screen for thirty seconds.&lt;/p&gt;

&lt;p&gt;Then I created a folder.&lt;/p&gt;

&lt;p&gt;Called it &lt;code&gt;evidence/&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;I opened my email. Wrote to Marcus with the subject "New Order Module — Test Failures Found." Attached the pytest output, the TODO count, and the &lt;code&gt;null&lt;/code&gt; response.&lt;/p&gt;

&lt;p&gt;He didn't reply.&lt;/p&gt;

&lt;p&gt;For the next month, nobody touched that code. Marcus was preparing the Q3 board deck. The AI team was writing prompts for the next module. And me — I was on Legacy Systems. My job was to not touch anything.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 4 · The Glory
&lt;/h2&gt;

&lt;p&gt;The new order module ran canary for a month. Low traffic. Simple orders. Zero incidents.&lt;/p&gt;

&lt;p&gt;The CEO was happy. Marcus asked for a product demo.&lt;/p&gt;

&lt;p&gt;The screen in the conference room showed a perfect happy path. Clean data. Discount calculations stacking correctly. Coupons applying. One-click ordering. Everything worked.&lt;/p&gt;

&lt;p&gt;The CEO said: "You're two months ahead of my timeline."&lt;/p&gt;

&lt;p&gt;Marcus smiled.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Because I used AI. Some people are still writing code by hand. You can see the difference, right?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He glanced at the back row.&lt;/p&gt;

&lt;p&gt;I was in the back row. What that look meant — &lt;strong&gt;everyone in the room knew.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I stood up.&lt;/p&gt;

&lt;p&gt;"The module — "&lt;/p&gt;

&lt;p&gt;Marcus cut me off.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"— has a problem?&lt;/strong&gt; Zero incidents in canary. Full green on the test dashboard. You've been on legacy — have you even looked at the new code?"&lt;/p&gt;

&lt;p&gt;The CEO glanced at me. Then turned back.&lt;/p&gt;

&lt;p&gt;"Marcus — run through the full Q3 cutover plan again."&lt;/p&gt;

&lt;p&gt;I stood there for maybe three seconds. Nobody looked at me. Nobody asked if I had more to say.&lt;/p&gt;

&lt;p&gt;I sat down.&lt;/p&gt;

&lt;p&gt;The CEO made the call: &lt;strong&gt;"Full Q3 cutover. Start sunsetting the legacy systems."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Marcus got his scope expanded — he now owned the entire engineering org. No more CTO sign-off needed.&lt;/p&gt;

&lt;p&gt;People patted him on the shoulder on the way out. "Well deserved." He said thanks. He walked right past me — &lt;strong&gt;he didn't need to look at me anymore.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 5 · The Crash
&lt;/h2&gt;

&lt;p&gt;Day 3 of the full cutover.&lt;/p&gt;

&lt;p&gt;4:27 PM. The CFO walked straight into the CEO's office.&lt;/p&gt;

&lt;p&gt;Not a complaint. Worse.&lt;/p&gt;

&lt;p&gt;"A major client called me directly. Every order we've processed in the last three days has the wrong discount amount. Not some of them — &lt;strong&gt;every single one.&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;Root cause analysis:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;AI-generated code was computing compound discounts
with no defined calculation order.
The prompt didn't specify:
  "calculate line-item discounts first → then subtotal → then member tier."
The model guessed an order.
It guessed a different one on every request.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Canary hadn't caught it — because canary traffic only hit simple orders with single discounts. Full production on day one brought in bulk orders from major clients. Every single one computed discounts differently.&lt;/p&gt;

&lt;p&gt;Impact: 3 consecutive days. &lt;strong&gt;$2,800,000 in unreconciled billing.&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Value&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Orders affected&lt;/td&gt;
&lt;td&gt;1,247&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Average discrepancy per order&lt;/td&gt;
&lt;td&gt;$2,246&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Largest single order discrepancy&lt;/td&gt;
&lt;td&gt;$18,740&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Enterprise clients impacted&lt;/td&gt;
&lt;td&gt;16&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Longest uncorrected window&lt;/td&gt;
&lt;td&gt;72 hours&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The CEO called Marcus into his office. I wasn't in the room — but someone told me later he came out looking pale. CEO had HR schedule a postmortem.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 6 · The Reckoning
&lt;/h2&gt;

&lt;p&gt;Postmortem. The CEO chaired. Marcus sat in the front row. The CTO, CFO, finance team, and the enterprise account managers filled the room.&lt;/p&gt;

&lt;p&gt;I was there too. Back row.&lt;/p&gt;

&lt;p&gt;Marcus opened: "The prompt missed a business rule. The team is fixing it now."&lt;/p&gt;

&lt;p&gt;The CTO asked: "Canary ran for a month. How did nobody catch this?"&lt;/p&gt;

&lt;p&gt;A pause. "The canary environment didn't have complex discount data."&lt;/p&gt;

&lt;p&gt;"So your logic is — if the data wasn't in canary, it wouldn't be a problem in production?"&lt;/p&gt;

&lt;p&gt;The room went quiet.&lt;/p&gt;

&lt;p&gt;I raised my hand.&lt;/p&gt;

&lt;p&gt;"Can I share something?"&lt;/p&gt;

&lt;p&gt;Marcus turned and looked at me.&lt;/p&gt;

&lt;p&gt;The CTO nodded. "Put it on screen."&lt;/p&gt;

&lt;p&gt;I walked to the front. Plugged my laptop into the projector.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Slide 1:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# src/orders/discount_calculator.py:127
&lt;/span&gt;
&lt;span class="k"&gt;def&lt;/span&gt; &lt;span class="nf"&gt;apply_compound_discount&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;order&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="nb"&gt;dict&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;user&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="nb"&gt;dict&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;-&amp;gt;&lt;/span&gt; &lt;span class="nb"&gt;float&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
    &lt;span class="sh"&gt;"""&lt;/span&gt;&lt;span class="s"&gt;
    Apply tiered discounts to an order.
    TODO: handle compound discount ordering — currently iterates
          over discount fields in JSON insertion order, which means
          the calculation sequence depends on how the client serialized
          the payload. This is not guaranteed to be consistent across
          requests. Needs explicit ordering: line_item → subtotal → member.
    &lt;/span&gt;&lt;span class="sh"&gt;"""&lt;/span&gt;
    &lt;span class="n"&gt;total&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;order&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;subtotal&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
    &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;key&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;rule&lt;/span&gt; &lt;span class="ow"&gt;in&lt;/span&gt; &lt;span class="n"&gt;order&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;get&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;discounts&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="p"&gt;{}).&lt;/span&gt;&lt;span class="nf"&gt;items&lt;/span&gt;&lt;span class="p"&gt;():&lt;/span&gt;
        &lt;span class="n"&gt;total&lt;/span&gt; &lt;span class="o"&gt;-=&lt;/span&gt; &lt;span class="n"&gt;rule&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;amount&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;  &lt;span class="c1"&gt;# ← order depends on dict insertion order
&lt;/span&gt;    &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="nf"&gt;max&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;total&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mf"&gt;0.0&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;"This isn't a bug. It's a TODO. There were 47 of these in the codebase when it shipped to production. Distributed across every critical path in the module."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Slide 2:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# tests/test_discount_complex/test_discount_order.py — last modified 2019
&lt;/span&gt;
&lt;span class="k"&gt;def&lt;/span&gt; &lt;span class="nf"&gt;test_compound_discount_sequence&lt;/span&gt;&lt;span class="p"&gt;():&lt;/span&gt;
    &lt;span class="sh"&gt;"""&lt;/span&gt;&lt;span class="s"&gt;Verify discount application order: line_item → subtotal → member.

    This sequence was confirmed with Finance in 2019 and must not
    be changed without cross-team approval.
    &lt;/span&gt;&lt;span class="sh"&gt;"""&lt;/span&gt;
    &lt;span class="n"&gt;order&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;sample_order&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
        &lt;span class="n"&gt;items&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="p"&gt;[{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sku&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;A&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;price&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mf"&gt;100.00&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;qty&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mi"&gt;5&lt;/span&gt;&lt;span class="p"&gt;}],&lt;/span&gt;
        &lt;span class="n"&gt;subtotal_discount_pct&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;member_tier&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gold&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="n"&gt;result&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;legacy_discount_engine&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;apply&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;order&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

    &lt;span class="c1"&gt;# Expected: 500 * 0.9 (subtotal) * 0.85 (member) = 382.50
&lt;/span&gt;    &lt;span class="k"&gt;assert&lt;/span&gt; &lt;span class="nf"&gt;abs&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;result&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;final&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="mf"&gt;382.50&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt; &lt;span class="mf"&gt;0.01&lt;/span&gt;
    &lt;span class="k"&gt;assert&lt;/span&gt; &lt;span class="n"&gt;result&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;application_log&lt;/span&gt; &lt;span class="o"&gt;==&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;
        &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;line_item&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;subtotal&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;member&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="p"&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;"The old system had this test. Written five years ago."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Slide 3:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;vibe-coding-take-down/evidence/
├── screenshot-01-unit-test-failures.png  (15 pass, 8 fail)
├── screenshot-02-todo-list.png           (47 TODOs)
└── screenshot-03-api-null.png            (API returning null)

Generated: 27 days before production cutover.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;"Twenty-seven days before this went to production — I ran the tests. Fifteen passed, eight failed. Forty-seven TODOs. I screenshotted every single one."&lt;/p&gt;

&lt;p&gt;I let that sit.&lt;/p&gt;

&lt;p&gt;"&lt;strong&gt;Twenty-seven days. These tests sat in CI failing for almost a month. Did anyone open them?&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;Silence.&lt;/p&gt;

&lt;p&gt;I asked again: "Did anyone look at the test report?"&lt;/p&gt;

&lt;p&gt;Marcus didn't look at me. He looked at the CTO.&lt;/p&gt;

&lt;p&gt;"During canary… we were focused on business metrics. Unit tests weren't fully executed. The timeline was tight."&lt;/p&gt;

&lt;p&gt;I didn't answer him. I put up slide four.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Slide 4: a screenshot of a Slack message.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Marcus's own message, sent before AI development began.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"Don't spend too much time on testing. AI generates its own code — it'll verify itself. Let's ship first, we'll backfill tests later."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Nobody moved. Nobody spoke.&lt;/p&gt;

&lt;p&gt;I closed my laptop.&lt;/p&gt;

&lt;p&gt;"&lt;strong&gt;Marcus. You said AI would test itself. Forty-seven TODOs unaddressed. Eight failing tests. Two point eight million dollars lost. What exactly did your AI verify?&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;Marcus didn't answer.&lt;/p&gt;

&lt;p&gt;The CEO spoke. He wasn't looking at Marcus. He was looking at me.&lt;/p&gt;

&lt;p&gt;"When did you build that folder?"&lt;/p&gt;

&lt;p&gt;"Twenty-seven days before production."&lt;/p&gt;

&lt;p&gt;"Why didn't you say anything?"&lt;/p&gt;

&lt;p&gt;"I did. At the demo. I stood up and tried to say something. Marcus cut me off. &lt;strong&gt;Nobody let me finish.&lt;/strong&gt; "&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 7 · The Aftermath
&lt;/h2&gt;

&lt;p&gt;The CEO sent a company-wide email the next morning:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"Effective immediately: all AI-generated code must pass a line-by-line review by a senior engineer before merging to main. Test coverage requirements are non-negotiable. This policy cannot be overridden."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Marcus kept his title.&lt;/p&gt;

&lt;p&gt;But the same day that email went out, HR released an updated org chart. A new Code Review Board, independent from Engineering, reporting directly to the CTO. I was put in charge.&lt;/p&gt;

&lt;p&gt;Three people on the board. The other two came with me from Legacy Systems — solid engineers who never learned to play the room.&lt;/p&gt;

&lt;p&gt;Marcus's team no longer had final say on what got merged.&lt;/p&gt;

&lt;p&gt;Someone asked me later: "You waited a month for one meeting? Was it worth it?"&lt;/p&gt;

&lt;p&gt;I said: "I didn't wait a month for a chance to prove I was right."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"I waited for him to say 'AI will test itself' in front of the whole company. I wanted to watch him swallow every word."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I deleted the evidence folder later. Didn't need it anymore.&lt;/p&gt;

&lt;p&gt;The next day I walked past Marcus's office. The door was open. He was sitting at his monitor, reading a document about the new code review process — the one I'd written.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Forty-seven TODOs. Not a single one became a bug. The AI just guessed a calculation order where no rule was specified — a different guess on every request. When the process itself doesn't exist, AI will help you prove that "doesn't exist" is good enough.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;What's the worst "AI will handle it" decision you've seen on your team? Drop it in the comments.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;It took me thirty seconds to create an evidence/ folder — and one second to delete it. Forty-seven TODOs taught me one thing: the only calculation order I trust is the one my coffee cup follows.&lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;Buy me a coffee ☕&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Follow for more stories about AI testing, quality engineering, and what happens when the code is generated but not verified.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>discuss</category>
      <category>programming</category>
      <category>career</category>
    </item>
    <item>
      <title>I Tried to Fix a Vulnerability. A $1,400,000 AI System Said No. Twenty Days Later, That Vulnerability Cost $4,200,000.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Sat, 06 Jun 2026 00:44:28 +0000</pubDate>
      <link>https://dev.to/xulingfeng/i-tried-to-fix-a-vulnerability-a-1400000-ai-system-said-no-twenty-days-later-that-5d1m</link>
      <guid>https://dev.to/xulingfeng/i-tried-to-fix-a-vulnerability-a-1400000-ai-system-said-no-twenty-days-later-that-5d1m</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;This story was shared by a fellow developer on DEV who asked to remain anonymous. If you've got a story to tell — come find me. Your name won't appear anywhere.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Based on real microservice security design patterns. About an engineer whose PR got blocked by an AI security system — he thought he was fixing a vulnerability. Turns out, someone had a vested interest in that vulnerability staying open.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  1. $1,400,000
&lt;/h2&gt;

&lt;p&gt;All-hands meeting. CTO James stood at the front, a number on the screen:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;$1,400,000&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"This is what we're spending on security this year." He pointed at the number. "The biggest piece — right here."&lt;/p&gt;

&lt;p&gt;He clicked the remote. VoidSentinel's architecture topology appeared on screen.&lt;/p&gt;

&lt;p&gt;"VoidSentinel — an AI security platform. Integrated into our CI/CD pipeline. Starting today, every PR involving internal service-to-service calls — it reviews them automatically."&lt;/p&gt;

&lt;p&gt;The CEO didn't show up today. James didn't mention it. He looked straight at Mark — VP of Security.&lt;/p&gt;

&lt;p&gt;Mark took the mic. "VoidSentinel has been running in our pre-production environment for three weeks. It's caught 47 high-risk patterns. Zero false positives."&lt;/p&gt;

&lt;p&gt;He paused.&lt;/p&gt;

&lt;p&gt;"&lt;strong&gt;— Of course, some people might feel uncomfortable when their PR gets blocked. But this isn't personal. This is the security standard.&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;He wasn't looking at me. But I knew who he was talking about.&lt;/p&gt;




&lt;h2&gt;
  
  
  2. High Risk. Denied.
&lt;/h2&gt;

&lt;p&gt;The story started three weeks earlier.&lt;/p&gt;

&lt;p&gt;We had a payment service and a user service that talked to each other internally. They shared an old API key — one key across thirty-plus services, unchanged for five years.&lt;/p&gt;

&lt;p&gt;It wasn't that nobody knew. It just never made it to the top of the backlog.&lt;/p&gt;

&lt;p&gt;On Day 1, I opened a PR: add independent service-to-service auth between the payment and user services. Not much code — a new token exchange module, three call sites modified.&lt;/p&gt;

&lt;p&gt;Five minutes later, VoidSentinel's automated comment hit:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"High-risk alert: Unauthorized internal access pattern change detected. This PR has been automatically rejected. Contact the security team."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I stared at that comment for a long time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This isn't "high risk."&lt;/strong&gt; This is adding independent auth between two services. This is fixing a hole that's been open for five years.&lt;/p&gt;

&lt;p&gt;I replied under the PR: "This PR fixes a shared credential vulnerability between services. It is not introducing a new attack surface. Please escalate for manual review."&lt;/p&gt;

&lt;p&gt;The next day, the reply came:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"VoidSentinel's verdict is final. Please reference the system's suggested modifications to adjust your code."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The system's "suggested modification" was: don't change anything.&lt;/p&gt;




&lt;h2&gt;
  
  
  3. This Is Final
&lt;/h2&gt;

&lt;p&gt;I walked to Mark's office.&lt;/p&gt;

&lt;p&gt;"That PR — it's not an attack. It's a fix. The payment service and user service are still sharing a five-year-old API key. If any one service gets compromised, the other thirty are exposed."&lt;/p&gt;

&lt;p&gt;Mark didn't look up.&lt;/p&gt;

&lt;p&gt;"VoidSentinel's model analyzed your code and flagged it as high risk. I trust its judgment."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Its model can't read intent. It just sees someone modifying service-to-service auth — which looks exactly like what an attacker would do."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Mark finally looked up.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"So what's your suggestion? Everyone can just modify internal auth whenever they want, because 'I had good intentions'?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;" — Add a manual review gate. High-risk PRs go through AI first, then human review if rejected."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"I am the human review. The system flagged high risk, and I agree. PR denied."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I went back to my desk.&lt;/p&gt;

&lt;p&gt;I tried a different approach: instead of modifying the auth layer, add a call allowlist at the gateway level. That wouldn't trigger VoidSentinel's "service auth change" detection pattern.&lt;/p&gt;

&lt;p&gt;Submitted the PR.&lt;/p&gt;

&lt;p&gt;VoidSentinel verdict: &lt;strong&gt;High risk. Denied.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;— It recognized I'd changed the approach. The model's coverage was deeper than I thought.&lt;/p&gt;

&lt;p&gt;But it still couldn't cover the real vulnerability.&lt;/p&gt;




&lt;h2&gt;
  
  
  4. PIP
&lt;/h2&gt;

&lt;p&gt;Day 3. I got the PIP notice.&lt;/p&gt;

&lt;p&gt;Subject: &lt;strong&gt;"Assessment of Alignment with Company Security Protocols"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Mark's office. He slid the PIP notice across the desk.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Two attempts to bypass VoidSentinel's security review. This isn't a technical issue — it's an attitude problem."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"I submitted fixes. Not bypasses."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"From the system's perspective, you modified internal auth paths. The system flagged high risk. You changed the approach and submitted again. That's a bypass."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I looked at the PIP notice.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"30-day improvement plan. Success criteria: zero compliance violations. Fail — you're out."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I signed it.&lt;/p&gt;

&lt;p&gt;Walking out of Mark's office, I ran into Jay from ops in the hallway. He lowered his voice:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"— You know Mark's a board appointment, right? The CEO's been trying to move him for almost a year. Watch your back. Also — VoidSentinel? He picked it. Sold the board on it for six months before they approved it. You tell him he picked wrong — he's not going to admit it."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I nodded.&lt;/p&gt;

&lt;p&gt;Back at my desk, I opened VoidSentinel's architecture documentation.&lt;/p&gt;




&lt;h2&gt;
  
  
  5. Nobody Knew
&lt;/h2&gt;

&lt;p&gt;For the next 24 days, I did three things.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;One: went through the PIP motions during the day.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;No more touching service-to-service auth in any PR. VoidSentinel stopped flagging my code. Mark sent PIP progress emails every Friday. I replied every Friday with "on track."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Two: built a proof of concept at night.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;VoidSentinel's core blind spot — I confirmed it — wasn't that it was too sensitive. It was that it applied &lt;strong&gt;the exact same detection rules to fixes and attacks&lt;/strong&gt;. The code that fixes a vulnerability and the code that exploits one — to VoidSentinel's model, they're indistinguishable.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;That means: it can't tell who's fixing something and who's breaking something.&lt;/strong&gt; It only knows someone's modifying service-to-service auth. Doesn't matter who you are, or why you're doing it.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Three: set up a monitor.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It didn't block anything. It just logged. Logged every "high-risk" PR VoidSentinel intercepted. Logged the PR content. The submitter. The resolution.&lt;/p&gt;

&lt;p&gt;I piped it to a server I controlled. Auth call logs from 138 service nodes, synced in real time.&lt;/p&gt;

&lt;p&gt;Mark didn't know.&lt;br&gt;
Jay didn't know.&lt;br&gt;
&lt;strong&gt;Nobody knew.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  6. Zero Incidents
&lt;/h2&gt;

&lt;p&gt;Day 10. Mark sent a company-wide email:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"VoidSentinel has been live for two weeks. 217 high-risk PRs intercepted. Zero security incidents. This is the security standard we need."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I read that email. Then I opened my monitor.&lt;/p&gt;

&lt;p&gt;Out of 217 interceptions, 43 were security fixes that got falsely blocked. &lt;strong&gt;And 3 of those fixes were for vulnerabilities that — if exploited — VoidSentinel couldn't detect at all.&lt;/strong&gt; Because those vulnerabilities weren't in north-south traffic. They were in the gaps between services.&lt;/p&gt;

&lt;p&gt;VoidSentinel can't see what it isn't deployed to look at.&lt;/p&gt;




&lt;h2&gt;
  
  
  7. No
&lt;/h2&gt;

&lt;p&gt;Day 22.&lt;/p&gt;

&lt;p&gt;Mark's office. "You're more than halfway through the PIP. How are things?"&lt;/p&gt;

&lt;p&gt;"On track."&lt;/p&gt;

&lt;p&gt;"Anything you want to talk about?"&lt;/p&gt;

&lt;p&gt;"No."&lt;/p&gt;

&lt;p&gt;He looked at me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Alex. Your technical skills are not in question. If you're willing to put something in writing — acknowledging VoidSentinel's security review process — the PIP can end early."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I stood up.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"VoidSentinel's security review process — 43 false positives. 3 of them blocked real fixes. That's the only thing I can put in writing."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I didn't wait for his response.&lt;/p&gt;




&lt;h2&gt;
  
  
  8. $4,200,000
&lt;/h2&gt;

&lt;p&gt;Day 27. 2:47 AM.&lt;/p&gt;

&lt;p&gt;My phone lit up. Not a VoidSentinel alert — my own monitor.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Anomaly detected: service-to-service auth call — credential KT-9f4 — flagged as 'high risk' by VoidSentinel, then automatically cleared three minutes later."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I sat up. Opened my laptop.&lt;/p&gt;

&lt;p&gt;At 1:12 AM, someone had initiated an auth call on the payment service's dev interface. Credential KT-9f4 — a service account belonging to an employee who left four months ago.&lt;/p&gt;

&lt;p&gt;VoidSentinel flagged it. Then automatically cleared it.&lt;/p&gt;

&lt;p&gt;Reason: &lt;strong&gt;"Credential is valid. Call frequency normal. Classified as normal operation."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;— The attacker used a legitimate credential. Normal frequency. Normal path. They just accessed a service they shouldn't.&lt;/p&gt;

&lt;p&gt;VoidSentinel recognized "someone's modifying service-to-service auth" — but after evaluating the call frequency and credential validity, its model decided this was a routine operation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;It did nothing.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I opened VoidSentinel's dashboard. Threat score: &lt;strong&gt;0.02&lt;/strong&gt;. All green.&lt;/p&gt;

&lt;p&gt;Then I opened the Slack channel.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Payment reconciliation is off by $4.2 million. Anyone looking?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I took three screenshots:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;My monitor log — credential KT-9f4 at 1:12 AM, cross-service call&lt;/li&gt;
&lt;li&gt;VoidSentinel's audit log — "high risk" → cleared three minutes later → "normal"&lt;/li&gt;
&lt;li&gt;The Slack message — $4.2 million missing&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Then I opened the PR I'd submitted on Day 1.&lt;/p&gt;

&lt;p&gt;VoidSentinel's verdict: &lt;strong&gt;High risk. Denied.&lt;/strong&gt;&lt;br&gt;
Fix content: independent auth between payment service and user service.&lt;br&gt;
— The exact same link that was exploited at 1:12 AM.&lt;/p&gt;

&lt;p&gt;I sent the three screenshots and that PR to CTO James. CC'd the CEO.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;One line in the body:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Day 27. The thing I tried to fix on Day 1 — someone used it tonight. VoidSentinel flagged it for three minutes, then let it through. PIP has 3 days left. Your call."&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  9. RCA
&lt;/h2&gt;

&lt;p&gt;CTO James. VP Mark. VP of Finance. VP of Legal. The CEO — everyone in the room. One person I didn't recognize sat in the corner. Someone called him a "board observer."&lt;/p&gt;

&lt;p&gt;James ran through the incident timeline first. When he said "the attack vector exploited an internal auth gap," the CEO raised a hand and stopped him.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Alex. When did you submit your first PR?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Day 1."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"What was it for?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Independent auth between the payment and user services."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"What was VoidSentinel's verdict?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"High risk. Denied."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"And then?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I didn't answer. I opened the PR.&lt;/p&gt;

&lt;p&gt;I flipped to the first page — the code diff. A new token exchange module. Three call sites changed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"This was a fix. VoidSentinel flagged it as high risk."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Then I flipped to the 1:12 AM attack log.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"This was an attack. VoidSentinel flagged it as high risk — then automatically cleared it. Reason: 'credential valid, frequency normal.' Same system. Same vulnerability. Same detection model. One blocked. One let through."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I closed the laptop.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"— Because VoidSentinel can't tell the difference between a fix and an attack. It only knows someone's modifying things. Whether that person is patching or exploiting — it doesn't know. It can't."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The room went quiet for a few seconds.&lt;/p&gt;

&lt;p&gt;The CEO didn't look at Mark. He looked at me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"When were you put on PIP?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Day 3."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"What was the reason?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Two attempts to bypass VoidSentinel's security review."&lt;/p&gt;

&lt;p&gt;The CEO didn't say anything. He turned to Mark and asked quietly:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"He submitted a fix. You gave him a PIP. Is that right?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;James's eyebrow twitched.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Mark. The report you gave me said 'two attempts to bypass security review.' You didn't mention he was submitting a fix."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Mark opened his mouth. "His submission method — "&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"He proposed independent auth. You said no. He tried a gateway-level approach. You said no. The vulnerability he reported — someone exploited it tonight. Your system — flagged it for three minutes, then let it through."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The CEO's voice was flat. Flat enough that the silence after it felt heavier than any shout.&lt;/p&gt;

&lt;p&gt;Mark couldn't find words.&lt;/p&gt;

&lt;p&gt;The CEO stood up.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Alex. My office."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I stood up. In my peripheral vision — Mark stayed seated. He didn't follow.&lt;/p&gt;




&lt;h2&gt;
  
  
  10. Eleven Months
&lt;/h2&gt;

&lt;p&gt;The CEO walked fast down the hall. I followed. He didn't turn around.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"PIP rescinded. Effective immediately."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Okay."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Your fix gets deployed today."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;" — VoidSentinel will block it."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"VoidSentinel gets reconfigured this afternoon. Your PR gets re-reviewed after the config update."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Okay."&lt;/p&gt;

&lt;p&gt;He stopped.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"That monitor you built. How long has it been running?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Day 3."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The day you got the PIP?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"The day I got the PIP."&lt;/p&gt;

&lt;p&gt;He looked at me. Maybe five seconds.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"You know what it means — being on PIP and still building something that caught an attack before a $1.4M system did?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;" — It means those 27 days on PIP weren't wasted."&lt;/p&gt;

&lt;p&gt;He smiled. Not a "good job" smile. A different kind.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;" — You know how Mark got that seat?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I paused.&lt;/p&gt;

&lt;p&gt;"Board appointment, I heard."&lt;/p&gt;

&lt;p&gt;The CEO nodded.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"I waited eleven months for a reason."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He didn't finish the sentence. He didn't need to.&lt;/p&gt;

&lt;p&gt;I went back to my desk. Mark's desk was empty. HR had completed everything by 3 PM.&lt;/p&gt;

&lt;p&gt;I never replied to Mark's last PIP progress email. The last one sat at Day 27.&lt;/p&gt;

&lt;p&gt;Subject line: &lt;strong&gt;"On track."&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Later, I reopened that Day 1 PR. VoidSentinel ran it again — this time it passed, with a note: "Low risk. Recommended to merge."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;I didn't feel good about it.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;I just remembered what Jay said to me in the hallway that day.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;— "The CEO's been trying to move him for almost a year. Watch your back."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;em&gt;The reason he was waiting for was never about me.&lt;/em&gt;&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;&lt;em&gt;It was about someone finally handing him a board he didn't have to wait on anymore.&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Folks, when you submit a fix that gets blocked —&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;are you fixing a bug, or finishing someone else's chess move?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;👇&lt;/p&gt;

&lt;p&gt;&lt;em&gt;To the person fixing vulnerabilities at 3 AM — this one's for you. &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;Buy me a coffee ☕&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>discuss</category>
      <category>career</category>
    </item>
    <item>
      <title>Our VP's AI Wrote 3,000 Tests. Production Cost $700K. I Deleted Every Single One.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Thu, 04 Jun 2026 23:50:06 +0000</pubDate>
      <link>https://dev.to/xulingfeng/our-vps-ai-wrote-3000-tests-production-cost-700k-i-deleted-every-single-one-5536</link>
      <guid>https://dev.to/xulingfeng/our-vps-ai-wrote-3000-tests-production-cost-700k-i-deleted-every-single-one-5536</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Based on real industry trends. About an AI testing tool promising 300x efficiency, a VP who rebranded hand-written automation as "manual testing," and a $700K SLA bill nobody saw coming.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Act 1: The All-Hands
&lt;/h2&gt;

&lt;p&gt;VP Harrison stood in front of the screen, the AI testing dashboard glowing behind him.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Three days. Three thousand test cases. Zero human intervention."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He paused. Let his eyes sweep the room. They landed on me.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"And some people — six years. Maintained four hundred automated test cases. That's less than twenty per person per year."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A few people looked at their phones. Others studiously avoided my direction.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"I'm not here to debate efficiency. I'm here to ask — why does your team still exist?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I opened my notebook.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Mr. Harrison. What's the coverage on those three thousand AI-generated tests?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"One hundred percent."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"And how many bugs did they find?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A beat.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"The first phase is about regression coverage, not —"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Zero."&lt;/strong&gt; I cut him off. &lt;strong&gt;"Three thousand tests, zero bugs. You ran three thousand checks on 'what the code does' and not a single one on 'what the code should do.' That's coverage, not quality."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;VP Harrison smiled.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"I understand your anxiety. When new technology threatens your domain, people always rationalize resistance. But the data doesn't lie — three hundred times the efficiency, zero incremental cost. What took you six years to prove — AI did in three days."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He didn't look at me again. Clicked to the next slide.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 2: The Sideline
&lt;/h2&gt;

&lt;p&gt;That afternoon, HR notified me: my team was being reassigned from Quality Assurance to the AI Engineering Group. My reporting line now went through VP Harrison's deputy.&lt;/p&gt;

&lt;p&gt;I walked to his office.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Mr. Harrison. The AI testing tool needs a three-week trial run. I need to verify its behavior under production traffic patterns —"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"You don't need to verify it. I already did."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"What environment?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Staging. One hundred percent pass rate."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Staging doesn't simulate real traffic shapes —"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Are you telling me your four hundred manual test cases are more effective than three thousand AI-generated ones?"&lt;/em&gt; He leaned back. &lt;em&gt;"Do you actually believe that?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I stood at his office door. He didn't ask me to sit.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Your new desk is on the third floor. AI Engineering Group. Report tomorrow."&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 3: The Report
&lt;/h2&gt;

&lt;p&gt;I spent three nights pulling and reviewing all 3,000 AI-generated test cases.&lt;/p&gt;

&lt;p&gt;The AI tool itself wasn't bad. The problem wasn't the technology.&lt;/p&gt;

&lt;p&gt;The problem was the &lt;strong&gt;configuration&lt;/strong&gt;. VP Harrison's deputy had set the input boundary to "90th percentile of historical production data." The AI faithfully generated tests within that boundary. All three thousand cases lived inside the 90th percentile. Inside that range, the AI validated "what the code does according to the config." It couldn't — by design — validate "what the code should do at the boundaries." The config never asked it to look there. &lt;strong&gt;The AI didn't err. It flawlessly executed a flawed instruction set.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I wrote a full analysis report with configuration screenshots and comparative data.&lt;/p&gt;

&lt;p&gt;Sent it to VP Harrison. No CC.&lt;/p&gt;

&lt;p&gt;Twenty-three minutes later, his reply landed:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Noted. The edge scenarios you identified have an estimated probability below 0.3%. Per our risk prioritization framework, we will not allocate resources to cover them. I suggest you focus on learning the new tools rather than finding reasons to reject them."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I read that line twice.&lt;/p&gt;

&lt;p&gt;Then I filed the report into a folder called &lt;code&gt;RCA_2026Q3&lt;/code&gt; and went back to maintaining my test suite.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 4: The Rollout
&lt;/h2&gt;

&lt;p&gt;Three weeks later. The AI tests went live on the main release pipeline.&lt;/p&gt;

&lt;p&gt;VP Harrison published a piece in the company newsletter:&lt;br&gt;
&lt;strong&gt;"Why We Retired Manual Testing — And Why Your Team Might Be Next"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One line found me in the company-wide email:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Some people spent three weeks trying to prove AI wouldn't work. Two weeks in production — zero incidents. Sometimes, what you're resisting isn't the technology's flaws. It's your own insecurity."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The one remaining tester on my team walked over to my desk.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Boss... that part about insecurity. Was that about you?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I closed the email tab.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Two weeks zero incidents. Let's see what week three brings."&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 5: The $700K Breakdown
&lt;/h2&gt;

&lt;p&gt;1:14 AM. PagerDuty lit up like a Christmas tree.&lt;/p&gt;

&lt;p&gt;A module the AI tests had cleared — thanks to that "90th percentile" boundary — hit a data race condition under real traffic. Every AI-generated test ran inside "normal traffic" parameters. Not a single test covered "resource contention when call frequency exceeds threshold." Because the AI's configuration never told it to check.&lt;/p&gt;

&lt;p&gt;Cascading failure. Core transaction pipeline down. Nine hours of data recovery.&lt;/p&gt;

&lt;p&gt;Initial damage: &lt;strong&gt;$700K&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The CTO called an RCA. Meeting time: Monday, 9 AM. Attendees: VP level and above... and me.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 6: The Meeting Room
&lt;/h2&gt;

&lt;p&gt;9 AM. The conference room.&lt;/p&gt;

&lt;p&gt;The CEO walked in. Didn't sit. Stood at the head of the table and placed a printed report on the surface.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Mr. Harrison. You go first."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;VP Harrison cleared his throat.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"This was a tool-level edge case. The AI testing framework lacks built-in detection for this scenario. We've contacted the vendor — the next release will include a fix."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The CEO listened standing. He didn't interrupt.&lt;/p&gt;

&lt;p&gt;When Harrison finished, he closed the folder and looked around the table.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Before we went live — did anyone raise a concern like this?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Silence. VP Harrison said nothing. The CTO studied his laptop.&lt;/p&gt;

&lt;p&gt;Three seconds.&lt;/p&gt;

&lt;p&gt;I opened my notebook.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Yes. One month ago."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The CEO's eyes shifted to me.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"A report analyzing the AI testing tool's configuration — input boundary set at the 90th percentile, leaving twenty-three categories of low-probability, high-impact scenarios uncovered. Including the race condition that caused last night's outage."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;CEO: &lt;em&gt;"Who did you send it to?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Mr. Harrison."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I plugged my laptop into the conference room projector. The email screenshot filled the screen.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"To: Mr. Harrison. Sent: June 7, 11:23 PM."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Several people pulled out their phones to photograph it.&lt;/p&gt;

&lt;p&gt;The CEO glanced at the screen, then back at VP Harrison.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"You received it?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"I did."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"And?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A pause.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"At the time, the assessment was —"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I flipped to the next page.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Noted. The edge scenarios you identified have an estimated probability below 0.3%. Per our risk prioritization framework, we will not allocate resources to cover them. I suggest you focus on learning the new tools rather than finding reasons to reject them."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The CEO read it. Nodded once. No raised voice. No theatrics.&lt;/p&gt;

&lt;p&gt;He placed the report back on the table and looked at VP Harrison.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"He sent it to you. Why didn't you escalate?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;VP Harrison had no answer.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 7: The Aftermath
&lt;/h2&gt;

&lt;p&gt;VP Harrison submitted his resignation two days later.&lt;/p&gt;

&lt;p&gt;Internal memo from the CTO: Quality Assurance restored as an independent division, reporting directly to the CTO. Budget doubled. I was appointed department head.&lt;/p&gt;

&lt;p&gt;That afternoon, I walked back to my old desk on the fifth floor. Still empty. Everything where I'd left it.&lt;/p&gt;

&lt;p&gt;A yellow sticky note on the corner of my monitor. Not mine — left by a former teammate who'd left the company months ago.&lt;/p&gt;

&lt;p&gt;It read: &lt;strong&gt;"Don't let them touch your tests."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I never knew if he meant the VP, or the AI.&lt;/p&gt;

&lt;p&gt;I peeled it off and tucked it into the first page of my notebook.&lt;/p&gt;

&lt;p&gt;I opened the AI testing platform. Typed &lt;code&gt;test_case list --all --source ai&lt;/code&gt;. 3,000 records.&lt;/p&gt;

&lt;p&gt;Select all. Delete. Confirm.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"This action cannot be undone. Delete 3,000 test cases?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Confirm.&lt;/p&gt;

&lt;p&gt;Three thousand cases. Three seconds. Gone.&lt;/p&gt;

&lt;p&gt;The one remaining tester on my team stood behind me.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Boss... you just deleted everything?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The configuration was wrong. Every test was built on a broken foundation. If the foundation is crooked, no test on top of it will save you."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He didn't answer.&lt;/p&gt;

&lt;p&gt;I opened our four hundred automated test cases — six years of writing, one line at a time. Four hundred cases on the legacy system. Zero production incidents in six years.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"What about the new module?"&lt;/em&gt; he asked.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"We write them. Starting today."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;But I didn't close the AI testing platform.&lt;/p&gt;

&lt;p&gt;I opened its test generator. Pasted the new module's API spec. Changed the boundary config from the default 90th percentile to — unlimited. Generate everything. I'll curate.&lt;/p&gt;

&lt;p&gt;The AI generated 87 candidates.&lt;/p&gt;

&lt;p&gt;I reviewed every single one. Kept 42. Deleted 45. Added 8 boundary scenarios the AI never considered.&lt;/p&gt;

&lt;p&gt;Fifty cases merged into our test suite.&lt;/p&gt;

&lt;p&gt;Four hundred human-written, plus fifty AI-assisted — running together.&lt;/p&gt;

&lt;p&gt;The tester looked at the green checkmarks on the dashboard.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Boss... you're using AI too?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The AI isn't the problem. The problem is who decides what it tests."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The VP bought AI as a mask. I use AI as a microscope."&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;"AI makes mistakes. Humans make mistakes. But the worst part is — someone stacks both mistakes on top of each other, then blames it all on the AI."&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;AI-generated tests pass at 100%. They verify what the code does — not what it should do. When the code itself is wrong — AI will prove the wrong code is right.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Have you seen someone package a process failure as a technology failure? What happened next?&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Follow for more stories about AI testing, quality engineering, and what happens when the tools are smarter than the process.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;If these stories made you think or saved you time, &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee ☕&lt;/a&gt; — currently maintaining 400 automated test cases and manually reviewing every AI-generated one. Caffeine is the only config I trust at 💯.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>discuss</category>
      <category>testing</category>
      <category>career</category>
    </item>
    <item>
      <title>Our CTO Guaranteed His $2.8B AI Gateway Was Safe. I Quietly Kept a Different Record. For 8 Months.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Wed, 03 Jun 2026 18:09:56 +0000</pubDate>
      <link>https://dev.to/xulingfeng/our-cto-built-an-ai-gateway-processing-28b-it-took-me-8-months-to-prove-it-would-approve-illegal-235l</link>
      <guid>https://dev.to/xulingfeng/our-cto-built-an-ai-gateway-processing-28b-it-took-me-8-months-to-prove-it-would-approve-illegal-235l</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;A story about formal verification and adversarial testing.&lt;br&gt;
About systems that are mathematically safe, and systems that don't fail in the real world — and which one regulators actually care about.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;strong&gt;Has your company ever made a decision using AI that just felt wrong? A loan that got approved when it shouldn't have. A compliance recommendation that made you do a double-take.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Do you trust the system with the green checkmark that says "verified safe" — or are you quietly running your own tests in the background?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This is that story.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  1 · The Green Signature
&lt;/h2&gt;

&lt;p&gt;Day one. Drew was demoing his AI gateway at the all-hands tech meeting.&lt;/p&gt;

&lt;p&gt;He opened a terminal. Typed a command. Loaded the model. Fed it test data. A line of green text appeared on screen:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;FORMAL VERIFICATION PASSED. This system is mathematically proven safe. No boundary violations possible.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Two seconds of silence. Then someone started clapping.&lt;/p&gt;

&lt;p&gt;Drew Chen, co-founder and principal architect. Stanford PhD, 15 top-tier conference papers. The company's AI decision system was his baby — a formal-verification gateway that constrained every model output. Running for two years, handling over $2.8B in transaction volume.&lt;/p&gt;

&lt;p&gt;The room applauded. Three people raised their hands with questions. Drew answered each one.&lt;/p&gt;

&lt;p&gt;I was sitting in the corner.&lt;/p&gt;

&lt;p&gt;I was hired to replace the previous QA lead. He got laid off — second on the list in the last round of cuts. When I showed up, my desk still had someone else's stuff in it. An unopened bag of tea. A style guide nobody ever read.&lt;/p&gt;

&lt;p&gt;Our QA team, including me: five people. Buried under Ops on the org chart. Not even Engineering.&lt;/p&gt;

&lt;p&gt;A co-founder built the AI gateway that carries $2.8B. He doesn't need QA, because he proved his system is mathematically safe.&lt;/p&gt;

&lt;p&gt;And me? I was a compliance hire. A box the company had to check.&lt;/p&gt;

&lt;p&gt;I didn't know yet that the next eight months would be spent proving that green signature would cause an incident serious enough to bring regulators through the front door.&lt;/p&gt;




&lt;p&gt;Second week. I submitted my test pipeline proposal.&lt;/p&gt;

&lt;p&gt;Adversarial testing. Distribution drift monitoring. Injection attack detection. Inference chain auditing. A complete AI validation framework.&lt;/p&gt;

&lt;p&gt;Drew finished reading in three minutes.&lt;/p&gt;

&lt;p&gt;"Formal verification already covers every possible output path. If an output exceeds the boundary, the gateway blocks it. Everything you're talking about — input injection, distribution drift — it's either allowed inside the boundary or blocked outside. There's no third category."&lt;/p&gt;

&lt;p&gt;"There's a gray zone between inside and outside," I said. "After three months of live service, the model develops feature drift on its own. The boundary constraints don't change — but the semantic layer does. Formal verification doesn't check semantics."&lt;/p&gt;

&lt;p&gt;"That's ops monitoring, not architecture."&lt;/p&gt;

&lt;p&gt;I opened my notebook. Three P0 incidents since the system went live.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Incident one:&lt;/strong&gt; Input distribution drift caused sustained output quality degradation. Triggered on day 52 of live service.&lt;br&gt;
&lt;strong&gt;Incident two:&lt;/strong&gt; Model output was syntactically valid but recommended a decision that violated business rules. Triggered on day 87.&lt;br&gt;
&lt;strong&gt;Incident three:&lt;/strong&gt; Semantic drift in a specific scenario. No boundary was breached. Triggered on day 103.&lt;/p&gt;

&lt;p&gt;Three incidents. All post-deployment. The formal verification signature was valid for every single one.&lt;/p&gt;

&lt;p&gt;Drew glanced at them.&lt;/p&gt;

&lt;p&gt;"These aren't system design problems."&lt;/p&gt;

&lt;p&gt;I closed my notebook.&lt;/p&gt;

&lt;p&gt;He put the report back on the desk. His fingers paused at the edge of the paper. Then he said something, without looking at me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"You've got good data. But data isn't architecture."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I knew he was giving himself an out. I didn't push.&lt;/p&gt;


&lt;h2&gt;
  
  
  2 · Drift
&lt;/h2&gt;

&lt;p&gt;I didn't have the standing to argue with him. So I built my own adversarial testing pipeline on a personal cloud account.&lt;/p&gt;

&lt;p&gt;Non-blocking. Recording only.&lt;/p&gt;

&lt;p&gt;April 2025. The pipeline caught the first AI distribution drift.&lt;/p&gt;

&lt;p&gt;I had been watching Drew's system for three months. Same input class, 1,000 runs. The output syntax was always correct. Parameters in range. Confidence scores passing. But the semantic distribution had shifted — the model's judgment criteria for the same category had &lt;strong&gt;quietly changed&lt;/strong&gt; over those three months.&lt;/p&gt;

&lt;p&gt;The formal verification signature was still valid. The gateway saw nothing wrong.&lt;/p&gt;

&lt;p&gt;I sent the report to Drew.&lt;/p&gt;

&lt;p&gt;24 hours later, his reply came back. He confirmed the drift. But:&lt;/p&gt;

&lt;p&gt;"This is normal model evolution. The architecture allows ±3% semantic drift. I've added an input normalization layer — should fix it."&lt;/p&gt;

&lt;p&gt;He fixed it over a weekend. Drift metrics zeroed out.&lt;/p&gt;

&lt;p&gt;But I knew this wasn't a fluke. The root cause was the model's online learning mechanism adjusting parameters without any guardrails. He fixed one tree. The forest was still burning.&lt;/p&gt;

&lt;p&gt;I didn't say anything. I just added a trigger to the pipeline:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;「When drift accumulates past threshold, auto-save a complete audit snapshot.」&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Then I archived 5,000 test records, dozens of distribution comparison charts, and a full traceability script — all to an encrypted volume outside company storage.&lt;/p&gt;

&lt;p&gt;Not to expose anyone. So that if things went wrong, someone would know what happened.&lt;/p&gt;


&lt;h2&gt;
  
  
  3 · Immunity
&lt;/h2&gt;

&lt;p&gt;After the drift incident, I wired my pipeline into CI. Every PR ran through it before merge. No pass, no deploy.&lt;/p&gt;

&lt;p&gt;The dev team hated it within a week. A PR that took minutes to unit-test now took almost four times longer. Minor issues blocked merges, waiting for me to review, waiting for Drew to decide.&lt;/p&gt;

&lt;p&gt;June 2025. Drew walked straight into the CEO's office.&lt;/p&gt;

&lt;p&gt;His argument: the QA team's adversarial testing was slowing releases by 4x. Every minor anomaly triggered a change review. The dev team was losing it.&lt;/p&gt;

&lt;p&gt;The CEO signed off: &lt;strong&gt;Quality Immunity&lt;/strong&gt;. Drew's architecture would assume full quality responsibility. External testing pipelines were advisory only — not release-blocking.&lt;/p&gt;

&lt;p&gt;Drew announced it at the weekly standup. He looked at me as he said it. Not a challenge. Just a statement of fact.&lt;/p&gt;

&lt;p&gt;"I guarantee the system is safe."&lt;/p&gt;

&lt;p&gt;I said nothing. Went back to my desk. Switched my pipeline to passive mode. Record only, don't block.&lt;/p&gt;

&lt;p&gt;Worked faster that way.&lt;/p&gt;

&lt;p&gt;Within three weeks, the pipeline logged:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;7 minor drifts that CI self-monitoring hadn't caught&lt;/li&gt;
&lt;li&gt;2 data mismatches between test and production environments&lt;/li&gt;
&lt;li&gt;1 label leak in a staging environment&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I wrote a report. Put it in my drawer.&lt;/p&gt;

&lt;p&gt;I didn't send it. The CEO didn't even know my name.&lt;/p&gt;

&lt;p&gt;But here's the interesting part: Drew started adding adversarial training modules to the architecture. He added distribution drift metrics to the monitoring dashboard. He told the CEO, "We've strengthened our internal AI validation."&lt;/p&gt;

&lt;p&gt;I watched my data show up in Drew's slides.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;He said he didn't believe in this approach. His hands were already building it.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What does that tell you?&lt;/p&gt;

&lt;p&gt;It tells me he knew formal verification wasn't enough. But he couldn't say it. Because he built this system. If he admitted it wasn't sufficient, then every decision made on this system for the past two years would be questionable.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;He wasn't arguing with me. He was defending his own legacy.&lt;/strong&gt;&lt;/p&gt;


&lt;h2&gt;
  
  
  4 · Inside the Boundary
&lt;/h2&gt;

&lt;p&gt;September 2025.&lt;/p&gt;

&lt;p&gt;The company's AI system handles intelligent approvals in a financial compliance context — loan applications, risk recommendations, compliance suggestions. Tens of thousands per day. At that volume, $2.8B in two years makes sense.&lt;/p&gt;

&lt;p&gt;The incident took seconds.&lt;/p&gt;

&lt;p&gt;The on-call engineer saw it first — a compliance score ticked up to 92% on the monitoring screen. Glanced at it. Moved on. He didn't know that score corresponded to a loan that should never have been approved.&lt;/p&gt;

&lt;p&gt;The system generated a compliance recommendation for a customer's loan application. Syntax correct. Format valid.&lt;/p&gt;

&lt;p&gt;The recommendation violated regulations.&lt;/p&gt;

&lt;p&gt;Drew's architecture didn't alert. The boundary was intact. Format was clean. Confidence was passing. The system saw nothing wrong.&lt;/p&gt;

&lt;p&gt;My pipeline had flagged this input 48 hours earlier — its distribution characteristics had deviated significantly from the top 1% of the training set. The trigger was lit. But during the immunity period, nobody processed it.&lt;/p&gt;

&lt;p&gt;The customer got the result and found the problem. They had their own compliance team.&lt;/p&gt;

&lt;p&gt;They reported it to the regulator.&lt;/p&gt;

&lt;p&gt;Three days later, the regulator's representatives showed up at the front desk.&lt;/p&gt;


&lt;h2&gt;
  
  
  5 · The Audit Trail
&lt;/h2&gt;

&lt;p&gt;They didn't ask for architecture documents.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Complete model version history — what model, what parameters, when it changed&lt;/li&gt;
&lt;li&gt;Audit owner for every change — who approved it, based on what data&lt;/li&gt;
&lt;li&gt;Traceable inference chains — how each decision was made, what version it rolled back to&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Drew's architecture team spent three days trying to pull inference records from the logging system.&lt;/p&gt;

&lt;p&gt;Nothing.&lt;/p&gt;

&lt;p&gt;Because the design philosophy of a formal verification gateway is: input → check boundary → output. It doesn't record decision paths. It records "verified" or "not verified." If it's inside the boundary, it passes. If outside, it blocks. There's no "it's inside the boundary but semantically wrong" state.&lt;/p&gt;

&lt;p&gt;Kate was the compliance officer. She came to find me.&lt;/p&gt;

&lt;p&gt;"Does your system keep an audit trail?"&lt;/p&gt;

&lt;p&gt;"From day one."&lt;/p&gt;

&lt;p&gt;"And Drew's?"&lt;/p&gt;

&lt;p&gt;I didn't answer directly. I opened my laptop and showed her.&lt;/p&gt;

&lt;p&gt;My pipeline recorded a chain:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Input sample → Distribution check ✓ → Model inference → Drift detection → 
Semantic validation → Compliance check → Audit snapshot → Encrypted storage
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Every step with a timestamp, model version, parameter hash. Started eight months ago.&lt;/p&gt;

&lt;p&gt;Kate looked at it for five minutes. Then she said:&lt;/p&gt;

&lt;p&gt;"Send this to the regulator's legal counsel. I want confirmation they've received it by nine tomorrow morning."&lt;/p&gt;




&lt;p&gt;At the hearing, Drew made one last effort.&lt;/p&gt;

&lt;p&gt;"Formal verification is the industry-accepted gold standard for AI safety," he said. "Our verification approach is at the highest academic and engineering level available."&lt;/p&gt;

&lt;p&gt;The regulator's representative flipped through his report.&lt;/p&gt;

&lt;p&gt;"100% pass rate. Covers all boundary conditions."&lt;/p&gt;

&lt;p&gt;The representative set the report down.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"I understand. But regulation doesn't require mathematically perfect systems. Regulation requires systems that can be investigated when something goes wrong."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Your system, by design, cannot be investigated."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The room was quiet for a long time.&lt;/p&gt;

&lt;p&gt;Drew didn't say another word.&lt;/p&gt;




&lt;h2&gt;
  
  
  6 · The Door
&lt;/h2&gt;

&lt;p&gt;Here's how it landed.&lt;/p&gt;

&lt;p&gt;Drew kept his title and his team.&lt;/p&gt;

&lt;p&gt;But his technical decision-making authority was transferred to a new AI governance committee. I was put in charge of it.&lt;/p&gt;

&lt;p&gt;Adversarial testing pipeline was formally integrated into the company's AI governance framework. Equal on paper. In practice, the center of gravity had shifted.&lt;/p&gt;

&lt;p&gt;A few days after the results came out. In the hallway.&lt;/p&gt;

&lt;p&gt;Drew stopped me. First time in eight months he used my name.&lt;/p&gt;

&lt;p&gt;"&lt;strong&gt;Ryan&lt;/strong&gt;. You won this round. But your method has no mathematical guarantee."&lt;/p&gt;

&lt;p&gt;"You're right."&lt;/p&gt;

&lt;p&gt;"Mathematically safe systems and systems that don't fail in the real world — they're not the same thing."&lt;/p&gt;

&lt;p&gt;He stopped walking. Looked at me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Your AI doesn't make mistakes — inside the boundaries you defined for it."&lt;/strong&gt; I said.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The problem is — after deployment, the AI walked itself right out of those boundaries."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"And you didn't even install a door."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He stood there.&lt;/p&gt;

&lt;p&gt;I walked away.&lt;/p&gt;

&lt;p&gt;Heard later that he redesigned the architecture. Added an audit logging layer.&lt;/p&gt;

&lt;p&gt;Not because formal verification couldn't do it. Because he never thought to put a door on it in the first place.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;You can't prove a system is safe. What you can prove is: when it isn't — you're watching.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Formal verification says: "My math can't be wrong."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Adversarial testing says: "Your math doesn't tell me when it will change."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;When the auditors show up — which one do they look at?&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;I couldn't install a door in Drew's system. But the stories stay open. &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee&lt;/a&gt; ☕&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>discuss</category>
      <category>testing</category>
    </item>
    <item>
      <title>I Asked for $500/Month and got turned down. My Company Spent $470K on AI Instead. Then I Quit.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Wed, 03 Jun 2026 08:11:00 +0000</pubDate>
      <link>https://dev.to/xulingfeng/i-asked-for-500month-my-company-spent-470k-on-ai-instead-then-i-quit-38pd</link>
      <guid>https://dev.to/xulingfeng/i-asked-for-500month-my-company-spent-470k-on-ai-instead-then-i-quit-38pd</guid>
      <description>&lt;h1&gt;
  
  
  I asked for $500/month and got turned down. The company spent $470K on an AI platform instead. Then I quit.
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;A story about what $500 couldn't buy, what $470K couldn't fix, and an 87-day countdown on a door that wouldn't open again.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;strong&gt;Your company bought AI this year. How much did it cost?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Are you sure that money bought technology — or did it buy a "we're doing AI too" slide deck, and a nicer-looking résumé for someone else?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;I asked for $500/month and got turned down. The company spent $470K on an AI platform instead. Then I quit.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;1&lt;/p&gt;

&lt;p&gt;The pay cut notice went out to the whole department by email.&lt;/p&gt;

&lt;p&gt;Red title bar. Three lines of body text. "Optimizing cost structure — evaluating role efficiency." My name was seventh on the list. Not first. Not last.&lt;/p&gt;

&lt;p&gt;30%.&lt;/p&gt;

&lt;p&gt;I closed the email and went back to fixing the log.&lt;/p&gt;

&lt;p&gt;Mike, at the desk next to mine, pushed his keyboard away. "Thirty percent? Are they serious?"&lt;/p&gt;

&lt;p&gt;I didn't answer.&lt;/p&gt;

&lt;p&gt;He turned to look at me. "You're not mad?"&lt;/p&gt;

&lt;p&gt;"I was mad," I said. "Past tense."&lt;/p&gt;

&lt;p&gt;— I wasn't mad &lt;em&gt;today&lt;/em&gt;. I was mad three months ago, when the VP announced at an all-hands meeting that the company was buying an AI platform for $470K a year. Someone checked quietly — similar platforms on the market cost about a third of that.&lt;/p&gt;

&lt;p&gt;After the meeting I followed the VP to his office and put a budget request on his desk.&lt;/p&gt;

&lt;p&gt;Cloud servers: $500/month. A model validation pipeline. Stress tests, evaluation runs, regression checks — the full stack. Two pages, every line item accounted for.&lt;/p&gt;

&lt;p&gt;The VP flipped through it for three seconds.&lt;/p&gt;

&lt;p&gt;"Not needed. The AI platform comes with built-in validation."&lt;/p&gt;

&lt;p&gt;"That's black-box testing. Feed in data, get a score. A pipeline needs traceability — which model layer, what parameters, what version of the eval script."&lt;/p&gt;

&lt;p&gt;"I hear you. But the budget won't clear." He slid the request back across the desk, like he was mentioning something that didn't really involve him. "Run a prototype on your own account first. Get some results, then we'll talk about budget."&lt;/p&gt;

&lt;p&gt;I looked at him.&lt;/p&gt;

&lt;p&gt;He knew those words were empty. I knew he knew. Neither of us said anything else.&lt;/p&gt;

&lt;p&gt;For the next three months, the entire floor was busy with the vendor's platform deployment. Nobody paid attention to what I was doing. I spent a weekend putting the pipeline together and the rest of the time feeding it data.&lt;/p&gt;

&lt;p&gt;"We'll talk about budget" — the VP never brought it up again.&lt;/p&gt;

&lt;p&gt;2&lt;/p&gt;

&lt;p&gt;The VP hired someone to take over the AI platform — Mark. The whole engineering team's AI work was reassigned to him, including mine. In his first week, he put up a slide deck on the big screen.&lt;/p&gt;

&lt;p&gt;Three slides. First one: market size. Second one: customer logos. Third one: one sentence.&lt;/p&gt;

&lt;p&gt;"— Covers approximately 72% of the engineering team's routine development work."&lt;/p&gt;

&lt;p&gt;That evening I came back to my desk and found Mark standing next to it — holding a piece of paper.&lt;/p&gt;

&lt;p&gt;"Unauthorized Systems List." He set it down next to my keyboard. One line was highlighted in yellow: &lt;em&gt;Model Validation Pipeline / Registered Under: Personal Account / Budget Allocation: None.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;"From the IT audit. No purchase record, no license, personal account — I'm guessing you haven't read the company's compliance policy."&lt;/p&gt;

&lt;p&gt;"You can check with the VP," I said. "He told me to run the prototype three months ago."&lt;/p&gt;

&lt;p&gt;Mark didn't respond to that.&lt;/p&gt;

&lt;p&gt;He glanced at my screen — the stress test I'd been running for the last twenty minutes.&lt;/p&gt;

&lt;p&gt;"The VP resigned," he said. Quietly. Like he was stating something that had nothing to do with him.&lt;/p&gt;

&lt;p&gt;The VP left without a goodbye email, without a handover meeting. One day he was at his desk. The next day it was empty.&lt;/p&gt;

&lt;p&gt;I didn't react.&lt;/p&gt;

&lt;p&gt;"You registered this on a free personal tier?"&lt;/p&gt;

&lt;p&gt;I didn't look up.&lt;/p&gt;

&lt;p&gt;He stood there for two seconds, folded the paper, and put it back in his pocket. He took three steps toward the door, stopped, and said without turning around:&lt;/p&gt;

&lt;p&gt;"You can keep it around. As a reference."&lt;/p&gt;

&lt;p&gt;Then he walked out.&lt;/p&gt;

&lt;p&gt;I watched him disappear around the corner.&lt;/p&gt;

&lt;p&gt;The platform had been live for a month. Every week, Mark sent out a progress email with a dashboard screenshot — model accuracy at 98.7%, response latency under 200ms, coverage climbing steadily. The numbers looked better every time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;But every number was the platform testing itself. Its own test set, on its own models, producing its own report. Not a single data point from actual production feedback.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The only data set in the whole company that could independently validate whether a model actually worked — that was on my cloud server. The test suite had been built from five years of production failures, one incident at a time. Every label hand-reviewed. Every model that passed it, you knew exactly how it would perform on real traffic.&lt;/p&gt;

&lt;p&gt;Mark knew how important this thing was — it was outside his audit list, and that's exactly why he needed it. &lt;strong&gt;He was auditing my pipeline. What he really needed was five years of my mistakes.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I closed my laptop and went to the rooftop.&lt;/p&gt;

&lt;p&gt;Two empty chairs and a waist-high wall. I sat there for fourteen minutes. The wind was strong enough to keep my shirt collar flapping.&lt;/p&gt;

&lt;p&gt;My phone rang. I thought it was a delivery.&lt;/p&gt;

&lt;p&gt;It wasn't. It was a recruiter.&lt;/p&gt;

&lt;p&gt;"Simon? This is Kate. Mia suggested I reach out — she said you might be exactly who we're looking for."&lt;/p&gt;

&lt;p&gt;Mia. A recruiter I'd worked with six months ago. She'd put me forward for two positions, neither worked out. But she said better opportunities would come in three months.&lt;/p&gt;

&lt;p&gt;Three months.&lt;/p&gt;

&lt;p&gt;"What kind of role?"&lt;/p&gt;

&lt;p&gt;"AI infrastructure. VP level."&lt;/p&gt;

&lt;p&gt;"And the comp?"&lt;/p&gt;

&lt;p&gt;She said a number.&lt;/p&gt;

&lt;p&gt;Then she said: "Want to think about it?"&lt;/p&gt;

&lt;p&gt;"Don't need to. Send me the JD."&lt;/p&gt;

&lt;p&gt;I hung up and stood on the rooftop for a long time. You could see the whole campus from up there. Four buildings. I was in Building Two, fourth floor, west side — one of those windows was my desk for five years. Five years. How many PRs had I written? How many bugs had I fixed? How many internal tools had I built? I'd lost count. But the company remembered a different number: what percentage of my work could be automated.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;72%.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;One number erased every line of code I'd written in five years.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I didn't look at the windows again. I walked back down to my desk.&lt;/p&gt;

&lt;p&gt;The keyboard was still warm. The log was still running.&lt;/p&gt;

&lt;p&gt;A few days later, I took a half-day off.&lt;/p&gt;

&lt;p&gt;3&lt;/p&gt;

&lt;p&gt;The interview was in a small conference room at the new company.&lt;/p&gt;

&lt;p&gt;No slide decks, no market sizes. The engineering director pushed a laptop across the table with a production incident reproduced on screen: an AI model's recommendation accuracy had dropped from &lt;strong&gt;93.7% to 68.2%&lt;/strong&gt; over two weeks. &lt;strong&gt;No errors in the inference logs. No memory leaks. No latency changes.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;He asked: "How would you investigate this?"&lt;/p&gt;

&lt;p&gt;I looked at it for three minutes. Three layers.&lt;/p&gt;

&lt;p&gt;First, the feature computation pipeline. A text processing library the pipeline depended on had received an auto-patch two weeks earlier. &lt;strong&gt;The patch fixed an edge-case bug — but it also changed the vector mapping for those inputs.&lt;/strong&gt; The edge cases had been a tiny fraction of traffic, barely affecting overall accuracy. But after the patch, the routing shifted. More and more inputs got mapped to the new feature space, and accuracy drifted down, slowly, invisibly.&lt;/p&gt;

&lt;p&gt;Second, the input monitor — was the online pipeline running a real-time feature distribution comparison against the training pipeline? A threshold breaker that would shut things down automatically? No.&lt;/p&gt;

&lt;p&gt;Third, the human fallback — below a confidence threshold, was there a manual review flow? No. Fully automated. Full black box. Nobody knew anything was wrong until the incident hit. &lt;strong&gt;An auto-patch had silently changed the path the model was using to read its own data.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;I wrote seven lines of pseudocode. One checkpoint per layer.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The director read it. Then he said one sentence:&lt;/p&gt;

&lt;p&gt;"When can you start?"&lt;/p&gt;

&lt;p&gt;"A month. Handover."&lt;/p&gt;

&lt;p&gt;"What's your notice period?"&lt;/p&gt;

&lt;p&gt;"Four weeks."&lt;/p&gt;

&lt;p&gt;He picked up his phone, sent a message. Thirty seconds later he put it down.&lt;/p&gt;

&lt;p&gt;"We'll buy out your notice period. The offer starts the day you sign. Your salary runs from today."&lt;/p&gt;

&lt;p&gt;I stared at him.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Your time is worth more than the company you're at. I'm not waiting four weeks."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I said: "Deal."&lt;/p&gt;

&lt;p&gt;4&lt;/p&gt;

&lt;p&gt;Two weeks later, the offer was signed. Sixty percent more than my current salary. No fancy title — "VP" was just a placeholder; I'd start with a team of three.&lt;/p&gt;

&lt;p&gt;No need to serve the notice period — the new company bought it out in full. I nodded to HR, signed the handover documents, and that was it.&lt;/p&gt;

&lt;p&gt;I sat at my desk, let the last log finish, committed the code, and closed my laptop.&lt;/p&gt;

&lt;p&gt;Then I packed. A mug. A few technical books. Sticky notes that hadn't run out. No framed photos, no souvenirs, no "5-year anniversary" plaque — that was at the bottom of a drawer somewhere. I didn't even remember to take it.&lt;/p&gt;

&lt;p&gt;I walked to the elevator and pressed B2. The doors closed. Floor 4. Floor 3.&lt;/p&gt;

&lt;p&gt;The elevator stopped at 3. The doors opened. Mark was standing there.&lt;/p&gt;

&lt;p&gt;He didn't step in. He looked at my backpack.&lt;/p&gt;

&lt;p&gt;"You're leaving?"&lt;/p&gt;

&lt;p&gt;"Yeah."&lt;/p&gt;

&lt;p&gt;He stood there for two seconds, then walked in. The doors closed. He didn't look at me — he watched the floor numbers change.&lt;/p&gt;

&lt;p&gt;"That pipeline of yours — the test data and labels — where are they stored?"&lt;/p&gt;

&lt;p&gt;In three months, this was the first technical question he'd ever asked me. Not when he was standing in front of the whole engineering team with his audit list. Now. In an elevator. Nobody else.&lt;/p&gt;

&lt;p&gt;I didn't answer.&lt;/p&gt;

&lt;p&gt;"How do I get access after you're gone?"&lt;/p&gt;

&lt;p&gt;"You don't have permissions."&lt;/p&gt;

&lt;p&gt;"I can add you as a collaborator — "&lt;/p&gt;

&lt;p&gt;"Personal account. One owner. I don't add you, nobody gets in. Call IT, call support — the account is registered under my personal email, and support only talks to me."&lt;/p&gt;

&lt;p&gt;He was quiet. Two seconds. Three seconds.&lt;/p&gt;

&lt;p&gt;The elevator went from floor 3 to floor 2.&lt;/p&gt;

&lt;p&gt;"...How many days do we have?"&lt;/p&gt;

&lt;p&gt;"Key rotation every ninety days. You've got eighty-seven."&lt;/p&gt;

&lt;p&gt;He didn't say anything else.&lt;/p&gt;

&lt;p&gt;The elevator reached B2. I stepped out. He held the door open with one hand — didn't let it close. He watched me walk away and said something that wasn't to me — it was to the elevator wall:&lt;/p&gt;

&lt;p&gt;"Eighty-seven days."&lt;/p&gt;

&lt;p&gt;The doors closed.&lt;/p&gt;

&lt;p&gt;5&lt;/p&gt;

&lt;p&gt;I was about ten steps into the B2 corridor when I heard footsteps behind me — not walking, running.&lt;/p&gt;

&lt;p&gt;I turned around.&lt;/p&gt;

&lt;p&gt;The CEO was coming down the stairwell.&lt;/p&gt;

&lt;p&gt;"Wait."&lt;/p&gt;

&lt;p&gt;I looked at him.&lt;/p&gt;

&lt;p&gt;"Who's poaching you?" His voice was quiet. Not an accusation — he was still catching his breath.&lt;/p&gt;

&lt;p&gt;"I signed."&lt;/p&gt;

&lt;p&gt;"I know. Who?"&lt;/p&gt;

&lt;p&gt;I didn't answer. He stood at the stairwell door for two seconds, caught his breath, and walked toward me.&lt;/p&gt;

&lt;p&gt;"I said, who? Give me a name. I'll double it — I'll approve the budget myself. Tell me where you're going right now."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;He didn't say "don't leave." He didn't say "we need you." He said "tell me the name of my competitor."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That told me more than any farewell speech ever could.&lt;/p&gt;

&lt;p&gt;I looked him in the eye for a moment.&lt;/p&gt;

&lt;p&gt;"That AI platform — the one Mark is running — the free trial ends in thirty days. Renewal is $470K a year. You knew that, right?"&lt;/p&gt;

&lt;p&gt;He didn't answer.&lt;/p&gt;

&lt;p&gt;"When it went live, it was all promises. Now the trial is almost up — has anyone actually verified those dashboard numbers?"&lt;/p&gt;

&lt;p&gt;I stepped forward. He stepped back — not because I pushed him, but because he moved.&lt;/p&gt;

&lt;p&gt;"Whatever your new company is paying, I'll double it. Report to me directly. No layers."&lt;/p&gt;

&lt;p&gt;I stopped and looked at him.&lt;/p&gt;

&lt;p&gt;"You signed the pay cut notice. The 30% one."&lt;/p&gt;

&lt;p&gt;He froze. "That was company-wide — "&lt;/p&gt;

&lt;p&gt;"I know. You didn't stop it."&lt;/p&gt;

&lt;p&gt;He had nothing to say.&lt;/p&gt;

&lt;p&gt;I said:&lt;/p&gt;

&lt;p&gt;"The new company bought out my notice period. The three minutes you've spent talking to me — you just cost them."&lt;/p&gt;

&lt;p&gt;He didn't say another word.&lt;/p&gt;

&lt;p&gt;I held his gaze for two seconds, then turned and kept walking toward the parking garage exit.&lt;/p&gt;

&lt;p&gt;After a few steps, I looked back. He hadn't followed. He was still standing at the stairwell door, tie crooked, one hand on his knee.&lt;/p&gt;

&lt;p&gt;The lights were on at B1. He hadn't taken the elevator — he'd run down the stairs.&lt;/p&gt;

&lt;p&gt;I didn't look back again.&lt;/p&gt;

&lt;p&gt;6&lt;/p&gt;

&lt;p&gt;Week four at the new company. My tool passed the full end-to-end stress test.&lt;/p&gt;

&lt;p&gt;The engineering director stopped me in the hallway and said exactly one sentence:&lt;/p&gt;

&lt;p&gt;"Twelve-week probation — you passed it in four. Want an extra equity grant?"&lt;/p&gt;

&lt;p&gt;"The third architecture proposal is still running. Results next week. Let's talk then."&lt;/p&gt;

&lt;p&gt;He nodded and walked off.&lt;/p&gt;

&lt;p&gt;Before I left, I scrolled past a message from an ex-colleague. The old company's AI platform free trial had expired. They didn't renew. Mark's team had submitted three renewal requests — all three were kicked back. The last one had three words in the approval field: "ROI?"&lt;/p&gt;

&lt;p&gt;I opened the AI platform's website. The VP's photo was on the team page. Third screen down.&lt;/p&gt;

&lt;p&gt;"72% coverage."&lt;/p&gt;

&lt;p&gt;What was left was the 28% that nobody else could do.&lt;/p&gt;

&lt;p&gt;I didn't reply to the message. I locked my screen and packed up. The pipeline would break sooner or later. Whether Mark ever got the data — I don't know.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What $500/month couldn't buy, $470K a year couldn't buy either.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The wind on the rooftop is the same everywhere.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Some things don't show up in a budget request. They're not in the slide deck, and they're not in the purchase order. That doesn't mean they don't exist.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;So — are you the VP? Are you Mark? Are you the CEO? Or are you me?&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Follow for more stories about AI, engineering, and what happens when code meets corporate reality.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>discuss</category>
      <category>career</category>
    </item>
    <item>
      <title>I Built Open-Source AI. Our New CTO Spent $8M on His Old Company's Product and Fired My Team. Two Weeks Later, the CEO Called.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Tue, 02 Jun 2026 08:40:51 +0000</pubDate>
      <link>https://dev.to/xulingfeng/i-built-open-source-ai-our-new-cto-spent-8m-on-his-old-companys-product-and-fired-my-team-two-3jp8</link>
      <guid>https://dev.to/xulingfeng/i-built-open-source-ai-our-new-cto-spent-8m-on-his-old-companys-product-and-fired-my-team-two-3jp8</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Based on a real production incident. About an $8M AI platform sourced from the new CTO's former employer. A legacy system that ran five years without a single P0. And what happens when nobody remembers where the load-bearing walls are.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;strong&gt;1&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The news broke on a Friday.&lt;/p&gt;

&lt;p&gt;Henry was called to HR first. Came back with a piece of paper. White-faced.&lt;/p&gt;

&lt;p&gt;"Severance package. Sign. Out."&lt;/p&gt;

&lt;p&gt;Mike went second. Walked out and tossed his badge on the desk.&lt;/p&gt;

&lt;p&gt;"Five years. Huh."&lt;/p&gt;

&lt;p&gt;Then Jay, our ops engineer. Will, our tester. Chris, our frontend guy.&lt;/p&gt;

&lt;p&gt;One by one. One piece of paper each.&lt;/p&gt;

&lt;p&gt;Finally, HR called me. She didn't look at me.&lt;/p&gt;

&lt;p&gt;"Leo. Company decision — the entire backend team is being optimized out. Your position is on the list."&lt;/p&gt;

&lt;p&gt;I said, "I ran a six-month AI pilot that was internally validated. What about that?"&lt;/p&gt;

&lt;p&gt;"…That's a management decision."&lt;/p&gt;

&lt;p&gt;"Whose decision?"&lt;/p&gt;

&lt;p&gt;She didn't answer.&lt;/p&gt;

&lt;p&gt;"Severance package. Fine."&lt;/p&gt;

&lt;p&gt;That afternoon, our team of six had our last round at the pub downstairs.&lt;/p&gt;

&lt;p&gt;Mike raised his glass: "Leo. That six-month AI pilot — I really wanted to see it go live."&lt;/p&gt;

&lt;p&gt;Henry said, "Go live? The code's already been wiped clean. They don't even know what we built."&lt;/p&gt;

&lt;p&gt;I smiled. Didn't say a word.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;2&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Three weeks earlier. The all-hands meeting.&lt;/p&gt;

&lt;p&gt;The CEO stood on stage. Next to him, a man I'd never seen before. Navy blazer. Metal-rimmed glasses. "Brought in from outside" written all over him.&lt;/p&gt;

&lt;p&gt;The slide read: &lt;strong&gt;"AI First — A Complete Technology Architecture Upgrade."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"We're honored to welcome James — former AI VP at a major tech company. As of today, he's our new CTO. He's going to lead us into the next era."&lt;/p&gt;

&lt;p&gt;The room applauded. Our table didn't.&lt;/p&gt;

&lt;p&gt;James took the mic. Scanned the room — his gaze paused on us for two seconds.&lt;/p&gt;

&lt;p&gt;"I spent a week reviewing our current system. Let me be blunt — it's a pile of 2021 relics. It runs. It's stable. So what? You have no idea what AI can do today. This stack can't handle it."&lt;/p&gt;

&lt;p&gt;He clicked to the next slide. A gear getting crushed by an AI chip.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Two weeks. Full AI replacement. Old system — wiped clean, rebuilt from zero."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Mike muttered beside me: &lt;strong&gt;"Bullshit."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;James heard him. Turned, looked at Mike, and smiled. "Something to add?"&lt;/p&gt;

&lt;p&gt;Mike didn't answer. But his face was red.&lt;/p&gt;

&lt;p&gt;I didn't say anything either.&lt;/p&gt;

&lt;p&gt;Because that system — I wrote it.&lt;/p&gt;

&lt;p&gt;Five years ago. By myself. No help. No oversight. Three months of all-nighters, and I took the entire business line from zero to profit. The company went from losing money to making money. Not a single P0 the whole time.&lt;/p&gt;

&lt;p&gt;And now someone was calling it "antique."&lt;/p&gt;

&lt;p&gt;After the meeting, I went back to my desk. Packed the entire legacy system from the production servers onto my old laptop — the one I bought myself back when I was writing it. 500GB. Stuffed it in a drawer.&lt;/p&gt;

&lt;p&gt;Not for revenge. Just — someone needed to remember how this thing was built.&lt;/p&gt;

&lt;p&gt;Our team had six people.&lt;/p&gt;

&lt;p&gt;Mike, backend — four years with me, his code more reliable than documentation.&lt;br&gt;
Jay, ops — would wake up at 3 AM to check an alert.&lt;br&gt;
Will, testing — ran full-load stress tests before every release, no exceptions.&lt;br&gt;
Chris, frontend — once he fixed a bug, it never came back.&lt;br&gt;
Henry — the youngest, but six months ago he became the lead on our AI pilot.&lt;/p&gt;

&lt;p&gt;That's right. Six months ago, we'd already started working on AI. Not AI as a slogan. Real AI. Line by line. Version by version.&lt;/p&gt;

&lt;p&gt;Back then, the previous CTO was still around. We proposed a plan: &lt;strong&gt;don't replace the legacy system, but add an AI assistance layer on critical paths.&lt;/strong&gt; He approved a pilot.&lt;/p&gt;

&lt;p&gt;Three months. Two business scenarios — order routing and anomaly detection. All based on open-source models, fine-tuned in-house. No procurement. No budget approval. We labeled the data ourselves, trained ourselves, tested ourselves. The numbers — I remember every single one:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Intent recognition accuracy: 91.3%&lt;/li&gt;
&lt;li&gt;Average response latency: 180ms&lt;/li&gt;
&lt;li&gt;Zero hardware procurement cost (ran on existing server idle capacity)&lt;/li&gt;
&lt;li&gt;Three-person cross-validation consistency: 98.2%&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The results passed internal validation. The previous CTO approved the next phase.&lt;/p&gt;

&lt;p&gt;Then he was restructured out. And in came this guy.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;3&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Two weeks ago.&lt;/strong&gt; The technical strategy meeting. James asked me to attend.&lt;/p&gt;

&lt;p&gt;I brought six months of our team's data.&lt;/p&gt;

&lt;p&gt;"James, this is our team's AI pilot results from the past six months. All open-source. Zero procurement cost."&lt;/p&gt;

&lt;p&gt;I put the comparison table on screen:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Our Pilot&lt;/th&gt;
&lt;th&gt;Their Proposal (Initial Quote)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Intent recognition accuracy&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;91.3%&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Claimed ≥95%, actual &lt;strong&gt;76.8%&lt;/strong&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Avg response latency&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;180ms&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;2.1s&lt;/strong&gt; (peak 3.8s)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Daily concurrent load&lt;/td&gt;
&lt;td&gt;4,200 (already running)&lt;/td&gt;
&lt;td&gt;Not live yet, promised 5,000&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Inference hardware cost&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;$0&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;$3.8M&lt;/strong&gt; (4 GPU servers + rack)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Model licensing cost&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;$0&lt;/strong&gt; (open-source)&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;$4.2M&lt;/strong&gt; (first year)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Time to production&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Already running 3 months&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Promised 4-6 months&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;P0 incidents&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Not live, no data&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;"91% versus 76%. 180 milliseconds versus 2.1 seconds. Zero dollars versus $8M. James — I'm not against AI. I'm against spending $8M on something that performs worse than what we already built."&lt;/p&gt;

&lt;p&gt;James stared at the screen for five seconds. Then he smiled. The same smile from the all-hands meeting — pity from above.&lt;/p&gt;

&lt;p&gt;"Leo. That 91% — you ran it on your own 2,000-label dataset. Do you know what production data distribution looks like? What long-tail cases will do to that 91%?"&lt;/p&gt;

&lt;p&gt;Henry stood up. "James, those were production logs. Full de-sensitized data from Q3 last year to Q1 this year. 470,000 records. Labeled by our team."&lt;/p&gt;

&lt;p&gt;James turned to him. "Your labels? Who validated them?"&lt;/p&gt;

&lt;p&gt;"Three-person cross-validation. 98.2% consistency."&lt;/p&gt;

&lt;p&gt;James paused. Changed angle. "And that 180ms — how many machines were running it?"&lt;/p&gt;

&lt;p&gt;"One. Two cores from an existing application server."&lt;/p&gt;

&lt;p&gt;James's throat moved. He didn't answer.&lt;/p&gt;

&lt;p&gt;Then he said the line I'll never forget:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Leo. I'm not dismissing what you did. But — putting a smart lock on an old house doesn't make it a smart home."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Before I could respond, Henry's voice cut through: "James, then tell us — where exactly are we wrong? If the data is wrong, point to the data. If the approach is wrong, point to the approach. &lt;strong&gt;You're not saying no — you're saying you don't want to look.&lt;/strong&gt;"&lt;/p&gt;

&lt;p&gt;The room went quiet. James stared at Henry. Henry stared back.&lt;/p&gt;

&lt;p&gt;Finally James turned to me. "Leo. I respect what your team did. But what this company needs isn't patching. It's rebuilding."&lt;/p&gt;

&lt;p&gt;I wanted to say: &lt;strong&gt;Rebuilding and replacing are two different things.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;But I didn't.&lt;/p&gt;

&lt;p&gt;Because I saw his slide close — and there it was in the bottom-right corner. The procurement department's approval number.&lt;/p&gt;

&lt;p&gt;I knew that format. Eight digits.&lt;/p&gt;

&lt;p&gt;$8 million.&lt;/p&gt;

&lt;p&gt;And I knew the vendor — his former company's product.&lt;/p&gt;

&lt;p&gt;From that day, James's team took over the production environment. First order of business — clean up the old code.&lt;/p&gt;

&lt;p&gt;"Redundant systems. Incompatible with the new architecture. Remove everything."&lt;/p&gt;

&lt;p&gt;I watched five years of my code get deleted line by line from the servers. Git repos archived and locked. CI/CD pipelines redirected. Even the wiki pages about the legacy architecture were marked "Deprecated."&lt;/p&gt;

&lt;p&gt;Someone joked, "Your antique — it's officially a museum piece now."&lt;/p&gt;

&lt;p&gt;Mike sent one message in the group chat: &lt;strong&gt;"He's burning bridges."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I didn't reply.&lt;/p&gt;

&lt;p&gt;Because the bridge wasn't fully burned — I still had a copy in my drawer.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;4&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Then came Friday.&lt;/p&gt;

&lt;p&gt;After that last round at the pub, I went back to the office.&lt;/p&gt;

&lt;p&gt;Swiped my badge one last time. Opened a workstation.&lt;/p&gt;

&lt;p&gt;Copied the AI pilot code, the fine-tuned model weights, the 470,000 labeled records, the test reports — all of it — onto my old laptop.&lt;/p&gt;

&lt;p&gt;Not a backup.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Evidence.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;5&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Two weeks later. 2:47 AM.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The first week ran smoothly. Monitoring all green. The CEO patted James on the shoulder in the hallway — "Nice work."&lt;/p&gt;

&lt;p&gt;Then, on day thirteen — some accumulated threshold was triggered.&lt;/p&gt;

&lt;p&gt;My phone rang. The CEO's private number.&lt;/p&gt;

&lt;p&gt;I picked up. Three seconds of silence.&lt;/p&gt;

&lt;p&gt;"Leo. James's system… crashed."&lt;/p&gt;

&lt;p&gt;His voice didn't sound like a CEO. It sounded like a middle-aged man who'd just been dragged out of bed.&lt;/p&gt;

&lt;p&gt;"All six GPU servers went OOM. Three AI agents started stealing each other's context. Order duplication rate — 37%. Customer complaints — 2,300 in four hours. Estimated refund liability — $2.7 million."&lt;/p&gt;

&lt;p&gt;He kept going: "A competitor's 'offer of assistance' landed in our client's CEO inbox. First line: 'Your current architecture presents significant risk. We can complete migration within one week.'"&lt;/p&gt;

&lt;p&gt;I was already at the door, putting on my shoes. But I asked anyway: "What did James say?"&lt;/p&gt;

&lt;p&gt;CEO paused. "…He said they need two weeks."&lt;/p&gt;

&lt;p&gt;I blinked. "Two weeks?"&lt;/p&gt;

&lt;p&gt;"They said the prompts need retuning."&lt;/p&gt;

&lt;p&gt;I couldn't help it. I laughed into the phone. "$8 million. Two weeks before it collapsed. And the excuse is — the prompts weren't tuned."&lt;/p&gt;

&lt;p&gt;I thought to myself, &lt;strong&gt;His old company's product. Never disappoints.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;CEO didn't respond. Long silence. Then he said, "That old system… do you still have it?"&lt;/p&gt;

&lt;p&gt;"Yes."&lt;/p&gt;

&lt;p&gt;Another pause. Then he said something I didn't expect: "We can't find anything. The Git repos are empty. The servers are all running his new stack. Even the deployment docs were deleted. &lt;strong&gt;Five years of your work — there's nothing left in this company.&lt;/strong&gt; "&lt;/p&gt;

&lt;p&gt;I didn't say anything.&lt;/p&gt;

&lt;p&gt;"The code. Your team's six months of work. Is it still there?"&lt;/p&gt;

&lt;p&gt;"All of it. On one laptop. 500GB."&lt;/p&gt;

&lt;p&gt;He let out a long breath — like he'd been holding it for two full weeks.&lt;/p&gt;

&lt;p&gt;"Name your price."&lt;/p&gt;

&lt;p&gt;"I'll be there in 30 minutes."&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;6&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;3:12 AM.&lt;/p&gt;

&lt;p&gt;I was sitting in the server room. The new ops guy didn't know me — he was standing next to me, hands shaking, holding a cigarette.&lt;/p&gt;

&lt;p&gt;I connected my laptop to the switch. Changed one Nginx route.&lt;/p&gt;

&lt;p&gt;"Cut it over."&lt;/p&gt;

&lt;p&gt;He stared at me. "…Cut over to what?"&lt;/p&gt;

&lt;p&gt;"The legacy system."&lt;/p&gt;

&lt;p&gt;"Wasn't that… decommissioned?"&lt;/p&gt;

&lt;p&gt;I said: &lt;strong&gt;"No. I kept it."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;30 seconds.&lt;/p&gt;

&lt;p&gt;The monitoring dashboard went from blood red to green.&lt;/p&gt;

&lt;p&gt;Orders started flowing normally again. The entire AI stack — taken offline.&lt;/p&gt;

&lt;p&gt;I also deployed the AI pilot modules from my laptop onto three regular servers. Running as an assistance layer.&lt;/p&gt;

&lt;p&gt;140ms. 90.8%. Zero new hardware.&lt;/p&gt;

&lt;p&gt;I put the comparison on the big screen:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;The $8M AI (Before Crash)&lt;/th&gt;
&lt;th&gt;Legacy + AI Assist (After Recovery)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Response time&lt;/td&gt;
&lt;td&gt;2.1s → timeout&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;140–210ms&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Accuracy&lt;/td&gt;
&lt;td&gt;64% → gibberish&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;90.8%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Concurrent load&lt;/td&gt;
&lt;td&gt;Crashed at 62&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;4,200 normal&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Error rate&lt;/td&gt;
&lt;td&gt;100%&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0.3%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Recovery time&lt;/td&gt;
&lt;td&gt;Promised "two weeks"&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;30 seconds&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hardware cost&lt;/td&gt;
&lt;td&gt;$8M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;$0 (existing hardware)&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The CEO stood at the server room door, still catching his breath. He stared at the green line on the screen — a full half-minute.&lt;/p&gt;

&lt;p&gt;Then he asked, "Your six people — how many can come back?"&lt;/p&gt;

&lt;p&gt;I said, "How many can come back depends on why you let them go in the first place."&lt;/p&gt;

&lt;p&gt;CEO didn't answer. He stared at the green line, then at my old laptop in my hand. Finally, he asked the question he'd been holding back:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"That smart lock — is it running?"&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;He was talking about James's joke — "putting a smart lock on an old house doesn't make it a smart home." He remembered. So did I.&lt;/p&gt;

&lt;p&gt;I didn't turn around. "It's running. 140ms. 90.8%. And —"&lt;/p&gt;

&lt;p&gt;I closed the laptop.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Putting a smart lock on an old house doesn't make it a smart home. But the $8M new house you couldn't build — the old house is what held it up."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;CEO didn't say a word. He looked down the hall at the brightly lit "Special Projects" office. Then he turned and walked away.&lt;/p&gt;

&lt;p&gt;Two steps. Stopped.&lt;/p&gt;

&lt;p&gt;"Your team — I'll make the call. Come back."&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;7&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A week later. The client renewed. With a new clause in the contract: &lt;strong&gt;Any core architecture changes must be communicated in advance and approved in writing by the client.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Nobody mentioned the competitor's "offer of assistance" again. Someone sent me a screenshot of the email with a note: &lt;strong&gt;"$8M vs 30 seconds. Leo, you win."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;James was moved to "Special Projects" — no budget, no team, no OKRs. Just a desk and a chair.&lt;/p&gt;

&lt;p&gt;HR called me again. "Leo. The company wants you back. The entire backend team — full recall. Name your salary."&lt;/p&gt;

&lt;p&gt;"Fine. Three conditions."&lt;/p&gt;

&lt;p&gt;"First — cancel the paid AI. We don't need it."&lt;/p&gt;

&lt;p&gt;"Second — our AI pilot becomes official. But it stays as an assistance layer. It doesn't touch the main pipeline."&lt;/p&gt;

&lt;p&gt;"Third." I paused. "From now on — how the system is built and what tools we use — the engineers decide. Not the people who write the checks."&lt;/p&gt;

&lt;p&gt;HR was quiet for a long time. "I'll ask the CEO."&lt;/p&gt;

&lt;p&gt;The CEO's response — HR relayed it to me — six words:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Say yes to all of it. Bring him back."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;strong&gt;8&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Henry came back. Mike came back. Jay, Will, Chris — the whole team recalled.&lt;/p&gt;

&lt;p&gt;Same desks. Not the same salaries — everyone doubled.&lt;/p&gt;

&lt;p&gt;The CEO called me into his office. Closed the door. Three sentences:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Leo. The seat — it's yours."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"CTO."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"The team is yours. The budget is yours. The technical direction — yours."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;One late night, Mike walked over with two cups of instant noodles. "Leo. What do you think — if they'd taken one look at what we built back then, what would have happened?"&lt;/p&gt;

&lt;p&gt;I thought about it. "Nothing. They wouldn't have looked."&lt;/p&gt;

&lt;p&gt;Mike smiled. "Yeah. That's why you kept it."&lt;/p&gt;

&lt;p&gt;I said: &lt;strong&gt;"Someone has to remember. How this system was built — and who built it. Not a PowerPoint. Not $8 million. Six people."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That afternoon, I ran into James in the hallway.&lt;/p&gt;

&lt;p&gt;He was walking out of "Special Projects," carrying a cardboard box. Inside: his metal-rimmed glasses. A framed photo — him and the CEO on his first day.&lt;/p&gt;

&lt;p&gt;He saw me. Paused.&lt;/p&gt;

&lt;p&gt;I kept walking. Didn't look back.&lt;/p&gt;

&lt;p&gt;Five steps. I heard his voice behind me. "That pilot project… were the numbers real?"&lt;/p&gt;

&lt;p&gt;I stopped. Didn't turn around.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;"Real. Putting a smart lock on an old house doesn't make it a smart home. But if you're going to tear the house down — you should at least know where the load-bearing walls are."&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I walked away.&lt;/p&gt;

&lt;p&gt;That old laptop is still in my drawer.&lt;/p&gt;

&lt;p&gt;500GB.&lt;/p&gt;

&lt;p&gt;Auto-syncs every night.&lt;/p&gt;

&lt;p&gt;Not for revenge.&lt;/p&gt;

&lt;p&gt;So the next "James" knows — &lt;strong&gt;this system? Someone knows it better than anyone.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;You can spend $8M on an "AI-first" transformation. Or you can spend six months with a small team, build an open-source pilot that costs nothing, and keep the people who actually understand the system. The best AI strategy isn't replacing your team — it's giving them tools to do what they already do best.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;Your company is rolling out AI across the board, isn't it?&lt;/p&gt;

&lt;p&gt;"Full AI replacement." "Two weeks to production." "95% automation." Sound familiar?&lt;/p&gt;

&lt;p&gt;James wasn't the first CTO to do this. He won't be the last.&lt;/p&gt;

&lt;p&gt;So I'll ask you two questions —&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Will your job be replaced by AI?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;When a system goes down — does the human save it, or does the AI?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Drop your real thoughts below 👇&lt;/p&gt;




&lt;p&gt;The CTO spent $8M on his old company's product. My open-source solution cost $0. If you enjoyed this story, meet me in the middle — &lt;strong&gt;&lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee ☕&lt;/a&gt;&lt;/strong&gt; ($8M not required.🤣)&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>career</category>
      <category>programming</category>
    </item>
    <item>
      <title>[Boost]</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Tue, 02 Jun 2026 01:26:01 +0000</pubDate>
      <link>https://dev.to/xulingfeng/-g7d</link>
      <guid>https://dev.to/xulingfeng/-g7d</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/cydavid/the-most-valuable-qa-skill-in-the-age-of-ai-is-thinking-3b8p" class="crayons-story__hidden-navigation-link"&gt;The Most Valuable QA Skill in the Age of AI Is Thinking&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/cydavid" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1641776%2F2ac1fcc1-59ae-4b42-a6f0-4f6de5d7e4cd.jpeg" alt="cydavid profile" class="crayons-avatar__image"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/cydavid" class="crayons-story__secondary fw-medium m:hidden"&gt;
              David Ingraham
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                David Ingraham
                
              
              &lt;div id="story-author-preview-content-3796362" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/cydavid" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1641776%2F2ac1fcc1-59ae-4b42-a6f0-4f6de5d7e4cd.jpeg" class="crayons-avatar__image" alt=""&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;David Ingraham&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/cydavid/the-most-valuable-qa-skill-in-the-age-of-ai-is-thinking-3b8p" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;Jun 1&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/cydavid/the-most-valuable-qa-skill-in-the-age-of-ai-is-thinking-3b8p" id="article-link-3796362"&gt;
          The Most Valuable QA Skill in the Age of AI Is Thinking
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/ai"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;ai&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/testing"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;testing&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/programming"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;programming&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/cydavid/the-most-valuable-qa-skill-in-the-age-of-ai-is-thinking-3b8p" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="18" height="18"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;1&lt;span class="hidden s:inline"&gt;&amp;nbsp;reaction&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/cydavid/the-most-valuable-qa-skill-in-the-age-of-ai-is-thinking-3b8p#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              

              1&lt;span class="hidden s:inline"&gt;&amp;nbsp;comment&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            8 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
    </item>
    <item>
      <title>[Boost]</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Tue, 02 Jun 2026 00:30:39 +0000</pubDate>
      <link>https://dev.to/xulingfeng/-3e60</link>
      <guid>https://dev.to/xulingfeng/-3e60</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/zerodrop/testing-email-flows-in-playwright-without-a-mail-server-2ll7" class="crayons-story__hidden-navigation-link"&gt;Testing email flows in Playwright without a mail server&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/zerodrop" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3963379%2Fffcb1c8f-729c-4777-8c14-e3cccac1774b.png" alt="zerodrop profile" class="crayons-avatar__image" width="200" height="200"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/zerodrop" class="crayons-story__secondary fw-medium m:hidden"&gt;
              zerodrop
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                zerodrop
                
              
              &lt;div id="story-author-preview-content-3798597" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/zerodrop" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3963379%2Fffcb1c8f-729c-4777-8c14-e3cccac1774b.png" class="crayons-avatar__image" alt="" width="200" height="200"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;zerodrop&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/zerodrop/testing-email-flows-in-playwright-without-a-mail-server-2ll7" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;Jun 1&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/zerodrop/testing-email-flows-in-playwright-without-a-mail-server-2ll7" id="article-link-3798597"&gt;
          Testing email flows in Playwright without a mail server
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/playwright"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;playwright&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/testing"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;testing&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/webdev"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;webdev&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/javascript"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;javascript&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/zerodrop/testing-email-flows-in-playwright-without-a-mail-server-2ll7" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;1&lt;span class="hidden s:inline"&gt;&amp;nbsp;reaction&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/zerodrop/testing-email-flows-in-playwright-without-a-mail-server-2ll7#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              

              2&lt;span class="hidden s:inline"&gt;&amp;nbsp;comments&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            4 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
    </item>
    <item>
      <title>[Boost]</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Mon, 01 Jun 2026 23:51:18 +0000</pubDate>
      <link>https://dev.to/xulingfeng/-47bi</link>
      <guid>https://dev.to/xulingfeng/-47bi</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/rapidclaw/your-ai-agent-doesnt-have-a-model-problem-it-has-an-ops-problem-the-20-reliability-trap-6oa" class="crayons-story__hidden-navigation-link"&gt;Your AI Agent Doesn't Have a Model Problem — It Has an Ops Problem [The 20% Reliability Trap]&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/rapidclaw" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3850323%2F4c57502d-d13a-4255-aa80-30e2ab22d035.jpeg" alt="rapidclaw profile" class="crayons-avatar__image" width="460" height="460"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/rapidclaw" class="crayons-story__secondary fw-medium m:hidden"&gt;
              Tijo Gaucher
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                Tijo Gaucher
                
              
              &lt;div id="story-author-preview-content-3791990" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/rapidclaw" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3850323%2F4c57502d-d13a-4255-aa80-30e2ab22d035.jpeg" class="crayons-avatar__image" alt="" width="460" height="460"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;Tijo Gaucher&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/rapidclaw/your-ai-agent-doesnt-have-a-model-problem-it-has-an-ops-problem-the-20-reliability-trap-6oa" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;Jun 1&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/rapidclaw/your-ai-agent-doesnt-have-a-model-problem-it-has-an-ops-problem-the-20-reliability-trap-6oa" id="article-link-3791990"&gt;
          Your AI Agent Doesn't Have a Model Problem — It Has an Ops Problem [The 20% Reliability Trap]
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/ai"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;ai&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/agents"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;agents&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/automation"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;automation&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/devops"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;devops&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/rapidclaw/your-ai-agent-doesnt-have-a-model-problem-it-has-an-ops-problem-the-20-reliability-trap-6oa" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;2&lt;span class="hidden s:inline"&gt;&amp;nbsp;reactions&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/rapidclaw/your-ai-agent-doesnt-have-a-model-problem-it-has-an-ops-problem-the-20-reliability-trap-6oa#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              

              2&lt;span class="hidden s:inline"&gt;&amp;nbsp;comments&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            4 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success crayons-icon c-btn__icon"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
    </item>
    <item>
      <title>My Company Bought a $660K AI Platform. I Was Replaced. On Friday at 2:58 AM, It Fixed Everything. Then It Rolled Back the Wrong Patch.</title>
      <dc:creator>xulingfeng</dc:creator>
      <pubDate>Mon, 01 Jun 2026 03:47:40 +0000</pubDate>
      <link>https://dev.to/xulingfeng/my-company-bought-a-660k-ai-platform-i-was-replaced-on-friday-at-258-am-it-fixed-everything-3kc4</link>
      <guid>https://dev.to/xulingfeng/my-company-bought-a-660k-ai-platform-i-was-replaced-on-friday-at-258-am-it-fixed-everything-3kc4</guid>
      <description>&lt;p&gt;&lt;em&gt;Based on real system architecture decisions. About a $660K AI platform, three AI agents that kept the dashboard green, and a P0 incident that cost $3.15M over one weekend.&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 1 · The All-Hands Meeting
&lt;/h2&gt;

&lt;p&gt;Wang Lei, VP of Product, stood in front of the big screen, a smile on his face.&lt;/p&gt;

&lt;p&gt;Behind him, a dashboard rolled data from the "Axon AI Client Engineering Platform — Q1 Performance Report." Numbers cascaded across the wall:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Axon Platform&lt;/th&gt;
&lt;th&gt;Human Team (Last Q1)&lt;/th&gt;
&lt;th&gt;Improvement&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Avg daily tickets processed&lt;/td&gt;
&lt;td&gt;847&lt;/td&gt;
&lt;td&gt;312&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;+171%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Avg first response time&lt;/td&gt;
&lt;td&gt;12s&lt;/td&gt;
&lt;td&gt;4h 17m&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;↓ 99.92%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Customer satisfaction&lt;/td&gt;
&lt;td&gt;4.8/5&lt;/td&gt;
&lt;td&gt;4.1/5&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;+17%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Monthly operating cost&lt;/td&gt;
&lt;td&gt;$52K&lt;/td&gt;
&lt;td&gt;$133K&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;−61%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Twelve department heads sat in the room. Dead silence.&lt;/p&gt;

&lt;p&gt;Wang Lei planted both hands on the table and scanned the room. His eyes landed on me.&lt;/p&gt;

&lt;p&gt;"Alex. Your team processed 312 tickets last Q1. Axon processed more than that in a &lt;em&gt;single day&lt;/em&gt; last month."&lt;/p&gt;

&lt;p&gt;He smiled. Not a friendly smile. A sentencing smile.&lt;/p&gt;

&lt;p&gt;"And Axon costs less than a third of your team's operating expense."&lt;/p&gt;

&lt;p&gt;"We invested $660K in the whole platform. At current operating costs, it pays for itself in eighteen months."&lt;/p&gt;

&lt;p&gt;"After management review — the Client Engineering technical liaison function is being fully transitioned to the Axon platform."&lt;/p&gt;

&lt;p&gt;He clicked to the next slide.&lt;/p&gt;

&lt;p&gt;"Employees in replaced roles will complete exit interviews within the week."&lt;/p&gt;

&lt;p&gt;Someone inhaled sharply.&lt;/p&gt;

&lt;p&gt;I didn't. I opened my notebook to page 37.&lt;/p&gt;

&lt;p&gt;"Wang, what dimensions are these numbers from?"&lt;/p&gt;

&lt;p&gt;"What do you mean, 'what dimensions'?" His smile tightened.&lt;/p&gt;

&lt;p&gt;"Of those 847 daily tickets — how many are auto-tagging and routing, and how many are actual technical resolutions?"&lt;/p&gt;

&lt;p&gt;The room went quiet for about five seconds.&lt;/p&gt;

&lt;p&gt;Wang Lei looked at me. "Axon's ticket closure rate is ninety-three percent."&lt;/p&gt;

&lt;p&gt;"What's the reopen rate?"&lt;/p&gt;

&lt;p&gt;He paused. "What?"&lt;/p&gt;

&lt;p&gt;"After Axon replies — how many customers reopen the same ticket within twenty-four hours?"&lt;/p&gt;

&lt;p&gt;"We're still collecting that —"&lt;/p&gt;

&lt;p&gt;"Let me save you the trouble." I turned my notebook toward the room. Three lines in handwriting:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Axon ticket closure rate: 93%
24h reopen rate: 41%
Human escalation rate: 37%
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;"Out of every 100 tickets Axon closes, 41 customers had their issue unresolved and came back. 37 of those ended up needing a human anyway."&lt;/p&gt;

&lt;p&gt;"847 tickets × 37% = 313. That's exactly what my team handled manually last Q1."&lt;/p&gt;

&lt;p&gt;People in the room started checking their phones.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Your AI didn't replace anyone. You just put a voice assistant in front of every ticket I was already handling."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Wang Lei's face went red.&lt;/p&gt;

&lt;p&gt;After the meeting, the HR notification hit my phone. Time: 3 PM today.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 2 · The HR Signing
&lt;/h2&gt;

&lt;p&gt;Zheng, the HR Director. Mid-forties, sharp chin. Her smile felt like an assessment.&lt;/p&gt;

&lt;p&gt;She slid a paper across the desk: &lt;em&gt;Voluntary Separation Agreement.&lt;/em&gt; Severance: the legal minimum.&lt;/p&gt;

&lt;p&gt;"Sign it."&lt;/p&gt;

&lt;p&gt;"I want to see the real Axon operating data."&lt;/p&gt;

&lt;p&gt;"This isn't a negotiation." She uncapped the pen and set it on the paper. "Your desk needs to be cleared by 6 PM. Company laptop, keycard, all storage media — hand them over on-site."&lt;/p&gt;

&lt;p&gt;"Storage media?"&lt;/p&gt;

&lt;p&gt;"Per Wang Lei's specific request — seven years of accumulated technical materials and client communication records in Client Engineering. All company property. All to be turned over."&lt;/p&gt;

&lt;p&gt;I looked at her.&lt;/p&gt;

&lt;p&gt;"Are you serious?"&lt;/p&gt;

&lt;p&gt;She didn't blink.&lt;/p&gt;

&lt;p&gt;I picked up the pen. Signed.&lt;/p&gt;

&lt;p&gt;Then handed it back.&lt;/p&gt;

&lt;p&gt;"All client-related local files on my laptop — already deleted."&lt;/p&gt;

&lt;p&gt;Her face flickered. "What?"&lt;/p&gt;

&lt;p&gt;"Archived backups are in the company knowledge base. Local cache, work notes, technical scoping docs — wiped clean before I walked in here. The company assets I already submitted. Everything I ever uploaded to the knowledge base."&lt;/p&gt;

&lt;p&gt;"What's left is my personal engineering notebook."&lt;/p&gt;

&lt;p&gt;I pulled a hardcover notebook from my bag. The cover was worn white, corners frayed.&lt;/p&gt;

&lt;p&gt;"Twenty-three client requirement analyses. Seventeen POC architecture scopes. Seven years of post-mortems written at 3 AM after every phone call. All in here."&lt;/p&gt;

&lt;p&gt;"Not company property. I wrote it."&lt;/p&gt;

&lt;p&gt;I closed the notebook and stood up.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"If Wang Lei needs this data — his AI can generate it."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I walked out of HR and went back to my desk. The old sticky note was still under the monitor, the ink faded — from four years ago, my first POC with Mike, CTO of MedTech. I'd written my number and told him, "If something breaks at 3 AM, call this." He saved it in his phone. I kept the note under my monitor — a reminder of what I'd promised. A phone number. Next to it: &lt;em&gt;3 AM. Call this.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I looked at it. Folded it twice. Put it in my jacket pocket.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 3 · Seven Years of Weight
&lt;/h2&gt;

&lt;p&gt;2 AM. I sat in my car, engine off.&lt;/p&gt;

&lt;p&gt;Seven years.&lt;/p&gt;

&lt;p&gt;Seven years of late-night calls I'd lost track of. MedTech's compliance audit — four rounds. I found a log bug buried three years deep before their own compliance officer did. FinTech's payment system migration — I slept on the data center floor three nights straight. Manufacturing's IoT protocol stack failure — I sat in a remote session for 11 hours, diffing logs line by line.&lt;/p&gt;

&lt;p&gt;Not because I wrote better code than anyone else.&lt;/p&gt;

&lt;p&gt;Because when those clients had an emergency, the first person they called was me — not the support line.&lt;/p&gt;

&lt;p&gt;My name in the ticketing system: 214 P0+P1 incidents resolved.&lt;/p&gt;

&lt;p&gt;193 of them happened outside business hours.&lt;/p&gt;

&lt;p&gt;They'd never appear in Wang Lei's PowerPoint. Because Axon's "847 daily tickets" only counted business hours. It didn't count 3 AM.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;At 3 AM, Axon is off duty.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 4 · The Crash
&lt;/h2&gt;

&lt;p&gt;Three weeks later. Friday. 2:58 AM.&lt;/p&gt;

&lt;p&gt;My phone lit up on the nightstand.&lt;/p&gt;

&lt;p&gt;Not a number I didn't recognize — Mike, CTO of MedTech.&lt;/p&gt;

&lt;p&gt;"Alex. Our compliance pipeline is stuck. Core transaction modules are erroring out. Payment gateways are all timing out. We've got 20,000 orders queued."&lt;/p&gt;

&lt;p&gt;"What did Axon do?"&lt;/p&gt;

&lt;p&gt;"It ran diagnostics automatically. Rolled back the last two deployments, restored from snapshot. All surface metrics turned green. Then the compliance pipeline crashed again fifteen minutes later — and this time the data was completely corrupted. It sent an auto-reply — 'Your case has been escalated to our technical team, expected response within 48 hours' — and marked the ticket as 'Resolved.'"&lt;/p&gt;

&lt;p&gt;"48 hours?"&lt;/p&gt;

&lt;p&gt;"48 hours. And here's the fun part — the rollback also removed a compliance hotfix from three weeks ago. The one your team applied manually because it never made it into the deployment pipeline. Finding and reapplying it is going to take at least two more days."&lt;/p&gt;

&lt;p&gt;"Finance just ran the numbers — direct losses so far: $630K. If this isn't restored by Monday morning, including SLA penalties and compliance fines, we're looking at over $3.15M. Do you have anyone over there who can take a remote look? The on-call engineers can't even tell me what hotfixes were applied."&lt;/p&gt;

&lt;p&gt;I sat up. Opened my notebook.&lt;/p&gt;

&lt;p&gt;"I don't have access anymore. I returned my laptop the day I was let go."&lt;/p&gt;

&lt;p&gt;Two seconds of silence on the line.&lt;/p&gt;

&lt;p&gt;"I'll give you a temporary account. MedTech-side ops portal. You built the integration layer — you know it better than anyone on my payroll."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Fifteen minutes."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;That fifteen minutes turned into a weekend.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 5 · The Data Doesn't Lie
&lt;/h2&gt;

&lt;p&gt;Sunday, 5 PM. I sent an email.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;To:&lt;/strong&gt; Former CEO, VP Wang Lei&lt;br&gt;
&lt;strong&gt;CC:&lt;/strong&gt; Mike, FinTech CTO, Manufacturing CTO&lt;/p&gt;

&lt;p&gt;Attachment: &lt;code&gt;Axon_vs_Reality_2026Q2.md&lt;/code&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Comparison&lt;/th&gt;
&lt;th&gt;Axon Claimed&lt;/th&gt;
&lt;th&gt;Actual (3-week production data)&lt;/th&gt;
&lt;th&gt;Gap&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Auto-resolution (surface-level)&lt;/td&gt;
&lt;td&gt;95%&lt;/td&gt;
&lt;td&gt;83%&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;−12pp&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Auto-resolution (architecture-level)&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;6.8%&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;6.8% actual&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ticket closure (no reopen in 48h)&lt;/td&gt;
&lt;td&gt;93%&lt;/td&gt;
&lt;td&gt;61%&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;−32pp&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;P0/P1 diagnostic accuracy&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Rollback restored surface → left root cause untouched&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;3/3 misdiagnosed&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;P0/P1 manual hotfix preservation&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Auto-rollback overwrote without detection&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;0%&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Off-hours coverage&lt;/td&gt;
&lt;td&gt;24/7&lt;/td&gt;
&lt;td&gt;89% ticket accumulation between midnight and 6 AM&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Effectively absent&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Client opt-in willingness&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Top 3 clients all refused Axon handoff&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;100% rejection&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The email body had one line:&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"This is what your $660K bought."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Twenty minutes later, Mike's reply was one sentence:&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Monday, 9 AM. My office. Bring your notebook."&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 6 · The Phone Call
&lt;/h2&gt;

&lt;p&gt;Sunday, 11:47 PM. My former CEO's number.&lt;/p&gt;

&lt;p&gt;He'd never called me at this hour.&lt;/p&gt;

&lt;p&gt;"Alex."&lt;/p&gt;

&lt;p&gt;"Yeah."&lt;/p&gt;

&lt;p&gt;"I saw the email."&lt;/p&gt;

&lt;p&gt;Silence.&lt;/p&gt;

&lt;p&gt;"Are those numbers real?"&lt;/p&gt;

&lt;p&gt;"There's a full audit trail in our internal systems. Check it yourself."&lt;/p&gt;

&lt;p&gt;More silence.&lt;/p&gt;

&lt;p&gt;"Wang Lei says he didn't know."&lt;/p&gt;

&lt;p&gt;I paused.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"He says he didn't know — and you believed him?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A long silence on the line.&lt;/p&gt;

&lt;p&gt;"Were you at the Axon procurement meeting?"&lt;/p&gt;

&lt;p&gt;"No."&lt;/p&gt;

&lt;p&gt;"Then how do you know the $660K figure?"&lt;/p&gt;

&lt;p&gt;"...The budget was Wang Lei's proposal. The board approved it."&lt;/p&gt;

&lt;p&gt;"And when you signed off — did you know his PPT numbers were cherry-picked? Did you know the '95% auto-resolution' only counted tickets the AI could open — not tickets it could actually close?"&lt;/p&gt;

&lt;p&gt;He didn't answer.&lt;/p&gt;

&lt;p&gt;"Did you know — when you signed?"&lt;/p&gt;

&lt;p&gt;"No."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"Now you do."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;"Alex."&lt;/p&gt;

&lt;p&gt;"Yeah."&lt;/p&gt;

&lt;p&gt;"Come back."&lt;/p&gt;

&lt;p&gt;"Come back to what?"&lt;/p&gt;

&lt;p&gt;"Your old role. Wang Lei — I'll handle it."&lt;/p&gt;

&lt;p&gt;"No thanks."&lt;/p&gt;

&lt;p&gt;"What?"&lt;/p&gt;

&lt;p&gt;"MedTech's offer is already in. Principal Architect. Double the compensation. Title bump. Signed Monday morning."&lt;/p&gt;

&lt;p&gt;"Tell Wang Lei — his Axon is great at generating beautiful reports."&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;"It's just not great at answering the phone at 3 AM."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I hung up.&lt;/p&gt;




&lt;h2&gt;
  
  
  Act 7 · The New Office
&lt;/h2&gt;

&lt;p&gt;Three months later. MedTech's new compliance engineering center.&lt;/p&gt;

&lt;p&gt;No monitor on my desk. Just a laptop and a worn-hardcover notebook.&lt;/p&gt;

&lt;p&gt;Mike walked over with two coffees.&lt;/p&gt;

&lt;p&gt;"Did you know your old HR called to verify your background?"&lt;/p&gt;

&lt;p&gt;"Which HR?"&lt;/p&gt;

&lt;p&gt;"Your former company. They wanted to confirm you were actually employed here."&lt;/p&gt;

&lt;p&gt;"What did you say?"&lt;/p&gt;

&lt;p&gt;"I said — he's in the office next to mine. Want to say hello?"&lt;/p&gt;

&lt;p&gt;I laughed.&lt;/p&gt;

&lt;p&gt;An email notification popped up. FinTech's CTO.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;"Subject: Your report made it to our board. We're putting our engagement with your former company on hold. Want to talk about a consulting contract?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I glanced at the time. 1 AM.&lt;/p&gt;

&lt;p&gt;I locked the screen. Didn't reply.&lt;/p&gt;

&lt;p&gt;I'll answer tomorrow morning when I get to the office.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Trust doesn't come with "please hold while the system generates a response."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;It comes with a phone number — and when you call it, a real person picks up.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;——&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Alex's notebook still has that sticky note inside. The phone number hasn't changed. The only difference is — this time, nobody's AI gets in the way.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Have you ever watched a system you built get handed over to someone else's algorithm? What happened next?&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;My dashboard has been green for 3 hours. I'm terrified🤣. — &lt;a href="https://ko-fi.com/xulingfeng" rel="noopener noreferrer"&gt;buy me a coffee ☕&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>discuss</category>
      <category>career</category>
    </item>
  </channel>
</rss>
