<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: AGHL</title>
    <description>The latest articles on DEV Community by AGHL (@aghl_retestees).</description>
    <link>https://dev.to/aghl_retestees</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3914704%2F5db7cf3e-eb11-4396-9388-e9cabc008bbb.png</url>
      <title>DEV Community: AGHL</title>
      <link>https://dev.to/aghl_retestees</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/aghl_retestees"/>
    <language>en</language>
    <item>
      <title>We measured how much time our team wasted on flaky tests. The numbers were ugly</title>
      <dc:creator>AGHL</dc:creator>
      <pubDate>Tue, 05 May 2026 21:37:43 +0000</pubDate>
      <link>https://dev.to/aghl_retestees/we-measured-how-much-time-our-team-wasted-on-flaky-tests-the-numbers-were-ugly-2ef4</link>
      <guid>https://dev.to/aghl_retestees/we-measured-how-much-time-our-team-wasted-on-flaky-tests-the-numbers-were-ugly-2ef4</guid>
      <description>&lt;p&gt;Like most teams using GitHub Actions, we’d gotten used to the ritual: push code, wait for CI, see a red build, re-run it, hope it passes this time. “It’s probably flaky” became the default response to any test failure — including real ones.&lt;/p&gt;

&lt;p&gt;We decided to actually measure the damage. Over 30 days on a single repo:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;842 CI runs&lt;/strong&gt; → &lt;strong&gt;117 failures&lt;/strong&gt; (13.9% failure rate)&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;31.5 developer hours&lt;/strong&gt; spent investigating and re-running&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;$426 in CI compute&lt;/strong&gt; burned on re-runs that shouldn’t have been needed&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;1 regression shipped to production&lt;/strong&gt; because a real failure was dismissed as “just flaky”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The worst part? Nobody could tell us &lt;em&gt;which&lt;/em&gt; tests were flaky. We had a vague sense — “that login test is weird” — but no actual inventory. And without an inventory, you can’t fix what you can’t see.&lt;/p&gt;

&lt;p&gt;So we built &lt;strong&gt;Retestees&lt;/strong&gt; — a tool that connects to your GitHub Actions in 2 minutes and gives you a CI Waste Report showing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;  &lt;strong&gt;Which tests fail repeatedly without code changes&lt;/strong&gt; (true flaky tests)&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;How much developer time and CI cost&lt;/strong&gt; each one wastes&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;Which workflows are the least stable&lt;/strong&gt;, ranked by failure rate&lt;/li&gt;
&lt;li&gt;  &lt;strong&gt;A clear priority list&lt;/strong&gt; so you fix the most expensive flaky tests first&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No code changes. No config files. No test framework plugins. You connect your repo, and we analyze your existing CI history.&lt;/p&gt;

&lt;p&gt;We’re looking for teams to try it free. If you’re on GitHub Actions and tired of re-running builds, we’ll generate a free CI Waste Report for your repo — no credit card, no commitment. You’ll see exactly where your CI time and money are going.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://retestees.app" rel="noopener noreferrer"&gt;Get your free CI Waste Report&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;(We also have a $29/mo beta plan if you want ongoing monitoring and alerts. Details on the site.)&lt;/p&gt;

&lt;p&gt;Would love feedback from anyone dealing with this. What’s your worst flaky test story?&lt;/p&gt;

</description>
      <category>githubactions</category>
      <category>devops</category>
      <category>productivity</category>
      <category>testing</category>
    </item>
  </channel>
</rss>
