<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: t3riah</title>
    <description>The latest articles on DEV Community by t3riah (@t3riah).</description>
    <link>https://dev.to/t3riah</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3801387%2Ffa9406bf-635f-4a7e-9e7b-9e6b1965fc66.png</url>
      <title>DEV Community: t3riah</title>
      <link>https://dev.to/t3riah</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/t3riah"/>
    <language>en</language>
    <item>
      <title>I Built a Free AI Model Waste Auditor — No Signup Required</title>
      <dc:creator>t3riah</dc:creator>
      <pubDate>Mon, 02 Mar 2026 09:41:33 +0000</pubDate>
      <link>https://dev.to/t3riah/i-built-a-free-ai-model-waste-auditor-no-signup-required-2ahn</link>
      <guid>https://dev.to/t3riah/i-built-a-free-ai-model-waste-auditor-no-signup-required-2ahn</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; I built &lt;a href="https://no13thfloor.org" rel="noopener noreferrer"&gt;No 13th Floor&lt;/a&gt; — a free tool that audits your AI model usage and tells you exactly how much compute and money you're wasting by running oversized models. No account needed. No credit card. Try it in 30 seconds.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;Every engineering team I've talked to has the same story: they reached for GPT-4 (or a 70B open-source model) to handle tasks that a 7B model could do just fine. The logic made sense at the time — "why risk it?" — but the bill didn't.&lt;/p&gt;

&lt;p&gt;A classification task that needs 3B parameters gets a 70B model. A summarization job that runs perfectly on Mistral 7B gets routed through a GPT-4-class API at 30x the cost. The waste is invisible until it shows up as a cloud bill line item that everyone shrugs at.&lt;/p&gt;

&lt;p&gt;This isn't hypothetical. These conversations happen on Reddit, in Slack, in post-mortems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;"We were using GPT-4 for everything... yeah, I know" — r/OpenAI&lt;/li&gt;
&lt;li&gt;"AI support costs way higher than expected... wrong model for the task" — r/SaaS&lt;/li&gt;
&lt;li&gt;"It felt like overkill to use a 175B+ model just for simple logic" — r/AI_Agents&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The pattern is universal: reach for the biggest model, pay the price, trial-and-error to something cheaper. That whole process can be shortcircuited.&lt;/p&gt;




&lt;h2&gt;
  
  
  What No 13th Floor Does
&lt;/h2&gt;

&lt;p&gt;You describe:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Your use case (what the model is actually doing)&lt;/li&gt;
&lt;li&gt;The model you're currently running&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The tool returns:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Floor Score&lt;/strong&gt; — a letter grade (A–F) for your current model efficiency&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Monthly waste estimate&lt;/strong&gt; in USD&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Annual projected waste&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Oversize ratio&lt;/strong&gt; — how many times larger your model is than optimal&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recommended architecture&lt;/strong&gt; — the right-sized model for your workload&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Platform matches&lt;/strong&gt; — curated cost-reduction platforms matched to your specific use case&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No account. No credit card. No waiting.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Stack
&lt;/h2&gt;

&lt;p&gt;Kept it intentionally lean:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Backend:&lt;/strong&gt; Python / Flask&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Frontend:&lt;/strong&gt; Plain HTML/CSS — no framework&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Inference:&lt;/strong&gt; Groq API (Llama 3 70B for the scoring logic)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Hosting:&lt;/strong&gt; Oracle Cloud Free Tier&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Auth:&lt;/strong&gt; None — by design&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The irony of using a 70B model to tell you when &lt;em&gt;not&lt;/em&gt; to use a 70B model isn't lost on me. The scoring engine needs reasoning capability. The output recommendations are about your workload, not ours.&lt;/p&gt;




&lt;h2&gt;
  
  
  Self-Hosting
&lt;/h2&gt;

&lt;p&gt;The repo is fully open:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/t3riah/no13thfloor.git
&lt;span class="nb"&gt;cd &lt;/span&gt;no13thfloor
&lt;span class="nb"&gt;cp&lt;/span&gt; .env.example .env
&lt;span class="c"&gt;# Add your GROQ_API_KEY to .env&lt;/span&gt;
pip &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;-r&lt;/span&gt; requirements.txt
python app.py
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;All you need is a free &lt;a href="https://groq.com" rel="noopener noreferrer"&gt;Groq API key&lt;/a&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Cost-Reduction Platform Stack
&lt;/h2&gt;

&lt;p&gt;Based on your score, the tool recommends right-sized inference platforms:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Platform&lt;/th&gt;
&lt;th&gt;Best For&lt;/th&gt;
&lt;th&gt;Savings Potential&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Groq&lt;/td&gt;
&lt;td&gt;Speed-critical, real-time tasks&lt;/td&gt;
&lt;td&gt;Up to 90% vs GPT-4&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Together AI&lt;/td&gt;
&lt;td&gt;Open-source model hosting&lt;/td&gt;
&lt;td&gt;Up to 95% vs GPT-4&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Replicate&lt;/td&gt;
&lt;td&gt;Bursty / pay-per-prediction&lt;/td&gt;
&lt;td&gt;Eliminates idle cost&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Vantage&lt;/td&gt;
&lt;td&gt;AI API cost tracking&lt;/td&gt;
&lt;td&gt;Free up to $2,500/mo&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Why Free?
&lt;/h2&gt;

&lt;p&gt;Because the problem is a tax on builders who don't know better yet. Paywalling the audit would just mean the waste continues. If this tool saves your team money, consider &lt;a href="https://ko-fi.com/no13thfloor" rel="noopener noreferrer"&gt;supporting it on Ko-fi&lt;/a&gt; — it keeps the free tier free.&lt;/p&gt;




&lt;h2&gt;
  
  
  Try It
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://no13thfloor.org" rel="noopener noreferrer"&gt;no13thfloor.org&lt;/a&gt;&lt;/strong&gt; — free, no signup, results in under 30 seconds.&lt;/p&gt;

&lt;p&gt;Source: &lt;a href="https://github.com/t3riah/no13thfloor" rel="noopener noreferrer"&gt;github.com/t3riah/no13thfloor&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Built by &lt;a href="https://firstprincipledynamics.com" rel="noopener noreferrer"&gt;First Principle Dynamics LLC&lt;/a&gt; — Miami, FL. Minority-owned. Building tools grounded in first principles thinking.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;The 13th floor exists. You just weren't looking for it.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>python</category>
      <category>opensource</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
