DEV Community

t3riah

I Built a Free AI Model Waste Auditor — No Signup Required

TL;DR: I built No 13th Floor, a free tool that audits your AI model usage and estimates how much compute and money you're wasting by running oversized models. No account needed. No credit card. Try it in 30 seconds.


The Problem

Every engineering team I've talked to has the same story: they reached for GPT-4 (or a 70B open-source model) to handle tasks that a 7B model could do just fine. The logic made sense at the time — "why risk it?" — but the bill didn't.

A classification task that needs 3B parameters gets a 70B model. A summarization job that runs perfectly on Mistral 7B gets routed through a GPT-4-class API at 30x the cost. The waste is invisible until it shows up as a cloud bill line item that everyone shrugs at.

This isn't hypothetical. These conversations happen on Reddit, in Slack, in post-mortems:

  • "We were using GPT-4 for everything... yeah, I know" — r/OpenAI
  • "AI support costs way higher than expected... wrong model for the task" — r/SaaS
  • "It felt like overkill to use a 175B+ model just for simple logic" — r/AI_Agents

The pattern is universal: reach for the biggest model, pay the price, then trial-and-error your way down to something cheaper. That whole process can be short-circuited.


What No 13th Floor Does

You describe:

  1. Your use case (what the model is actually doing)
  2. The model you're currently running

The tool returns:

  • Floor Score — a letter grade (A–F) for your current model efficiency
  • Monthly waste estimate in USD
  • Annual projected waste
  • Oversize ratio — how many times larger your model is than optimal
  • Recommended architecture — the right-sized model for your workload
  • Platform matches — curated cost-reduction platforms matched to your specific use case

No account. No credit card. No waiting.
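
To make the metrics above concrete, here is a minimal sketch of how an oversize ratio, a waste estimate, and a letter grade could be derived. It is not the tool's actual scoring logic (that runs through an LLM): the function names, the grade thresholds, and the per-million-token prices in the usage example are all hypothetical and only show the arithmetic.

```python
from dataclasses import dataclass


@dataclass
class AuditResult:
    oversize_ratio: float
    monthly_waste_usd: float
    annual_waste_usd: float
    floor_score: str


def grade_for_ratio(ratio: float) -> str:
    """Map an oversize ratio to a letter grade. Thresholds are hypothetical."""
    # (upper bound on the ratio, grade), checked in ascending order.
    thresholds = [(1.5, "A"), (3.0, "B"), (6.0, "C"), (12.0, "D")]
    for upper, grade in thresholds:
        if ratio <= upper:
            return grade
    return "F"


def audit(
    current_params_b: float,     # size of the model in use, in billions of parameters
    optimal_params_b: float,     # size of a right-sized model for the task
    monthly_tokens_m: float,     # monthly volume, in millions of tokens
    current_price_per_m: float,  # assumed USD per million tokens, current model
    optimal_price_per_m: float,  # assumed USD per million tokens, right-sized model
) -> AuditResult:
    if optimal_params_b <= 0:
        raise ValueError("optimal_params_b must be positive")
    ratio = current_params_b / optimal_params_b
    # Waste is the spend above what the right-sized model would cost, never negative.
    monthly = max(0.0, monthly_tokens_m * (current_price_per_m - optimal_price_per_m))
    monthly = round(monthly, 2)
    return AuditResult(ratio, monthly, round(monthly * 12, 2), grade_for_ratio(ratio))


# Example with made-up numbers: a 70B model doing a 7B model's job.
result = audit(70, 7, monthly_tokens_m=400, current_price_per_m=1.0, optimal_price_per_m=0.25)
print(result)
```

Annual waste here is simply twelve times the monthly figure, so it assumes steady usage.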


The Stack

Kept it intentionally lean:

  • Backend: Python / Flask
  • Frontend: Plain HTML/CSS — no framework
  • Inference: Groq API (Llama 3 70B for the scoring logic)
  • Hosting: Oracle Cloud Free Tier
  • Auth: None — by design

The irony of using a 70B model to tell you when not to use a 70B model isn't lost on me. The scoring engine needs reasoning capability. The output recommendations are about your workload, not ours.
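
For readers curious how an LLM-backed scoring step like this can be wired up, here is a rough sketch. It is not taken from the repo: the prompt wording, the JSON keys, and the helper names (`build_audit_prompt`, `parse_audit_reply`, `run_audit`) are assumptions for illustration. The model call is passed in as a plain callable, so in a real app that callable would wrap a chat-completion request to the inference provider.

```python
import json
from typing import Callable

# Hypothetical response schema; the real tool may use different fields.
REQUIRED_KEYS = {"floor_score", "oversize_ratio", "recommended_model"}


def build_audit_prompt(use_case: str, current_model: str) -> str:
    """Assemble the instruction sent to the scoring model."""
    return (
        "You audit AI model sizing. Reply with a single JSON object with keys "
        "floor_score (A-F), oversize_ratio (number), recommended_model (string).\n"
        f"Use case: {use_case}\n"
        f"Current model: {current_model}"
    )


def parse_audit_reply(reply: str) -> dict:
    """Extract and validate the JSON object from the model's reply."""
    # Models sometimes wrap JSON in prose, so take the outermost {...} span.
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object in model reply")
    data = json.loads(reply[start:end + 1])
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise ValueError(f"reply missing keys: {sorted(missing)}")
    if str(data["floor_score"]) not in set("ABCDEF"):
        raise ValueError(f"unexpected floor_score: {data['floor_score']!r}")
    return data


def run_audit(use_case: str, current_model: str, complete: Callable[[str], str]) -> dict:
    """Run one audit; `complete` sends a prompt to a model and returns its text."""
    return parse_audit_reply(complete(build_audit_prompt(use_case, current_model)))
```

Keeping the model call behind `complete` means the parsing and validation can be tested with a canned reply, without an API key.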


Self-Hosting

The repo is fully open:

git clone https://github.com/t3riah/no13thfloor.git
cd no13thfloor
cp .env.example .env
# Add your GROQ_API_KEY to .env
pip install -r requirements.txt
python app.py

All you need is a free Groq API key.
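
If the app fails on startup, a missing key is the likely culprit. The snippet below is an optional sanity check, not part of the repo: it assumes `.env` uses plain `KEY=value` lines, and the helper names `read_env_file` and `has_groq_key` are made up for this example. How `app.py` itself loads the file is not shown here.

```python
from pathlib import Path


def read_env_file(path: str = ".env") -> dict:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values = {}
    for raw in Path(path).read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def has_groq_key(path: str = ".env") -> bool:
    """True if GROQ_API_KEY is present and non-empty in the given file."""
    return bool(read_env_file(path).get("GROQ_API_KEY"))


if __name__ == "__main__":
    if Path(".env").exists() and has_groq_key():
        print("GROQ_API_KEY found")
    else:
        print("GROQ_API_KEY is missing or empty in .env")
```

This only confirms that a value is set. It cannot tell a real key from a placeholder left over from `.env.example`.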


The Cost-Reduction Platform Stack

Based on your score, the tool recommends right-sized inference platforms:

| Platform    | Best For                        | Savings Potential    |
|-------------|---------------------------------|----------------------|
| Groq        | Speed-critical, real-time tasks | Up to 90% vs GPT-4   |
| Together AI | Open-source model hosting       | Up to 95% vs GPT-4   |
| Replicate   | Bursty / pay-per-prediction     | Eliminates idle cost |
| Vantage     | AI API cost tracking            | Free up to $2,500/mo |

Why Free?

Because the problem is a tax on builders who don't know better yet. Paywalling the audit would just mean the waste continues. If this tool saves your team money, consider supporting it on Ko-fi — it keeps the free tier free.


Try It

no13thfloor.org — free, no signup, results in under 30 seconds.

Source: github.com/t3riah/no13thfloor

Built by First Principle Dynamics LLC — Miami, FL. Minority-owned. Building tools grounded in first principles thinking.


The 13th floor exists. You just weren't looking for it.
