I Built a Free AI Model Waste Auditor — No Signup Required

#webdev #python #ai #opensource

TL;DR: I built No 13th Floor — a free tool that audits your AI model usage and tells you exactly how much compute and money you're wasting by running oversized models. No account needed. No credit card. Try it in 30 seconds.

The Problem

Every engineering team I've talked to has the same story: they reached for GPT-4 (or a 70B open-source model) to handle tasks that a 7B model could do just fine. The logic made sense at the time — "why risk it?" — but the bill didn't.

A classification task that needs 3B parameters gets a 70B model. A summarization job that runs perfectly on Mistral 7B gets routed through a GPT-4-class API at 30x the cost. The waste is invisible until it shows up as a cloud bill line item that everyone shrugs at.

This isn't hypothetical. These conversations happen on Reddit, in Slack, in post-mortems:

"We were using GPT-4 for everything... yeah, I know" — r/OpenAI
"AI support costs way higher than expected... wrong model for the task" — r/SaaS
"It felt like overkill to use a 175B+ model just for simple logic" — r/AI_Agents

The pattern is universal: reach for the biggest model, pay the price, trial-and-error to something cheaper. That whole process can be shortcircuited.

What No 13th Floor Does

You describe:

Your use case (what the model is actually doing)
The model you're currently running

The tool returns:

Floor Score — a letter grade (A–F) for your current model efficiency
Monthly waste estimate in USD
Annual projected waste
Oversize ratio — how many times larger your model is than optimal
Recommended architecture — the right-sized model for your workload
Platform matches — curated cost-reduction platforms matched to your specific use case

No account. No credit card. No waiting.

The Stack

Kept it intentionally lean:

Backend: Python / Flask
Frontend: Plain HTML/CSS — no framework
Inference: Groq API (Llama 3 70B for the scoring logic)
Hosting: Oracle Cloud Free Tier
Auth: None — by design

The irony of using a 70B model to tell you when not to use a 70B model isn't lost on me. The scoring engine needs reasoning capability. The output recommendations are about your workload, not ours.

Self-Hosting

The repo is fully open:

git clone https://github.com/t3riah/no13thfloor.git
cd no13thfloor
cp .env.example .env
# Add your GROQ_API_KEY to .env
pip install -r requirements.txt
python app.py

All you need is a free Groq API key.

The Cost-Reduction Platform Stack

Based on your score, the tool recommends right-sized inference platforms:

Platform	Best For	Savings Potential
Groq	Speed-critical, real-time tasks	Up to 90% vs GPT-4
Together AI	Open-source model hosting	Up to 95% vs GPT-4
Replicate	Bursty / pay-per-prediction	Eliminates idle cost
Vantage	AI API cost tracking	Free up to $2,500/mo

Why Free?

Because the problem is a tax on builders who don't know better yet. Paywalling the audit would just mean the waste continues. If this tool saves your team money, consider supporting it on Ko-fi — it keeps the free tier free.