
Kinan Nasri
Why averages lie: p99 latency is what users actually feel

Most performance dashboards look fine.

Average latency: low

CPU: stable

Memory: healthy

And yet users complain that “the app feels slow”.

This usually isn’t a mystery. It’s a metrics problem.

Averages hide pain

Imagine this system:

  • 99% of requests complete in 10ms
  • 1% of requests take 1000ms

The average latency is ~20ms.

Looks great on a chart.

But 1 in every 100 requests stalls for a full second. And a user who issues many requests in a session will hit that stall far more often than 1% of the time.

That’s not an edge case — that’s a real user.
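The arithmetic is easy to verify. A quick sketch using only Python's standard library, with the hypothetical numbers from the example above:

```python
import statistics

# Hypothetical workload from the example above:
# 99 requests at 10 ms, 1 request at 1000 ms.
latencies_ms = [10.0] * 99 + [1000.0]

mean = statistics.mean(latencies_ms)   # what the dashboard shows
worst = max(latencies_ms)              # what the unlucky user feels

print(f"mean:  {mean} ms")   # 19.9 ms
print(f"worst: {worst} ms")  # 1000.0 ms
```

One slow request in a hundred barely moves the mean, which is exactly why the dashboard stays green.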

Tail latency is the real signal

This is why percentiles matter:

  • p50 tells you what’s typical
  • p90 / p95 show rising variability
  • p99 shows where systems actually break down

If your p99 is bad, your system feels bad — even if averages look perfect.
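Percentiles are cheap to compute yourself. A minimal nearest-rank implementation (the 95/5 dataset below is made up for illustration):

```python
import math

def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample that is
    greater than or equal to p percent of all samples."""
    ordered = sorted(samples)
    rank = math.ceil(p / 100 * len(ordered))
    return ordered[max(rank - 1, 0)]

# Hypothetical mix: 95 fast requests, 5 slow ones.
latencies_ms = [10.0] * 95 + [1000.0] * 5

print(percentile(latencies_ms, 50))  # 10.0   -- typical
print(percentile(latencies_ms, 95))  # 10.0   -- still hides the tail
print(percentile(latencies_ms, 99))  # 1000.0 -- there it is
```

Note how p50 and even p95 report a perfectly healthy 10ms here; only p99 surfaces the slow requests.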

Jitter makes it worse

High variance is often more damaging than consistent slowness.

A stable 40ms system feels faster than one that jumps between 5ms and 200ms.

That variability — jitter — is what makes UIs stutter, audio glitch, and frames drop.
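Standard deviation and MAD (median absolute deviation, its outlier-resistant cousin) both quantify jitter. A sketch with made-up numbers for the two systems described above:

```python
import statistics

def mad(samples):
    """Median absolute deviation: a jitter measure that isn't
    dragged around by a handful of extreme outliers."""
    med = statistics.median(samples)
    return statistics.median([abs(x - med) for x in samples])

stable = [38.0, 40.0, 42.0, 39.0, 41.0]        # hovers around 40 ms
spiky  = [5.0, 200.0, 5.0, 200.0, 5.0, 200.0]  # jumps between 5 and 200 ms

print(statistics.stdev(stable), mad(stable))   # small: calm system
print(statistics.stdev(spiky), mad(spiky))     # large: jittery system
```

Reporting both is useful: standard deviation reacts strongly to single spikes, while MAD describes the typical wobble.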

A small tool to see this clearly

I wanted a way to analyze raw timing data without dashboards, heavy dependencies, or full observability stacks.

So I built Latency Lens — a small CLI tool focused on:

  • p50 / p90 / p95 / p99
  • jitter (standard deviation and MAD)
  • spike detection
  • worst-case time windows

No averages-first thinking. Just tail behavior.
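The "worst-case time windows" idea is easy to sketch independently of the tool. Given (timestamp, latency) pairs, a sliding window finds where the pain clustered. This is an illustration of the concept, not latency-lens's actual implementation; `worst_window` and the sample data are hypothetical:

```python
from collections import deque

def worst_window(samples, width_s=1.0):
    """Scan (timestamp_s, latency_ms) pairs, sorted by timestamp,
    and return (window_start, peak_latency) for the sliding window
    of width_s seconds containing the highest peak latency."""
    window = deque()
    worst_start, worst_peak = None, float("-inf")
    for t, lat in samples:
        window.append((t, lat))
        # Drop samples that fell out of the window ending at t.
        while window[0][0] < t - width_s:
            window.popleft()
        peak = max(l for _, l in window)
        if peak > worst_peak:
            worst_start, worst_peak = window[0][0], peak
    return worst_start, worst_peak

samples = [(0.0, 10.0), (0.5, 12.0), (1.2, 300.0), (1.4, 11.0)]
print(worst_window(samples))  # (0.5, 300.0) -- the second holding the spike
```

Pinpointing *when* the tail happened is often the first step toward finding *why* (a GC pause, a cron job, a cold cache).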

GitHub:

https://github.com/KinanNasri/latency-lens

PyPI:

https://pypi.org/project/latency-lens/

The takeaway

If users say your system feels slow:

  • Don’t look at averages
  • Look at p99
  • Look at variance
  • Look at spikes over time

That’s usually where the truth is.
