VivekLumbhani

Posted on Nov 16

"I Built an IoT Dashboard That Could Kill Someone (A Story About Real-Time Data)"

#iot #mongodb #webdev #javascript

The Message That Changed Everything
"Vivek, we need to talk about the temperature alerts."

It was my second month at the IoT company. My manager's tone was...
concerning.

"The HVAC system at the pharmaceutical warehouse failed last night.
Temperature hit 28°C. They lost £50,000 worth of temperature-sensitive
medication."

My stomach dropped.

"Did our system send an alert?"

"Yes. Seventeen minutes after the threshold was breached."

That's when I realized: I had built a "real-time" dashboard that
wasn't actually real-time. And in some industries, seventeen minutes
isn't just inconvenient.

It's catastrophic.

What "Real-Time" Actually Means (Spoiler: Not What I Thought)
When I started building our IoT monitoring dashboard, I thought I
understood "real-time."

Refresh the page, get new data. Maybe poll every 30 seconds. That's
real-time, right?

Wrong.

Here's what I learned the hard way:

Real-Time Categories:

Hard Real-Time (Life or Death)
• Medical devices, aircraft systems, industrial safety
• Deadline miss = catastrophic failure
• Response time: Milliseconds to seconds
• Our pharmaceutical warehouse? This category.
Soft Real-Time (Business Critical)
• Financial trading, live sports scores, ride-sharing
• Deadline miss = degraded service, unhappy users
• Response time: Seconds to minutes
• Our regular building monitoring? This category.
Near Real-Time (User Convenience)
• Social media feeds, weather updates, analytics dashboards
• Deadline miss = minor inconvenience
• Response time: Minutes acceptable
• What I had accidentally built.

I had designed a system for category 3 when I needed category 1.

The Architecture I Built (That Almost Failed)
Let me show you what I initially built. It seemed fine in development:

┌─────────────┐ ┌──────────────┐ ┌─────────────┐
│ IoT Devices │────────▶│ Node.js │────────▶│ MongoDB │
│ (500+) │ MQTT │ Backend │ │ Database │
└─────────────┘ └──────────────┘ └─────────────┘
│
│ HTTP Polling
│ (every 30 seconds)
▼
┌──────────────┐
│ React │
│ Dashboard │
└──────────────┘

The flow:

IoT device sends sensor reading via MQTT
Backend receives, validates, stores in MongoDB
Frontend polls API every 30 seconds
If temperature exceeds threshold, show alert

Seems reasonable, right?

Here's the problem:

Worst case latency:

Device sends reading: 0 seconds
MQTT transmission: 1-2 seconds
Backend processing: 1-2 seconds
Database write: 0.5 seconds
Waiting for next poll: 0-30 seconds (average 15s)
Frontend processing: 0.5 seconds

Total: 18-36 seconds between event and user notification.

In our pharmaceutical warehouse case, it took 17 minutes because:

The alert happened at 2:47 AM
No one had the dashboard open
Email alerts were queued and delayed
By the time someone checked, it was too late After the incident, we had an emergency meeting.

The client was (understandably) furious. The facilities manager
showed us photos of ruined medication. We're talking insulin,
vaccines, biologics - stuff that MUST stay cold.

"Your system is supposed to prevent this," he said. "We paid for
real-time monitoring. If the temperature goes above 8°C, someone
needs to know IMMEDIATELY. Not in fifteen minutes. Not in five
minutes. IMMEDIATELY."

He was right.

We had sold them a "real-time monitoring system" but delivered
something that was... delayed-time? Near-time? Definitely-not-
when-it-mattered-time.

I spent that night redesigning the entire system.

The Architecture That Actually Works
Here's what I built to fix it:

┌─────────────┐ ┌──────────────┐ ┌─────────────┐
│ IoT Devices │────────▶│ Node.js │────────▶│ MongoDB │
│ (500+) │ MQTT │ Backend │ │ Database │
└─────────────┘ └──────────────┘ └─────────────┘
│
│ WebSocket
│ (persistent connection)
▼
┌──────────────┐
│ React │
│ Dashboard │
└──────────────┘
│
│ Push Notifications
▼
┌──────────────┐
│ Mobile │
│ App + SMS │
└──────────────┘

Key changes:

WebSocket Connection (Not Polling)
• Persistent bidirectional connection
• Server pushes data instantly when available
• No waiting for next poll cycle
In-Memory Alert Processing
• Critical alerts bypass database queue
• Processed in Node.js event loop
• Sub-second detection
Multi-Channel Notifications
• WebSocket to dashboard (instant)
• Push notifications to mobile app (2-3 seconds)
• SMS for critical alerts (5-10 seconds)
• Email as backup (30-60 seconds)
Redundant Monitoring
• Multiple backend instances
• Load balancer with health checks
• Failover to backup notification service

New latency:

Device sends reading: 0 seconds
MQTT transmission: 1-2 seconds
Backend processing + alert check: 0.1 seconds
WebSocket push: 0.1 seconds
Dashboard update: 0.1 seconds

Total: 1.5-2.5 seconds. Every. Single. Time.

The Results (And Why This Matters)
After implementing the WebSocket-based system:

Metrics:

Alert latency: 17 minutes → 2 seconds (99.8% improvement)
Dashboard update frequency: 30 seconds → real-time
Client satisfaction: Angry → Happy
My stress levels: Through the roof → Manageable

Real Impact:

Three months after the redesign, we had another HVAC failure at
the same warehouse. This time:

Temperature exceeded threshold at 3:42 AM
Alert reached facilities manager's phone at 3:42:03 AM (3 seconds later)
He was able to respond immediately
Backup cooling activated within 8 minutes
No medication lost

The facilities manager called me personally to say thank you.

That moment made every late night debugging WebSocket connections
worth it.

What I Learned About "Real-Time"

"Real-time" isn't a technical feature - it's a requirement

Not all systems need true real-time, but when they do, it's not
negotiable. Ask yourself: What happens if this alert is delayed
by 10 seconds? 1 minute? 10 minutes?

If the answer is "financial loss" or "safety risk", you need
real real-time.

Polling is a trap for low-frequency updates

30-second polling seems fine until:

You need sub-second updates
You have 500+ clients polling simultaneously
Your database can't handle the load
Something critical happens between polls

WebSockets aren't scary (but they are different)

Coming from REST APIs, WebSockets felt alien. But for real-time
data, they're essential:

Persistent connection = instant updates
Bidirectional = server can push
Lower latency than polling
More efficient at scale

Have a backup plan for critical alerts

Our multi-channel approach saved us:

WebSocket fails? → Mobile push notification
Mobile app crashed? → SMS
SMS delayed? → Email
Everything fails? → Automated phone call

When it's critical, redundancy isn't overkill.

Test failure scenarios obsessively

We built a "chaos testing" system:

Randomly disconnect clients
Simulate network delays
Kill backend servers
Overflow the message queue

Every failure we discovered in testing was one we didn't face in
production with real medication at stake.

The Checklist: Do You Need Real Real-Time?
Ask yourself:

□ Are you building safety-critical systems?
(Medical, industrial, infrastructure)

□ Is financial loss possible from delayed data?
(Trading, fraud detection, inventory)

□ Do users expect instant updates?
(Collaboration tools, live events, gaming)

□ Are you monitoring critical infrastructure?
(Servers, IoT devices, security systems)

□ Could someone be harmed by delayed alerts?
(Temperature, pressure, access control)

If you checked even ONE box, stop polling and implement proper
real-time updates.

DEV Community

"I Built an IoT Dashboard That Could Kill Someone (A Story About Real-Time Data)"

Top comments (0)