DEV Community

# monitoring

Tag for content related to software monitoring.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Operating Real-Time AI: SLAs, Observability, and Knowing When It's Broken

Prioritizing data age over model quality

Operating Real-Time AI: SLAs, Observability, and Knowing When It's Broken

11
Comments 7
10 min read
Inspect an AI Agent Run Without Paying for Logs You'll Never Read — Telemetry Shouldn't Be Your Second Biggest Bill

Inspect an AI Agent Run Without Paying for Logs You'll Never Read — Telemetry Shouldn't Be Your Second Biggest Bill

6
Comments 3
10 min read
Part 5 — Installing a Black Box Recorder in Your RAG System: 4-Layer Metadata + 3-Level Verification, Root Cause in 5 Minutes

Part 5 — Installing a Black Box Recorder in Your RAG System: 4-Layer Metadata + 3-Level Verification, Root Cause in 5 Minutes

6
Comments
9 min read
Stop Relying Entirely on Uptime Kuma for Incident Response

Stop Relying Entirely on Uptime Kuma for Incident Response

1
Comments 2
6 min read
Building an Application Log Analytics Platform with Amazon S3 Tables: Cost Optimization by Migrating from CloudWatch Logs

Building an Application Log Analytics Platform with Amazon S3 Tables: Cost Optimization by Migrating from CloudWatch Logs

2
Comments
5 min read
Why your GPU reports 75 C while your VRAM is cooking at 105 C – the telemetry gap that kills LLM inference

Why your GPU reports 75 C while your VRAM is cooking at 105 C – the telemetry gap that kills LLM inference

Comments 1
11 min read
I switched on production evals for my LLM app — and they scored nothing

I switched on production evals for my LLM app — and they scored nothing

Comments 1
5 min read
Oncall isn't supposed to be this hard

Oncall isn't supposed to be this hard

4
Comments
5 min read
Grafana Dashboards: Information Density vs Readability

Grafana Dashboards: Information Density vs Readability

Comments
5 min read
I built a self-hosted log search tool for my team

I built a self-hosted log search tool for my team

2
Comments
2 min read
I monitored 11 public MCP servers. Latency ranged 215 (97ms to 21 seconds).

I monitored 11 public MCP servers. Latency ranged 215 (97ms to 21 seconds).

1
Comments 1
2 min read
Laravel Actuator

Laravel Actuator

Comments
3 min read
Cache-hit dispersion is the 7th vendor-risk axis — and the one your invoice can't see

Cache-hit dispersion is the 7th vendor-risk axis — and the one your invoice can't see

Comments
8 min read
Detecting API anomalies behind a 200 OK — with statistics, not AI

Detecting API anomalies behind a 200 OK — with statistics, not AI

Comments 1
3 min read
Sentry SDK 2.x Auto-Integrations Flood Your Inbox — Here's the Filter

Sentry SDK 2.x Auto-Integrations Flood Your Inbox — Here's the Filter

Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.