DEV Community

# monitoring

Tag for content related to software monitoring.

Posts

Operating Real-Time AI: SLAs, Observability, and Knowing When It's Broken

3
Comments 1
10 min read
Replacing Elasticsearch with ClickHouse : A 90% Cost-Reduction Migration

3
Comments
33 min read
How I Monitor AI Agents: CloudWatch for Infra, Arize Phoenix for Traces and OpenTelemetry, LLM-as-Judge for Quality

3
Comments 1
7 min read
I Tested 7 Self-Hosted Monitoring Tools on a $3 VPS in 2026 (Here's the One I Kept)

1
Comments
5 min read
OpenTelemetry custom spans in .NET: seeing what your code decided

4
Comments
13 min read
Full Observability in Istio: Metrics with Prometheus/Grafana + Distributed Tracing with Jaeger

Comments
5 min read
Kubelet Metrics: How cAdvisor and CRI Collect Kubernetes Stats

2
Comments
31 min read
Building an Application Log Analytics Platform with Amazon S3 Tables: Cost Optimization by Migrating from CloudWatch Logs

2
Comments
5 min read
7 cron expression gotchas that will silently break your scheduled jobs

2
Comments
3 min read
Why your uptime monitor says everything's fine while users see a white screen

3
Comments
6 min read
Production-Grade Observability: Building a Complete LGTM Stack with SLOs, DORA Metrics, and Intelligent Alerting

2
Comments
10 min read
The 8 Grafana panels every Cosmos validator dashboard should have (and most don't)

2
Comments 1
6 min read
Building an Error Monitoring Tool Without Pricing Overages

2
Comments
11 min read
Datadog Alternatives for Small Teams (2026): An Honest Comparison from a Solo Dev

1
Comments
8 min read
GitHub Silently Removed payload.commits From PushEvent — Here's What Broke and How to Catch the Next One

1
Comments
5 min read