DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

Datadog: Observability Lessons from 50+ AWS Apps

7 min read
Logs, Metrics, and Traces: What They Are and When to Use Each

4 min read
Debugging Microservices Like a Pro: How Trace IDs Saved My Production Incident

1 min read
We built a small calculator that shows how much inventory drift actually costs

1 min read
Observability in GenAI Systems: What to Log, Measure, and Monitor

4 min read
Your Traces Look Fine. Your Revenue Isn’t.

2 min read
Reliability vs Uptime: Why Availability Fails at Scale

3 min read
Observability Isn’t Understanding — Why We Still Don’t Know Our Systems

3 min read
Is Elixir’s Observability Ready for Production? A Guide for Skeptical Engineers

10 min read
Mitigating 'Scraping Shock': Engineering Cost-Aware Data Pipelines

5 min read
How a Missing Trace Led Me to Build a Local Observability Stack

10 min read
LangGraph4j Hooks and OpenTelemetry

3 min read
Incident Response Runbook Template for DevOps

3 min read
Datadog + AWS: Observability Maturity Model 2026

8 min read
From Logs to Insights: How to Adopt OpenTelemetry Collectors Without Breaking Your Existing Infrastructure

4 min read