DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

Measuring What Matters: Adding Multiple Dimension Sets to AWS Lambda Powertools

4 min read
Why Core-Aware Logging Matters: The Architecture Behind LHOS_LOGx

2 min read
Why your system can be 100% up and still completely broken

2 comments
2 min read
The Tiny Struct That Boots Grafana

10 min read
Gonzo: An Open-Source Terminal UI That's Changing How I Analyze Logs

3 min read
Turning block/goose into an AI SRE Agent

3 min read
Datadog vs OneUptime vs OptyxStack – Understanding the Differences in Observability and Operations

2 min read
Sleep Tight, Cluster Right: Stop Burning Cash at 3 AM

2 min read
All I Want for Christmas is Observable Multi-Modal Agentic Systems

8 min read
Your Audit Logs Are Lying to You: 6 Properties That Make Logs Actually Verifiable

6 min read
How I Built Real-Time Dashboards for Claude Code Metrics with OTEL, Prometheus, and Grafana

3 min read
LLM evaluation guide: When to add online evals to your AI application

5 min read
From Logs to Insights: How to Adopt OpenTelemetry Collectors Without Breaking Your Existing Infrastructure

4 min read
Your Observability Stack Is Optimized for the Wrong Thing

8 min read
Incident Response Runbook Template for DevOps

3 min read