DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Two KubeCons, One Conference: While Everyone Demos AI Agents, Engineers Are Fighting With Syslogs

Two KubeCons, One Conference: While Everyone Demos AI Agents, Engineers Are Fighting With Syslogs

Comments
7 min read
🎄 On the First Day of Debugging: The Twelve Characters of Christmas

🎄 On the First Day of Debugging: The Twelve Characters of Christmas

Comments
9 min read
Part 9 — Operating the gateway: logs, traces, health, and degraded mode

Part 9 — Operating the gateway: logs, traces, health, and degraded mode

Comments
9 min read
Embedding Drift Detection: A 50-Line Monitor for Production RAG

Embedding Drift Detection: A 50-Line Monitor for Production RAG

Comments
6 min read
Tool-Result Truncation: The Silent Bug That Makes Agents Lie

Tool-Result Truncation: The Silent Bug That Makes Agents Lie

Comments
8 min read
LLM Observability Audit: 32% Error Rate, 720K-Token Bug, and One $1.11 Call

LLM Observability Audit: 32% Error Rate, 720K-Token Bug, and One $1.11 Call

Comments
7 min read
AI-Augmented SRE: Where It Earns Its Keep, And Where It Doesn't

AI-Augmented SRE: Where It Earns Its Keep, And Where It Doesn't

Comments
5 min read
OpenTelemetry in TypeScript: Trace Your Hono Service in 50 Lines

OpenTelemetry in TypeScript: Trace Your Hono Service in 50 Lines

1
Comments
12 min read
What changed in Iris v0.4.0

What changed in Iris v0.4.0

Comments
6 min read
Three Tools, Three Layers: Sentry, Langfuse, and LangGraph for Multi-Agent Fleets

Three Tools, Three Layers: Sentry, Langfuse, and LangGraph for Multi-Agent Fleets

Comments
7 min read
Integrating ServiceNow Incident Management with Elastic AI Agents

Integrating ServiceNow Incident Management with Elastic AI Agents

Comments
6 min read
Zero-config Golang Heap Profiling

Zero-config Golang Heap Profiling

Comments
10 min read
When Monitoring Becomes “Wrong”: The Limits of Watching Only Ping and Disk in Zabbix

When Monitoring Becomes “Wrong”: The Limits of Watching Only Ping and Disk in Zabbix

Comments
3 min read
Real-Time Monitoring for AI Agents: Beyond Log Streaming

Real-Time Monitoring for AI Agents: Beyond Log Streaming

Comments
1 min read
GPU Utilization Is a Counter, Not a Cause

GPU Utilization Is a Counter, Not a Cause

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.