DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Go Circuit Breakers That Fail Friendly: The 94% Cascade Prevention We Measured

Go Circuit Breakers That Fail Friendly: The 94% Cascade Prevention We Measured

Comments
13 min read
How to Compute Zero Trust Effectiveness: Four Metrics That Survive a Breach

How to Compute Zero Trust Effectiveness: Four Metrics That Survive a Breach

Comments
5 min read
MCP in Production Reality vs the Spec

MCP in Production Reality vs the Spec

Comments
3 min read
RAG vs MCP is the wrong debate — here's the right framing for production AI systems

RAG vs MCP is the wrong debate — here's the right framing for production AI systems

Comments
4 min read
The Context Window Is RAM — Why Your Agent's SLIs Are Telling You It's Full

The Context Window Is RAM — Why Your Agent's SLIs Are Telling You It's Full

3
Comments
5 min read
“But it worked on my machine.”

“But it worked on my machine.”

Comments
1 min read
How I Created a DDoS Protection Engine

How I Created a DDoS Protection Engine

Comments
11 min read
Why P95 Latency Is the Only Metric That Matters at 3 AM

Why P95 Latency Is the Only Metric That Matters at 3 AM

Comments
4 min read
AI agents don’t need more autonomy. They need route, boundary, and receipt.

AI agents don’t need more autonomy. They need route, boundary, and receipt.

3
Comments
3 min read
I built a reference site for the recurring hard parts of software work

I built a reference site for the recurring hard parts of software work

Comments
2 min read
AI Ops Agents Are a New Class of Attack Surface

AI Ops Agents Are a New Class of Attack Surface

Comments
7 min read
ObserveX: Building a Centralized Observability Platform for Modern Infrastructure

ObserveX: Building a Centralized Observability Platform for Modern Infrastructure

Comments
12 min read
Service Level Objectives for Complex Microservices

Service Level Objectives for Complex Microservices

Comments
3 min read
Building a Unified Operational Timeline for Multi-Tenant OpenStack Environments

Building a Unified Operational Timeline for Multi-Tenant OpenStack Environments

3
Comments
3 min read
Flip the Axis: A Layer-Based Approach to Multi-Service Migrations

Flip the Axis: A Layer-Based Approach to Multi-Service Migrations

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.