DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Observability and Failure Recovery in Distributed Financial Systems: When Correct Systems Still Break

Observability and Failure Recovery in Distributed Financial Systems: When Correct Systems Still Break

1
Comments
5 min read
Throw a Prompt at your IDE and see it get done!

Throw a Prompt at your IDE and see it get done!

3
Comments
1 min read
Docker Monitoring Without a Platform: docker stats + cgroups (DevOps)

Docker Monitoring Without a Platform: docker stats + cgroups (DevOps)

Comments
3 min read
Chapter 4: GitOps with Terraform + ArgoCD — Self-Hosting LLMs as a Platform Product

Chapter 4: GitOps with Terraform + ArgoCD — Self-Hosting LLMs as a Platform Product

2
Comments
28 min read
Introducing the Zen of DevOps

Introducing the Zen of DevOps

16
Comments
7 min read
Systems That Don’t Gaslight You: Engineering for Clarity Under Failure

Systems That Don’t Gaslight You: Engineering for Clarity Under Failure

1
Comments
5 min read
Complexity Is a Liability (Until It Isn't)

Complexity Is a Liability (Until It Isn't)

1
Comments
12 min read
Automation Scales Decisions, Not Understanding

Automation Scales Decisions, Not Understanding

1
Comments
9 min read
Documentation That Works When Everything Breaks

Documentation That Works When Everything Breaks

1
Comments
5 min read
The Architecture Drift Nobody Measures

The Architecture Drift Nobody Measures

2
Comments 2
9 min read
Reliability Is a Socio-Technical Problem

Reliability Is a Socio-Technical Problem

1
Comments
11 min read
When Asynchronous Systems Fail Quietly, Reliability Teams Pay the Price

When Asynchronous Systems Fail Quietly, Reliability Teams Pay the Price

Comments
5 min read
The Most Expensive Kubernetes Mistake: Memory Limits

The Most Expensive Kubernetes Mistake: Memory Limits

1
Comments 2
3 min read
Trust Is a Feature You Can Break

Trust Is a Feature You Can Break

1
Comments
5 min read
What a 60-second war-room scan reveals

What a 60-second war-room scan reveals

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.