DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
O que realmente quebra em migrações de nuvem em larga escala — Solução !

O que realmente quebra em migrações de nuvem em larga escala — Solução !

Comments
4 min read
LGTM != Production Ready: Why your CI pipeline is missing the most important step

LGTM != Production Ready: Why your CI pipeline is missing the most important step

Comments
3 min read
Rate Limiting: How to Stop Your API From Drowning in Requests

Rate Limiting: How to Stop Your API From Drowning in Requests

Comments
4 min read
On-Call Burnout: What Incident Data Doesn’t Show

On-Call Burnout: What Incident Data Doesn’t Show

5
Comments 2
5 min read
Time-to-Owner in Incident Response: How Platform Teams Cut Escalation Delay

Time-to-Owner in Incident Response: How Platform Teams Cut Escalation Delay

1
Comments
9 min read
When AI Becomes Your On-Call Engineer: The Future of Incident Response

When AI Becomes Your On-Call Engineer: The Future of Incident Response

11
Comments 1
2 min read
Sentrix: An AI SRE Copilot That Debates Its Own Scaling Decisions

Sentrix: An AI SRE Copilot That Debates Its Own Scaling Decisions

1
Comments
2 min read
Why Your Chaos Experiments Are Probably Wasting Time (and How to Fix It)

Why Your Chaos Experiments Are Probably Wasting Time (and How to Fix It)

3
Comments 2
3 min read
Why AI SRE tools don't work (and what we're doing differently)

Why AI SRE tools don't work (and what we're doing differently)

4
Comments 2
4 min read
How a 2% Latency Spike Collapses a 20-Service System and How to Prevent It

How a 2% Latency Spike Collapses a 20-Service System and How to Prevent It

1
Comments
3 min read
Your Retry Config is Wrong (And So Was Mine)

Your Retry Config is Wrong (And So Was Mine)

1
Comments
8 min read
Linux Privileges:Peeling Back the Curtain Of How Linux Really Handles Users, Privileges, and Processes

Linux Privileges:Peeling Back the Curtain Of How Linux Really Handles Users, Privileges, and Processes

4
Comments
5 min read
Observability and Failure Recovery in Distributed Financial Systems: When Correct Systems Still Break

Observability and Failure Recovery in Distributed Financial Systems: When Correct Systems Still Break

1
Comments
5 min read
Throw a Prompt at your IDE and see it get done!

Throw a Prompt at your IDE and see it get done!

2
Comments
1 min read
Docker Monitoring Without a Platform: docker stats + cgroups (DevOps)

Docker Monitoring Without a Platform: docker stats + cgroups (DevOps)

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.