DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

I Built 20 AI-Powered DevOps Tools Because I Got Tired of Doing This Stuff Manually
3 min read
Building an Autonomous SRE Agent: From Raw Telemetry to Safe, AI-Driven Remediation
8 min read
Building GBIM Observability From Correlation IDs to a Populated k6 Dashboard
7 min read
Using the github actions to automate monitoring dashboards
4 min read
Closed-Loop SRE for Kubernetes: Auto-Remediating Pod Crashloops Before the On-Call Pages
6 min read
Designing for Partial Failure: Why 'Everything is Highly Available' Is a Myth
3 min read
What Site Reliability Engineering Actually Is, and Why It's a National Infrastructure Discipline
10 min read
Using Jenkins MCP to speed up DevOps workflows
2 min read
Incident Retrospectives Without Blame
1 min read
Aurora Actions: User-Defined Background Automations for Incident Response
10 min read
Alert Fatigue: The Silent Productivity Killer
1 min read
Why SLIs Matter More Than SLOs
1 min read
The Configuration Drift Discovery During a Drill
4 min read
We list 3 self-host PagerDuty alternatives. None of them are alive. (May 2026)
5 min read
The PagerDuty Migration Playbook
1 min read