DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

Kubernetes Node Management - Drain, Cordon and Uncordon

6
Comments
2 min read
Why Use a Status Page Aggregator?

Comments
5 min read
How to Write Effective Incident Post-Mortems: A Complete Guide

6
Comments
6 min read
I Built an AI-Powered CLI to Help Debug Production Incidents | Meet Incident Helper

6
Comments
3 min read
🚀 My First Real K8s Deploy! Getting the Django Notes App Live🎉

2
Comments 1
6 min read
Stop Breaking OpenTofu: These 5 Errors Are Killing Your Deployment

3
Comments
3 min read
No More Surprises: Get Notified on Terraform Deprecations

11
Comments 1
3 min read
🧰 Mastering `map()` and `tolist()` in Terraform: Real Use Cases & Examples

4
Comments
2 min read
Troubleshoot Container OOM Kills with eBPF

12
Comments 4
11 min read
Alarm Suppression is Not Root Cause Analysis

Comments
6 min read
10 kubectl Plugins That Help Make You the Most Valuable Kubernetes Engineer in the Room

35
Comments 2
12 min read
🔁 Rollback in DevOps: Why Every Deployment Needs a Safety Net

6
Comments 2
5 min read
3 Types of Chaos Experiments and How To Run Them

2
Comments
9 min read
DevOps vs SRE: Detailed Comparison

1
Comments
3 min read
Platform Engineering vs Site reliability Engineering (SRE)

1
Comments
3 min read