DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why a Status Page Should Not Depend on Third-Party CDNs

Why a Status Page Should Not Depend on Third-Party CDNs

1
Comments 2
4 min read
Building a Config Drift Detector for AWS (with Snapshots, Lambdas, and a Next.js Dashboard)

Building a Config Drift Detector for AWS (with Snapshots, Lambdas, and a Next.js Dashboard)

Comments
5 min read
Running Cluster on 100% Spot Instances: How K8s Does It Better Than ECS

Running Cluster on 100% Spot Instances: How K8s Does It Better Than ECS

Comments
4 min read
Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Comments
4 min read
Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Comments
3 min read
How to Make Engineering Knowledge Searchable (A Complete Guide)

How to Make Engineering Knowledge Searchable (A Complete Guide)

1
Comments
3 min read
Shift-Left Reliability

Shift-Left Reliability

Comments
4 min read
How to pass the CKA Exam on the first try [GUARANTEED]

How to pass the CKA Exam on the first try [GUARANTEED]

Comments 1
4 min read
Backpressure, Buffers, and Logging Sidecars

Backpressure, Buffers, and Logging Sidecars

2
Comments
5 min read
Wild Ride from Raw Syscalls to Figuring Out NSS and libc

Wild Ride from Raw Syscalls to Figuring Out NSS and libc

2
Comments 1
4 min read
You’re Running EC2 Instances That Do Nothing

You’re Running EC2 Instances That Do Nothing

1
Comments
2 min read
10 Proven Ways to Cut Your AWS Bill

10 Proven Ways to Cut Your AWS Bill

1
Comments
3 min read
AWS DevOps Agent

AWS DevOps Agent

Comments
4 min read
Why Most DevOps Tutorials Fail in Production Environments

Why Most DevOps Tutorials Fail in Production Environments

Comments
2 min read
Kubernetes Persistence Series Part 3: Controllers & Resilience — Why Kubernetes Self-Heals

Kubernetes Persistence Series Part 3: Controllers & Resilience — Why Kubernetes Self-Heals

8
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.