DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Rightsizing Kubernetes Requests with the In-Place Vertical Pod Autoscaler

Rightsizing Kubernetes Requests with the In-Place Vertical Pod Autoscaler

2
Comments
3 min read
AWS Security Series: AWS Access Key is Compromised. Now What? An Incident Response Playbook.

AWS Security Series: AWS Access Key is Compromised. Now What? An Incident Response Playbook.

Comments
3 min read
Kubernetes Is Not a Container Platform (And That Changes Everything)

Kubernetes Is Not a Container Platform (And That Changes Everything)

Comments
1 min read
What is performance engineering: A Gatling take

What is performance engineering: A Gatling take

Comments
8 min read
Announcing Reliability Delta: Clear, Objective Insight into Whether Your Release Made Your System Better or Worse

Announcing Reliability Delta: Clear, Objective Insight into Whether Your Release Made Your System Better or Worse

Comments
4 min read
What 100+ Production Incidents Taught Me About System Design

What 100+ Production Incidents Taught Me About System Design

9
Comments 5
5 min read
Production Canary Architecture (what actually guarantees zero downtime)

Production Canary Architecture (what actually guarantees zero downtime)

3
Comments
3 min read
Utilizing the Go 1.25 Flight Recorder with tracing middleware

Utilizing the Go 1.25 Flight Recorder with tracing middleware

1
Comments
6 min read
How AI-Powered Observability Actually Changes Life For CIOs

How AI-Powered Observability Actually Changes Life For CIOs

Comments
5 min read
Reverse Proxy en Docker con Nginx y SSL automático

Reverse Proxy en Docker con Nginx y SSL automático

Comments
7 min read
The 23-Minute Rule: Why 'Quick Questions' Are Destroying Your Team's Velocity

The 23-Minute Rule: Why 'Quick Questions' Are Destroying Your Team's Velocity

Comments
3 min read
The Hidden Currency of Tech Leadership: The Resilience Loop

The Hidden Currency of Tech Leadership: The Resilience Loop

Comments
1 min read
Building an Air-gapped Hardened Kubernetes Cluster with Kubespray

Building an Air-gapped Hardened Kubernetes Cluster with Kubespray

Comments
3 min read
Your AI SRE needs better observability, not bigger models.

Your AI SRE needs better observability, not bigger models.

10
Comments
17 min read
Why Log Masking Matters in Kubernetes (and How We Enforced PCI Safety with Fluent Bit)

Why Log Masking Matters in Kubernetes (and How We Enforced PCI Safety with Fluent Bit)

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.