DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

Rightsizing Kubernetes Requests with the In-Place Vertical Pod Autoscaler

2
Comments
3 min read
The Limitations of Text Embeddings in RAG Applications: A Deep Engineering Dive

Comments
19 min read
AWS Security Series: AWS Access Key is Compromised. Now What? An Incident Response Playbook.

Comments
3 min read
Kubernetes Is Not a Container Platform (And That Changes Everything)

Comments
1 min read
Project: One App — Three Probes — Real Failures

1
Comments
3 min read
What is performance engineering: A Gatling take

Comments
8 min read
Top APM Tools in 2026: What Every Developer and Engineering Team Should Know

Comments
4 min read
Announcing Reliability Delta: Clear, Objective Insight into Whether Your Release Made Your System Better or Worse

Comments
4 min read
What 100+ Production Incidents Taught Me About System Design

9
Comments 5
5 min read
Production Canary Architecture (what actually guarantees zero downtime)

3
Comments
3 min read
Utilizing the Go 1.25 Flight Recorder with tracing middleware

1
Comments
6 min read
How AI-Powered Observability Actually Changes Life For CIOs

Comments
5 min read
Reverse Proxy in Docker with Nginx and Automatic SSL

Comments
7 min read
The 23-Minute Rule: Why 'Quick Questions' Are Destroying Your Team's Velocity

Comments
3 min read
The Hidden Currency of Tech Leadership: The Resilience Loop

Comments
1 min read