DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why your developers hate your internal tooling (and how to fix it)

Why your developers hate your internal tooling (and how to fix it)

Comments
2 min read
PORT VS SOCKET

PORT VS SOCKET

2
Comments
3 min read
Debugging Missing Kubernetes Events: A Deep Dive into the Event Spam Filter

Debugging Missing Kubernetes Events: A Deep Dive into the Event Spam Filter

Comments
3 min read
Your Identity System Is Your Biggest Single Point of Failure

Your Identity System Is Your Biggest Single Point of Failure

1
Comments
5 min read
Context Switching Between DevOps Tools Is Costing You More Than You Think

Context Switching Between DevOps Tools Is Costing You More Than You Think

2
Comments
3 min read
Multi-Cloud Cascading Failure Risks: Why Active-Active is a Trap

Multi-Cloud Cascading Failure Risks: Why Active-Active is a Trap

1
Comments
4 min read
OpenSRM: An Open Specification for Service Reliability

OpenSRM: An Open Specification for Service Reliability

5
Comments
6 min read
DevOps com IA: Quem Está no Controle do Pipeline?

DevOps com IA: Quem Está no Controle do Pipeline?

Comments
13 min read
Building a Personal Expense Tracker with OpenTelemetry and CI/CD

Building a Personal Expense Tracker with OpenTelemetry and CI/CD

2
Comments
3 min read
Kubernetes Operators: A Deep Dive into the Internals

Kubernetes Operators: A Deep Dive into the Internals

2
Comments
19 min read
Assumptions Do

Assumptions Do

1
Comments
9 min read
Building Reliable Software: Planning for Things to Break

Building Reliable Software: Planning for Things to Break

Comments
8 min read
How I Troubleshot a KVM Memory Issue That Led to Swap & High CPU (Runbook + Real Scenario)

How I Troubleshot a KVM Memory Issue That Led to Swap & High CPU (Runbook + Real Scenario)

2
Comments
3 min read
Rotating Residential Proxy Evaluation Mini-Lab You Can Run in 90 Minutes

Rotating Residential Proxy Evaluation Mini-Lab You Can Run in 90 Minutes

Comments
6 min read
Build an AI Code Review Agent in GitHub Actions (That Actually Reduces Incidents

Build an AI Code Review Agent in GitHub Actions (That Actually Reduces Incidents

6
Comments 4
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.