DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

The Resilience Playbook: 23 Strategies for Bulletproof Applications 🚀

Comments
4 min read
DSA Won’t Save You in Production

Comments
2 min read
Automating DNS with ExternalDNS on EKS and Istio: Lessons From Real-World Gotchas

Comments
4 min read
🔮 A New Way to Make Programming Accessible: Dive into the Magical World of Grand Père Kernel

1
Comments
2 min read
The Human-in-the-Loop Factor: Partnering With Amazon Q During a Production Incident

2
Comments
11 min read
Unlocking Site Reliability Engineering Tools for DevOps Incident Management

Comments
4 min read
Build a Node.js app in Replit & use S3 for static web hosting served through a CDN

Comments
2 min read
WTF is Site Reliability Engineering?

1
Comments
3 min read
ComunicaOps: Building Foundations for Platform Construction

3
Comments
2 min read
Blue/Green and Canary on Kubernetes with Argo Rollouts [Lab Session]

15
Comments
11 min read
Why Platform Engineering? A Tale from a Busy Kitchen

Comments
1 min read
Unboxing Terraform Internals – Part 1: The Big Picture

Comments
5 min read
Orchestrating end-to-end service deployment using TypeScript workflows

4
Comments
2 min read
Build C Projects Like a Pro: A Guide to Idiomatic Makefiles

1
Comments 2
7 min read
I Built an AI-Powered CLI to Help Debug Production Incidents | Meet Incident Helper

1
Comments
3 min read
Amazon API Gateway Observability Best Practices with Datadog

1
Comments
4 min read
HashiCorp Nomad vs. Kubernetes: Understanding the Workload Orchestrator with Practical Examples

Comments
1 min read
Chaos Engineering in Production: Building Resilient Systems with Chaos Mesh

Comments
1 min read
When APIs Fail: A Developer's Journey with Retries, Back Off, and Jitter

2
Comments
11 min read
OpenTofu CI/CD Guide: How to Automate Infrastructure Changes with Confidence

2
Comments
3 min read
Cost-Tracking and Model-Spend Monitoring with LiteLLM

1
Comments
2 min read
Unleashing Resilience: 15+ Essential Chaos Engineering Tools for Robust Systems

Comments
6 min read
AI-Powered Kubernetes Debugging with Python and Ollama

Comments
6 min read
Understanding `kube-system` in Kubernetes: A City Analogy You’ll Never Forget

5
Comments
2 min read
Top 15 Must-Have CI/CD Tools for DevOps & SRE Success

Comments
6 min read