DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
5 DevOps Books to Read for FREE

5 DevOps Books to Read for FREE

210
Comments 7
2 min read
4 YouTube Resources to Get Started with Kubernetes

4 YouTube Resources to Get Started with Kubernetes

59
Comments
2 min read
AWS VPC 101

AWS VPC 101

31
Comments
10 min read
Conferences in the Time of COVID-19: Cloud and Infrastructure

Conferences in the Time of COVID-19: Cloud and Infrastructure

8
Comments
3 min read
Monitoring with Prometheus and Grafana

Monitoring with Prometheus and Grafana

11
Comments
10 min read
How to Classify Incidents

How to Classify Incidents

7
Comments
6 min read
Building a Multi-Tenant gRPC Development Platform with Ambassador and AWS EKS

Building a Multi-Tenant gRPC Development Platform with Ambassador and AWS EKS

6
Comments
9 min read
Kafka Chaos Engineering With Litmus

Kafka Chaos Engineering With Litmus

33
Comments
10 min read
Blameless' SRE Journey

Blameless' SRE Journey

8
Comments
8 min read
LitmusChaos in CNCF Sandbox

LitmusChaos in CNCF Sandbox

12
Comments
3 min read
Twitter's Reliability Journey

Twitter's Reliability Journey

6
Comments
6 min read
Top Practices for Runbook Automation

Top Practices for Runbook Automation

16
Comments 1
6 min read
Incident Postmortem Template

Incident Postmortem Template

10
Comments
6 min read
SRE: A Human Approach to Systems

SRE: A Human Approach to Systems

8
Comments
7 min read
Leverage JIRA with Squadcast throughout the incident lifecycle

Leverage JIRA with Squadcast throughout the incident lifecycle

1
Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.