DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Introduce Chaos Platform 2.0 for Azure

Introduce Chaos Platform 2.0 for Azure

7
Comments
2 min read
What Is Nix and Why You Should Use It

What Is Nix and Why You Should Use It

9
Comments
7 min read
How They SRE

How They SRE

8
Comments 1
1 min read
The Key Differences between SLI, SLO, and SLA in SRE

The Key Differences between SLI, SLO, and SLA in SRE

16
Comments
9 min read
The True Cost of Building your Own Incident Management System (IMS)

The True Cost of Building your Own Incident Management System (IMS)

2
Comments
5 min read
Managing health checks at scale

Managing health checks at scale

6
Comments
5 min read
Top Observability tools for DevOps Engineers and SREs

Top Observability tools for DevOps Engineers and SREs

17
Comments
7 min read
From SysAdmin to SRE: How to evolve your skillset

From SysAdmin to SRE: How to evolve your skillset

2
Comments
6 min read
Google Down worldwide | Why is Google Down? Let's break it down

Google Down worldwide | Why is Google Down? Let's break it down

15
Comments
4 min read
How to SRE without an SRE on your team

How to SRE without an SRE on your team

3
Comments
10 min read
Top Open Source projects for SREs and DevOps

Top Open Source projects for SREs and DevOps

2
Comments
7 min read
How small changes to your SLOs can be SMART for your business - A narrative case study

How small changes to your SLOs can be SMART for your business - A narrative case study

5
Comments
11 min read
Creating Chaos and a Giveaway ⚒ 🎁

Creating Chaos and a Giveaway ⚒ 🎁

19
Comments 6
2 min read
LitmusChaos: A Reflection On The Past Six Months

LitmusChaos: A Reflection On The Past Six Months

21
Comments
15 min read
Essence of Terraform

Essence of Terraform

33
Comments 1
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.