DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
SRE Deployment Engineer Managing Reliable & Automated Deployments

SRE Deployment Engineer Managing Reliable & Automated Deployments

1
Comments
4 min read
Postmortem: A Importância de uma Análise Estruturada de Incidentes em SRE

Postmortem: A Importância de uma Análise Estruturada de Incidentes em SRE

2
Comments
4 min read
K8s Plugins For Solid Security

K8s Plugins For Solid Security

Comments
2 min read
What are Kata Containers?

What are Kata Containers?

Comments
2 min read
Designing a fault-tolerant etcd cluster on AWS

Designing a fault-tolerant etcd cluster on AWS

8
Comments 1
5 min read
Zero-Downtime Blue-Green Deployment with a Simple 'git pull & bash run.sh' Command

Zero-Downtime Blue-Green Deployment with a Simple 'git pull & bash run.sh' Command

1
Comments
1 min read
Internal Developer Portals: Autonomy, Governance and the Golden Path

Internal Developer Portals: Autonomy, Governance and the Golden Path

1
Comments
15 min read
DynamoDB: Query x Scan! Para de torrar dinheiro usando Scan em produção

DynamoDB: Query x Scan! Para de torrar dinheiro usando Scan em produção

38
Comments 6
4 min read
7 Kubernetes Security Best Practices in 2024

7 Kubernetes Security Best Practices in 2024

6
Comments
3 min read
How to Fix Kubernetes Node Disk Pressure

How to Fix Kubernetes Node Disk Pressure

4
Comments
2 min read
Some of the less-known ping types you should know

Some of the less-known ping types you should know

6
Comments 1
1 min read
How a Pod is Deleted - Behind the Scenes Breakdown

How a Pod is Deleted - Behind the Scenes Breakdown

8
Comments 2
2 min read
How to Set up Disk and Bandwidth Limits in Docker

How to Set up Disk and Bandwidth Limits in Docker

2
Comments
2 min read
How To Fix OOMKilled

How To Fix OOMKilled

1
Comments
2 min read
SRE Culture Embedding Reliability into Engineering Teams

SRE Culture Embedding Reliability into Engineering Teams

Comments
3 min read
Creating an Efficient IT Incident Management Plan: A Guide to Templates and Best Practices

Creating an Efficient IT Incident Management Plan: A Guide to Templates and Best Practices

Comments
7 min read
SLOs and Customer Experience: Uniting Engineering Excellence with Customer Satisfaction

SLOs and Customer Experience: Uniting Engineering Excellence with Customer Satisfaction

Comments
5 min read
SRE and the Enterprise: Building a Culture of Reliability at Scale

SRE and the Enterprise: Building a Culture of Reliability at Scale

Comments
4 min read
SRE vs DevOps: What’s the Difference and Why Does It Matter? 🤓

SRE vs DevOps: What’s the Difference and Why Does It Matter? 🤓

Comments
1 min read
Best Practices for Choosing a Status Page Provider

Best Practices for Choosing a Status Page Provider

Comments
5 min read
How to Define Engineering Standards (with Backstage)

How to Define Engineering Standards (with Backstage)

Comments
10 min read
Introducing Botkube Fuse: The Platform Engineer’s Copilot

Introducing Botkube Fuse: The Platform Engineer’s Copilot

6
Comments
4 min read
Accelerating Business Growth with a Platform Engineering Team

Accelerating Business Growth with a Platform Engineering Team

Comments
5 min read
The Pulse Of Technology: Why IT Monitoring Is Non-Negotiable In 2024

The Pulse Of Technology: Why IT Monitoring Is Non-Negotiable In 2024

Comments
13 min read
How to improve DORA metrics as a release engineer

How to improve DORA metrics as a release engineer

5
Comments
10 min read
loading...