DEV Community 👩‍💻👨‍💻

Site Reliability Engineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
SRE and Tasks of an SRE explained âś…

SRE and Tasks of an SRE explained âś…

Reactions 83 Comments 1
13 min read
Understanding the Business as a Devops Engineer

Understanding the Business as a Devops Engineer

Reactions 12 Comments
4 min read
#90DaysOfDevOps - Day 4

#90DaysOfDevOps - Day 4

Reactions 2 Comments
4 min read
What is DevOps? REALLY understand it

What is DevOps? REALLY understand it

Reactions 257 Comments 4
12 min read
Engineer On-Call: The Dos and Don'ts

Engineer On-Call: The Dos and Don'ts

Reactions 3 Comments
3 min read
How-to setup a HA/DR database in AWS? [4 - HA Database]

How-to setup a HA/DR database in AWS? [4 - HA Database]

Reactions 6 Comments
4 min read
How-to setup a HA/DR database in AWS? [7 - Dynamic Terraform backend definition]

How-to setup a HA/DR database in AWS? [7 - Dynamic Terraform backend definition]

Reactions 6 Comments
2 min read
How-to setup a HA/DR database in AWS? [5 - DR database]

How-to setup a HA/DR database in AWS? [5 - DR database]

Reactions 6 Comments
3 min read
How-to setup a HA/DR database in AWS? [3 - Simple database]

How-to setup a HA/DR database in AWS? [3 - Simple database]

Reactions 6 Comments
3 min read
How-to setup a HA/DR database in AWS? [8 - Multiple instances in multiple regions]

How-to setup a HA/DR database in AWS? [8 - Multiple instances in multiple regions]

Reactions 6 Comments
2 min read
How-to setup a HA/DR database in AWS? [6 - Create from snapshot]

How-to setup a HA/DR database in AWS? [6 - Create from snapshot]

Reactions 6 Comments
2 min read
How-to setup a HA/DR database in AWS? [9 - Generate a random value]

How-to setup a HA/DR database in AWS? [9 - Generate a random value]

Reactions 6 Comments
3 min read
How-to setup a HA/DR database in AWS? [1]

How-to setup a HA/DR database in AWS? [1]

Reactions 4 Comments
3 min read
The Universal Language: Reliability for Non-Engineering Teams

The Universal Language: Reliability for Non-Engineering Teams

Reactions 4 Comments
7 min read
How-to setup a HA/DR database in AWS? [2 - Definitions]

How-to setup a HA/DR database in AWS? [2 - Definitions]

Reactions 2 Comments
4 min read
Choosing a database instance class in AWS with the maximum simultaneous connexions

Choosing a database instance class in AWS with the maximum simultaneous connexions

Reactions 2 Comments
2 min read
Building an SRE Team with Specialization

Building an SRE Team with Specialization

Reactions 4 Comments
7 min read
What happens when Amazon accidentally sends all of their support traffic your way?

What happens when Amazon accidentally sends all of their support traffic your way?

Reactions 28 Comments 2
3 min read
How Disaster Ready Are Your Backup Systems, Really?

How Disaster Ready Are Your Backup Systems, Really?

Reactions 2 Comments
6 min read
DevOps - Deployment strategies

DevOps - Deployment strategies

Reactions 5 Comments
6 min read
#90DaysOfDevOps - Day 3

#90DaysOfDevOps - Day 3

Reactions 2 Comments
5 min read
#90DaysOfDevOps - Day 1

#90DaysOfDevOps - Day 1

Reactions 24 Comments 4
4 min read
Fylamynt and Squadcast Team Up To Handle Cloud Incident Response, Management, and Remediation

Fylamynt and Squadcast Team Up To Handle Cloud Incident Response, Management, and Remediation

Reactions 5 Comments
4 min read
Some DevOps Terms definitions

Some DevOps Terms definitions

Reactions 8 Comments
4 min read
Como criar uma função personalizada para RBAC

Como criar uma função personalizada para RBAC

Reactions 6 Comments
4 min read
Machine Learning for Anomaly Detection: Decreasing Time to Find Root Cause by Automating Log Analysis

Machine Learning for Anomaly Detection: Decreasing Time to Find Root Cause by Automating Log Analysis

Reactions 3 Comments
7 min read
Circumvent STDIN when installing packages with apt

Circumvent STDIN when installing packages with apt

Reactions 4 Comments
2 min read
How to Write Meaningful Retrospectives

How to Write Meaningful Retrospectives

Reactions 2 Comments
6 min read
Hosting and Scaling Applications

Hosting and Scaling Applications

Reactions 3 Comments
3 min read
Starting an SRE Team? Stay Away From Uptime.

Starting an SRE Team? Stay Away From Uptime.

Reactions 8 Comments 2
5 min read
Solving the Diamond Problem with a Spacelift Trigger policy

Solving the Diamond Problem with a Spacelift Trigger policy

Reactions 12 Comments
4 min read
How important is Observability for SRE?

How important is Observability for SRE?

Reactions 4 Comments 2
6 min read
Post-mortem: Kubernetes pods don't start because of too many services

Post-mortem: Kubernetes pods don't start because of too many services

Reactions 5 Comments
3 min read
Keeping the Stakes Low while Breaking Production

Keeping the Stakes Low while Breaking Production

Reactions 27 Comments 5
4 min read
Implementing Graceful Shutdown in Go

Implementing Graceful Shutdown in Go

Reactions 15 Comments 5
14 min read
What You Need to Break into DevOps and SRE

What You Need to Break into DevOps and SRE

Reactions 64 Comments
3 min read
Don't panic when using CLI

Don't panic when using CLI

Reactions 7 Comments
2 min read
Virtual Webinar on 'Reliability Reimagined: How SREs spearhead competitive CX'

Virtual Webinar on 'Reliability Reimagined: How SREs spearhead competitive CX'

Reactions 6 Comments
1 min read
DevOps & SRE Words Matter: How Our Language has Evolved

DevOps & SRE Words Matter: How Our Language has Evolved

Reactions 8 Comments 2
6 min read
Understanding DevOps

Understanding DevOps

Reactions 11 Comments
4 min read
Moving large amounts of data on AWS

Moving large amounts of data on AWS

Reactions 7 Comments
5 min read
How to improve your influence as an SRE

How to improve your influence as an SRE

Reactions 2 Comments
7 min read
Incident Remediation With Jenkins and Terraform

Incident Remediation With Jenkins and Terraform

Reactions 15 Comments
3 min read
Differences between Site Reliability Engineer Vs. Software Engineer Vs. Cloud Engineer Vs. DevOps Engineer

Differences between Site Reliability Engineer Vs. Software Engineer Vs. Cloud Engineer Vs. DevOps Engineer

Reactions 13 Comments
7 min read
Application Performance Monitoring For SREs

Application Performance Monitoring For SREs

Reactions 5 Comments
3 min read
Preventing Alert Fatigue

Preventing Alert Fatigue

Reactions 2 Comments 1
4 min read
DevOps Horror Stories to Slow Development and Freeze Operations

DevOps Horror Stories to Slow Development and Freeze Operations

Reactions 3 Comments
4 min read
React faster: Forward Prometheus Alerts to Teams

React faster: Forward Prometheus Alerts to Teams

Reactions 3 Comments
3 min read
IR - Incident Response, Repair, Resolution or Remediation?

IR - Incident Response, Repair, Resolution or Remediation?

Reactions 10 Comments 1
2 min read
Terraform tips for newcomers

Terraform tips for newcomers

Reactions 5 Comments
1 min read
From Ad-hoc Scripting to Workflow as Code: The Evolution of Runbooks

From Ad-hoc Scripting to Workflow as Code: The Evolution of Runbooks

Reactions 16 Comments
2 min read
What is SRE (Site Reliability Engineering)?

What is SRE (Site Reliability Engineering)?

Reactions 12 Comments
3 min read
Can I Automate Away SRE Roles?

Can I Automate Away SRE Roles?

Reactions 10 Comments
2 min read
SRE vs DevOps

SRE vs DevOps

Reactions 7 Comments
2 min read
Incident Response vs. Incident Managment

Incident Response vs. Incident Managment

Reactions 9 Comments
2 min read
A Comparison of SRE Workflow Tools

A Comparison of SRE Workflow Tools

Reactions 13 Comments
4 min read
Efficient On-Call Practices For SREs

Efficient On-Call Practices For SREs

Reactions 2 Comments
5 min read
My thoughts on the HashiCorp Infrastructure Automation Certification

My thoughts on the HashiCorp Infrastructure Automation Certification

Reactions 3 Comments 1
2 min read
EKS - Disk configuration

EKS - Disk configuration

Reactions 4 Comments
1 min read
Testing Terraform The Right Way

Testing Terraform The Right Way

Reactions 9 Comments 1
3 min read
loading...