DEV Community

Site Reliability Engineering

How does deployment work at your organization?

Reactions 70 Comments 73
1 min read
go apps + jaeger tracing

Reactions 8 Comments 2
1 min read
What you can show on your status page

Reactions 4 Comments
4 min read
April Fools and the Broken Promises of One-off Hacks

Reactions 128 Comments 8
4 min read
DevOps Engineer vs. SRE?

Reactions 10 Comments 6
1 min read
Ask DEV: LightWeight APM for Kubernetes using OpenTelemetry?

Reactions 5 Comments
2 min read
Incident Response in the time of Remote Work

Reactions 8 Comments
7 min read
Dreams and Nightmares of Ops

Reactions 33 Comments 2
10 min read
Have you considered Site Reliability Engineering as a path?

Reactions 66 Comments 12
1 min read
Towards Operational Excellence — Part 2

Reactions 7 Comments
11 min read
Towards Operational Excellence — Part 3

Reactions 7 Comments
11 min read
SRE in layman’s terms (4 core concepts)

Reactions 5 Comments
4 min read
⁉ Why I started developing 💡 my new software project by building a 🚀 Continuous Deployment 🔃 pipeline

Reactions 7 Comments 1
7 min read
List of DevOps/SRE Conferences in 2020

Reactions 6 Comments 1
1 min read
Deploy an Angular App Using Google Cloud Run

Reactions 9 Comments 4
4 min read
How does your team handle critical production errors?

Reactions 9 Comments 5
1 min read
Folks, what are some conferences in DevOps/SRE space that you look forward to?

Reactions 7 Comments 1
1 min read
Towards Operational Excellence — Part 1

Reactions 20 Comments
10 min read
Molly Struve had a long winding journey to SRE... and other things I learned recording her DevJourney

Reactions 5 Comments
3 min read
Beyond Blameless

Reactions 9 Comments
6 min read
DevOps vs. Site Reliability Engineering (SRE)

Reactions 50 Comments
31 min read
SLOs with Stackdriver Service Monitoring

Reactions 7 Comments
8 min read
A Team Without A Manager: Working Within A Dysfunctional Team

Reactions 14 Comments
4 min read
Resources to learn about DevOps cultural concepts and some tools

Reactions 6 Comments
1 min read
The Night Before Code Freeze

Reactions 51 Comments 1
4 min read
How To Get AWS Lambda Logs Into CloudWatch

Reactions 8 Comments
6 min read
Rapid Docker on AWS: How to monitor the application?

Reactions 10 Comments
4 min read
Becoming a Site Reliability Engineer (SRE)

Reactions 14 Comments
14 min read
DevOps vs. SRE? 4 Important Differences

Reactions 18 Comments
8 min read
Devops Week News - Issue #158

Reactions 4 Comments
1 min read
Introduction to open source observability tools on Kubernetes

Reactions 7 Comments
1 min read
How ITIL4 and SRE align with DevOps

Reactions 11 Comments
4 min read
What Is a Site Reliability Engineer? Should You Become One?

Reactions 11 Comments
10 min read
SLI, SLO, and SLA

Reactions 11 Comments
2 min read
Managing CNAMEs with Azure Resource Manager Templates

Reactions 25 Comments
3 min read
Using the Azure Portal to Check Configured Privileges

Reactions 8 Comments
1 min read
How to troubleshoot potential DOS attacks

Reactions 16 Comments
5 min read
Making On-Call Not Suck

Reactions 122 Comments 17
7 min read
Switching From Resque to Sidekiq

Reactions 70 Comments 6
7 min read
Minimal Monitoring for Production Services

Reactions 15 Comments
4 min read
For the Love of Bleep! Building a Scalable Monitoring System

Reactions 140 Comments 12
6 min read
Three quick tips when setting up a new node with Chef Infra!

Reactions 7 Comments
2 min read
Testing Infrastructure at ✨ Corp, a DevOps Story

Reactions 20 Comments 2
6 min read
Building Rootless Applications and Services

Reactions 7 Comments 1
6 min read
What It Means To Be A Site Reliability Engineer

Reactions 305 Comments 13
5 min read
Building Solid Foundations for Operable Applications, Tools and Services

Reactions 6 Comments
2 min read
Tracking one metric opened a whole new world for me

Reactions 17 Comments
9 min read
SWEs are ruining SRE

Reactions 18 Comments 1
5 min read
Have you ever heard a more beautiful phrase than this?

Reactions 150 Comments 27
1 min read
Running Production Systems: Level 1, Software Firefighting

Reactions 29 Comments
7 min read
I Attended the "Latest DevOps Case Studies" Study Session

Reactions 8 Comments
4 min read
SRE Vs DevOps. What are the factors that overlap?

Reactions 35 Comments 13
1 min read
Technical Debt and Embracing Risk: How to find the MVP?

Reactions 20 Comments
5 min read
6 Devops interview questions

Reactions 26 Comments 3
4 min read
10 open-source Kubernetes tools for highly effective SRE and Ops Teams

Reactions 28 Comments
6 min read
How to Monitor the SRE Golden Signals

Reactions 18 Comments
7 min read
Look Upstream to Solve your Team's Reliability Issues

Reactions 2 Comments
10 min read
How to Improve On-Call with Better Practices and Tools

Reactions 2 Comments
5 min read
Leaders, Here's how to Encourage Full Service Ownership

Reactions 3 Comments
5 min read
Using this one simple trick you can cut your GCP compute costs by as much as 80%!

Reactions 4 Comments
2 min read