DEV Community

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How To Create an Incident Communication Plan

How To Create an Incident Communication Plan

Comments
7 min read
Siglas da Observabilidade SLI, SLO, SLE, MTTA, MTTR, MTBF e MTTF

Siglas da Observabilidade SLI, SLO, SLE, MTTA, MTTR, MTBF e MTTF

3
Comments
3 min read
Unpacking the Power of AWS ECS: A Comparative Look at ECS on EC2 vs. ECS on Fargate

Unpacking the Power of AWS ECS: A Comparative Look at ECS on EC2 vs. ECS on Fargate

2
Comments
3 min read
Did You Know About AWS Always-Free Services

Did You Know About AWS Always-Free Services

8
Comments 2
3 min read
Site Reliability Engineering (SRE) Consulting Services

Site Reliability Engineering (SRE) Consulting Services

Comments
2 min read
Extensões do Visual Studio Code para um SRE

Extensões do Visual Studio Code para um SRE

7
Comments
2 min read
Cloud9 starter guide with Spring Boot

Cloud9 starter guide with Spring Boot

13
Comments 3
3 min read
Vérifier les droits d'un utilisateur dans Kubernetes

Vérifier les droits d'un utilisateur dans Kubernetes

6
Comments
2 min read
New dog is ready to rock

New dog is ready to rock

2
Comments
3 min read
Monitorer son opérateur

Monitorer son opérateur

6
Comments
3 min read
Monitor an operator

Monitor an operator

2
Comments
2 min read
Datadog vs New Relic: A Duel for Dominance in LLM Observability Platforms

Datadog vs New Relic: A Duel for Dominance in LLM Observability Platforms

7
Comments
3 min read
Development vs Staging vs Production: What's the Difference?

Development vs Staging vs Production: What's the Difference?

3
Comments
6 min read
How to create a SLO for Cloud Run programatically

How to create a SLO for Cloud Run programatically

1
Comments 1
3 min read
The System Resiliency Pyramid

The System Resiliency Pyramid

2
Comments 1
5 min read
K8s operator - Synchronize resources outside Kubernetes cluster

K8s operator - Synchronize resources outside Kubernetes cluster

8
Comments
2 min read
Full Stack Observability: Connecting AWS with Datadog

Full Stack Observability: Connecting AWS with Datadog

3
Comments
4 min read
Handling Concurrent Load During an AWS Outage: A Tradeoff To Consider

Handling Concurrent Load During an AWS Outage: A Tradeoff To Consider

Comments
3 min read
Demystifying ETCD on Kubernetes: Understanding and Backing Up Your Cluster's Heartbeat

Demystifying ETCD on Kubernetes: Understanding and Backing Up Your Cluster's Heartbeat

1
Comments
2 min read
How to minimize your carbon footprint with Kube-Green?

How to minimize your carbon footprint with Kube-Green?

6
Comments
6 min read
Comment minimiser votre emprunte carbone avec Kube-Green?

Comment minimiser votre emprunte carbone avec Kube-Green?

7
Comments
7 min read
Observability Anti-Patterns and How AWS Can Help Overcome Them

Observability Anti-Patterns and How AWS Can Help Overcome Them

4
Comments
7 min read
5 Ways to Improve Your API Reliability

5 Ways to Improve Your API Reliability

1
Comments
11 min read
K8s Operator - Index with name ... does not exist

K8s Operator - Index with name ... does not exist

5
Comments
2 min read
K8s Operator - Index with name ... does not exist

K8s Operator - Index with name ... does not exist

5
Comments
2 min read
loading...