Bringing reliability closer to you with Reliably and DataDog
4 easy steps to set up AWS WorkSpaces (Screenshots included)
Serverless Stonks checker app for Wall Street Bets: week 3 activity report
6 Easy steps for sharing an encrypted AWS RDS snapshot between two accounts
Introducing Teaming in LitmusChaos to ease your Chaos Engineering experience
What AWS Lambda metrics should you definitely be monitoring?
7 Ways SRE Is Changing IT Ops And How To Prepare For Those Changes
Litmus 2.0 - Simplifying Chaos Engineering for Enterprises
Everything You Need to Know About Kubernetes Operator and SRE
How to continue a Jenkins build's execution when a stage fails
A different approach to working with Ansible variables
Having On-call Nightmares? Runbooks can Help you Wake Up.
How to track your product's SLO/ErrorBudget: A simple tool to keep track of things!
SRE2AUX: How Flight Controllers were the first SREs
So you Want an SRE Tool. Do you Build, Buy, or Open Source?
Kubernetes Health Checks - 2 Ways to Improve Stability in Your Production Applications
It's all Chaos! And it Makes for Resilience at Scale
How We Built and Use Runbook Documentation at Blameless
Deep Dive into Docker Internals - Union Filesystem
Top Reliability and Scaling Practices from Experts at Citrix, Greenlight Financial, and Incognia
Reliability as an Inseparable Part of Software Engineering
Getting Started as an SRE? Here are 3 Things You Need to Know.
The Key Differences between SLI, SLO, and SLA in SRE
How to Back Up your Application Data to S3 with Walrus
What is the right AWS Kubernetes distribution for you?
The True Cost of Building your Own Incident Management System (IMS)