Reliability concepts: Availability, Resiliency, Robustness, Fault-Tolerance, and Reliability
Ensuring reliability: SLOs, on-call process, and postmortems
10 most important Metrics you must know as a DevOps Engineer
SRE book notes: Introduction to Site Reliability Engineering
Observability is becoming mission critical, but who watches the watchmen?
Reliability Restaurant – How to approach software reliability as a mindset
Improve Resilience with Controlled Chaos Engineering
How does chaos engineering relate to the mathematical definitions of chaos?
Error Economics - How to avoid breaking the budget
Bringing reliability closer to you with Reliably and DataDog
What Do Reliability, Scalability, and Maintainability Mean?
SRE + Honeycomb: Observability for Service Reliability
What are the most important features you need in your logging product?
How our team improved perceived reliability of Kaggle Notebooks
Falando sobre SRE - Parte 01 - Uma breve introdução
How SLOs Help Evernote's SRE Team Manage Tech Debt
Designing software for the enterprise - Pt.1 Reliability