April Fools and the Broken Promises of One-off Hacks
Ask DEV: LightWeight APM for Kubernetes using OpenTelemetry?
Have you considered Site Reliability Engineering as a path?
⁉ Why I started developing 💡 my new software project by building a 🚀 Continuous Deployment 🔃 pipeline
How does your team handle critical production errors?
Folks, what are some conferences in DevOps/SRE space that you look forward to?
Molly Struve had a long winding journey to SRE... and other things I learned recording her DevJourney
A Team Without A Manager: Working Within A Dysfunctional Team
Resources to learn about DevOps cultural concepts and some tools
Rapid Docker on AWS: How to monitor the application?
Introduction to open source observability tools on Kubernetes
What Is a Site Reliability Engineer? Should You Become One?
Managing CNAMEs with Azure Resource Manager Templates
Using the Azure Portal to Check Configured Privileges
For the Love of Bleep! Building a Scalable Monitoring System
Three quick tips when setting up a new node with Chef Infra!
Building Solid Foundations for Operable Applications, Tools and Services
Tracking one metric opened a whole new world for me
Have you ever heard a more beautiful phrase than this?
Running Production Systems: Level 1, Software Firefighting
Technical Debt and Embracing Risk: How to find the MVP?
10 open-source Kubernetes tools for highly effective SRE and Ops Teams
Look Upstream to Solve your Team's Reliability Issues
How to Improve On-Call with Better Practices and Tools
Leaders, Here's how to Encourage Full Service Ownership