In my past experience as an SRE, I learned some valuable lessons about how to respond to and learn from incidents. If you want the TL;DR, I’ll summarize them here: Declare and run retros for the small incidents. It’s less stressful, and action items become much more actionable. Decrease the time it takes to analyze an […]
The post SRE’s Guide to Pragmatic Incident Response appeared first on DevOps.com.
Source: DevOps.com