MDD Monitoring Driven Development
From CitconWiki
Revision as of 01:51, 17 October 2022 by Pascaldufour (talk | contribs)
- My Philosophy on Alerting (based my observations while I was a Site Reliability Engineer at Google) by Rob Ewaschuk: https://docs.google.com/document/d/199PqyG3UsyXlwieHaqbGiWVa8eMWi8zzAn0YfcApr8Q/edit
- Patrick Debois: Codifying devops practices: https://jedi.be/blog/2012/05/12/codifying-devops-area-practices/
good questions to ask:
- what does this data mean?
- If we are not wachting it -> delete it?
- Should we try "Failure Friday"?
- Should we use "Daily Red"?
- Is this indicator fast enough (leading or lagging indicator) to react?