-
Book Overview & Buying
-
Table Of Contents
SLIs and SLOs Demystified
By :
In previous chapters, we discussed the history of reliability engineering and the positive impact it has on a team and organization. We also took the opportunity to work through hypothetical system environments to build SLIs and SLOs for an internal distributed system focusing on different touchpoints within. However, once our metrics are in place, how do we proceed when the customer experience has decreased due to some incident? How do we notify the team of an issue that has occurred or is about to occur with respect to the customer experience thresholds identified? What is the appropriate cadence, timing, and levels of sensitivity to avoid overwhelming on-call engineers or interfering with other consumption-based alerting configured for product engineering teams and their respective on-call rotations?
These questions are answered and resolved through the principles of monitoring and alerting specific to SLOs. You might ask, what are the fundamental...
Change the font size
Change margin width
Change background colour