-
Book Overview & Buying
-
Table Of Contents
SLIs and SLOs Demystified
By :
SLIs and SLOs Demystified
By:
Overview of this book
In today's digital landscape, ensuring service reliability is more than just a necessity—it’s a competitive advantage. SLIs and SLOs Demystified equips software engineers, SREs, and business leaders with the knowledge to build, measure, and manage service level indicators (SLIs) and service level objectives (SLOs) efficiently. Written by Alexandra F. McCoy—an experienced site reliability engineer with over a decade of experience in the cloud and technology industry—this book simplifies complex reliability concepts for engineers at all levels.
Starting with a review of reliability engineering basics, Alexandra provides a step-by-step approach to defining impactful SLIs, facilitating productive SLO discussions, and integrating observability into your monitoring strategy. You'll also see how these principles apply to web applications, distributed systems, databases, and new features through real-world examples that can help you develop SLIs and SLOs for your specific environment. The book goes beyond implementation to explore the financial impact of reliability, alerting strategies, integration with incident management, and using error budgets for business decisions.
By the end of this book, you’ll be able to drive operational excellence, minimize unplanned downtime, and optimize end user experiences with well-established reliability metrics.
Table of Contents (20 chapters)
Preface
Chapter 1: SLIs and SLOs at the Heart of Reliability
Chapter 2: Establishing an SLI and SLO Team
Chapter 3: Things to Consider When Crafting Your SLIs and SLOs
Chapter 4: Observability and Monitoring Are a Necessity and a Must
Chapter 5: The Financial Impact of Not Adopting Indicators
Part 2: The Tough Stuff – Kickstarting the SLI and SLO Conversation
Chapter 6: Workshop Preparation: Structuring the SLI and SLO Conversation
Chapter 7: Scenario 1: SLIs and SLOs for Web Applications
Chapter 8: Scenario 2: SLIs and SLOs for Distributed Systems
Chapter 9: Scenario 3: Optimizing SLIs and SLOs for Database Performance
Chapter 10: Scenario 4: Developing SLIs and SLOs for New Features
Part 3: Help! We’ve Identified Our SLIs and SLOs… Now What?
Chapter 11: SLO Monitoring and Alerting
Chapter 12: Service Level Performance Metrics: Daily Operations
Chapter 13: SLO Preservation and Incident Management
Chapter 14: SLIs and SLOs as a Service
Index