Practical Site Reliability Engineering

Practical Site Reliability Engineering

By : Pethuru Raj Chelliah, Shreyash Naithani, Shailender Singh

Buy this Book

Practical Site Reliability Engineering

By: Pethuru Raj Chelliah, Shreyash Naithani, Shailender Singh

Buy this Book

Overview of this book

Site reliability engineering (SRE) is being touted as the most competent paradigm in establishing and ensuring next-generation high-quality software solutions. This book starts by introducing you to the SRE paradigm and covers the need for highly reliable IT platforms and infrastructures. As you make your way through the next set of chapters, you will learn to develop microservices using Spring Boot and make use of RESTful frameworks. You will also learn about GitHub for deployment, containerization, and Docker containers. Practical Site Reliability Engineering teaches you to set up and sustain containerized cloud environments, and also covers architectural and design patterns and reliability implementation techniques such as reactive programming, and languages such as Ballerina and Rust. In the concluding chapters, you will get well-versed with service mesh solutions such as Istio and Linkerd, and understand service resilience test practices, API gateways, and edge/fog computing. By the end of this book, you will have gained experience on working with SRE concepts and be able to deliver highly reliable apps and services.

Title Page

Dedication

About Packt

Contributors

Preface

Free Chapter

Demystifying the Site Reliability Engineering Paradigm

Setting the context for practical SRE

Plunging into the SRE discipline

The need for highly reliable platforms and infrastructures

Reactive systems

Highly reliable IT infrastructures

The vitality of the SRE domain

Summary

Microservices Architecture and Containers

What are microservices?

Microservice design principles

Deploying microservices

Practical examples of microservice deployment

Microservices using Spring Boot and the RESTful framework

Jersey Framework

Representational State Transfer (REST)

Important facts about microservices

Summary

Microservice Resiliency Patterns

Briefing microservices and containers

IT reliability challenges and solution approaches

The promising and potential approaches for resiliency and reliability

Summary

DevOps as a Service

What is DaaS?

Collaboration with development and QA teams

Summary

Container Cluster and Orchestration Platforms

Resilient microservices

Application and volume containers

Clustering and managing containers

Container orchestration and management

Summary

Architectural and Design Patterns

Architecture pattern

Design pattern

Summary

Reliability Implementation Techniques

Ballerina programming

Reliability

Rust programming

Summary

Realizing Reliable Systems - the Best Practices

Reliable IT systems – the emerging traits and tips

MSA for reliable software

Service mesh solutions

Microservices design – best practices

Asynchronous messaging patterns for event-driven microservices

The role of EDA to produce reactive applications

Reliable IT infrastructures

Infrastructure as code

Summary

Service Resiliency

Delineating the containerization paradigm

Demystifying microservices architecture

Decoding the growing role of Kubernetes for the container era

Describing the service mesh concept

Why is service mesh paramount?

Service mesh architectures

Summary

Containers, Kubernetes, and Istio Monitoring

Prometheus

Grafana

Summary

Post-Production Activities for Ensuring and Enhancing IT Reliability

Modern IT infrastructure

Monitoring clouds, clusters, and containers

Cloud infrastructure and application monitoring

The monitoring tool capabilities

Prognostic, predictive, and prescriptive analytics

Log analytics

IT operational analytics

IT performance and scalability analytics

IT security analytics

The importance of root-cause analysis

Summary

Monitoring is not a one-time task. We should be regularly measuring what's going on with our Kubernetes pods or our microservices. Monitoring plays a crucial role in the microservice system, as we need to monitor all endpoints in our microservices. To achieve a higher quality product, we should be able to detect failures before our customer does. We should enable anomaly detection and notify our operation team to troubleshoot the problem. We have to set up the necessary monitoring and alerts on both the infrastructure side and the application side.In this chapter, we saw how to use Prometheus and Grafana metrics to create powerful dashboards and alerts.

In the next chapter, we will talk about post-production activities and best practices for ensuring and enhancing the IT reliability.

Practical Site Reliability Engineering

By : Pethuru Raj Chelliah, Shreyash Naithani, Shailender Singh

Practical Site Reliability Engineering

By: Pethuru Raj Chelliah, Shreyash Naithani, Shailender Singh

Overview of this book

Related Content you might be interested in

Current Title:

Practical Site Reliability Engineering

Hands-On RESTful API Design Patterns and Best Practices

Architectural Patterns

Learning Docker

Summary