Book Image

Kubernetes in Production Best Practices

By : Aly Saleh, Murat Karslioglu

Book Image

Kubernetes in Production Best Practices

By: Aly Saleh, Murat Karslioglu

Overview of this book

Although out-of-the-box solutions can help you to get a cluster up and running quickly, running a Kubernetes cluster that is optimized for production workloads is a challenge, especially for users with basic or intermediate knowledge. With detailed coverage of cloud industry standards and best practices for achieving scalability, availability, operational excellence, and cost optimization, this Kubernetes book is a blueprint for managing applications and services in production. You'll discover the most common way to deploy and operate Kubernetes clusters, which is to use a public cloud-managed service from AWS, Azure, or Google Cloud Platform (GCP). This book explores Amazon Elastic Kubernetes Service (Amazon EKS), the AWS-managed version of Kubernetes, for working through practical exercises. As you get to grips with implementation details specific to AWS and EKS, you'll understand the design concepts, implementation best practices, and configuration applicable to other cloud-managed services. Throughout the book, you’ll also discover standard and cloud-agnostic tools, such as Terraform and Ansible, for provisioning and configuring infrastructure. By the end of this book, you’ll be able to leverage Kubernetes to operate and manage your production environments confidently.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Chapter 1: Introduction to Kubernetes Infrastructure and Production-Readiness

Chapter 1: Introduction to Kubernetes Infrastructure and Production-Readiness

The basics of Kubernetes infrastructure

Why Kubernetes is challenging in production

Kubernetes production-readiness

Kubernetes infrastructure best practices

Cloud-native approach

Further reading

Free Chapter

Chapter 2: Architecting Production-Grade Kubernetes Infrastructure

Chapter 2: Architecting Production-Grade Kubernetes Infrastructure

Understanding Kubernetes infrastructure design considerations

Exploring Kubernetes deployment strategy alternatives

Designing an Amazon EKS infrastructure

Further reading

Chapter 3: Provisioning Kubernetes Clusters Using AWS and Terraform

Chapter 3: Provisioning Kubernetes Clusters Using AWS and Terraform

Technical requirements

Implementation principles and best practices

Cluster deployment and rollout strategy

Preparing Terraform

Creating the network infrastructure

Creating the cluster infrastructure

Cleaning up and destroying infrastructure resources

Further reading

Chapter 4: Managing Cluster Configuration with Ansible

Chapter 4: Managing Cluster Configuration with Ansible

Technical requirements

Installing the required tools

Implementation principles

Kubernetes configuration management

Configuring the clusters

Destroying the cluster's resources

Further reading

Chapter 5: Configuring and Enhancing Kubernetes Networking Services

Chapter 5: Configuring and Enhancing Kubernetes Networking Services

Technical requirements

Introducing networking production readiness

Configuring Kube Proxy

Configuring the Amazon CNI plugin

Configuring CoreDNS

Configuring ExternalDNS

Configuring NGINX Ingress Controller

Deploying the cluster's network services

Destroying the cluster's resources

Further reading

Chapter 6: Securing Kubernetes Effectively

Chapter 6: Securing Kubernetes Effectively

Technical requirements

Securing Kubernetes infrastructure

Managing cluster access

Managing secrets and certificates

Securing workloads and apps

Ensuring cluster security and compliance

Bonus security tips

Deploying the security configurations

Destroying the cluster

Further reading

Chapter 7: Managing Storage and Stateful Applications

Chapter 7: Managing Storage and Stateful Applications

Technical requirements

Implementation principles

Understanding the challenges with stateful applications

Tuning Kubernetes storage

Choosing a persistent storage solution

Deploying stateful applications

Further reading

Chapter 8: Deploying Seamless and Reliable Applications

Chapter 8: Deploying Seamless and Reliable Applications

Technical requirements

Understanding the challenges with container images

Learning application deployment strategies

Scaling applications and achieving higher availability

Further reading

Chapter 9: Monitoring, Logging, and Observability

Chapter 9: Monitoring, Logging, and Observability

Technical requirements

Understanding the challenges with Kubernetes observability

Learning site reliability best practices

Monitoring, metrics, and visualization

Logging and tracing

Further reading

Chapter 10: Operating and Maintaining Efficient Kubernetes Clusters

Chapter 10: Operating and Maintaining Efficient Kubernetes Clusters

Technical requirements

Learning about cluster maintenance and upgrades

Preparing for backups and disaster recovery

Validating cluster quality

Further reading

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Learning site reliability best practices

In this section, we will learn about considerations and best practices followed by the industry site reliability experts that handle technical site availability issues when observed.

Site Reliability Engineering (SRE) is a discipline introduced by the Google engineering team. Google's approach of operating their core services at scale still represents a model for SRE best practices today. You can read more about the foundations and practices on the Google SRE resources site at https://sre.google/resources/. Before we learn about the monitoring and metric visualization tools, let's learn about a few common-sense SRE best practices we should consider:

Automate everything possible and automate now: SREs should take every opportunity to automate time-consuming infrastructure tasks. As part of a DevOps culture, SREs work with autonomous teams choosing their own services, which makes the unification of tools almost impossible...