Book Image

Repeatability, Reliability, and Scalability through GitOps

By : Bryan Feuling

Book Image

Repeatability, Reliability, and Scalability through GitOps

By: Bryan Feuling

Overview of this book

The world of software delivery and deployment has come a long way in the last few decades. From waterfall methods to Agile practices, every company that develops its own software has to overcome various challenges in delivery and deployment to meet customer and market demands. This book will guide you through common industry practices for software delivery and deployment. Throughout the book, you'll follow the journey of a DevOps team that matures their software release process from quarterly deployments to continuous delivery using GitOps. With the help of hands-on tutorials, projects, and self-assessment questions, you'll build your knowledge of GitOps basics, different types of GitOps practices, and how to decide which GitOps practice is the best for your company. As you progress, you'll cover everything from building declarative language files to the pitfalls in performing continuous deployment with GitOps. By the end of this book, you'll be well-versed with the fundamentals of delivery and deployment, the different schools of GitOps, and how to best leverage GitOps in your teams.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the color images

Conventions used

Section 1: Fundamentals of GitOps

Section 1: Fundamentals of GitOps

Free Chapter

Chapter 1: The Fundamentals of Delivery and Deployment

Chapter 1: The Fundamentals of Delivery and Deployment

How did we get here?

What is a deployment process?

What is a delivery process?

What makes any practice continuous?

Chapter 2: Exploring Common Industry Delivery and Deployment Practices

Chapter 2: Exploring Common Industry Delivery and Deployment Practices

Common industry practices for deployment

Common industry practices for delivery

Common tools used for deployment and delivery

Needs versus wants

The automation test

Chapter 3: The "What" and "Why" of GitOps

Chapter 3: The "What" and "Why" of GitOps

Why does a continuous process matter?

What is GitOps?

Declarative execution and declarative state

To push or pull ?

Section 2: GitOps Types, Benefits, and Drawbacks

Section 2: GitOps Types, Benefits, and Drawbacks

Chapter 4: The Original GitOps – Continuous Deployment in Kubernetes

Chapter 4: The Original GitOps – Continuous Deployment in Kubernetes

Original GitOps basics

Kubernetes and operators

Manifest explosion

Benefits and drawbacks of originalist GitOps

Common originalist GitOps tools

Chapter 5: The Purist GitOps – Continuous Deployment Everywhere

Chapter 5: The Purist GitOps – Continuous Deployment Everywhere

Purist GitOps basics

Servers, containers, and serverless deployment-as-code

Declarative files overload

Benefits and drawbacks of purist GitOps

Common purist GitOps tools

Chapter 6: Verified GitOps – Continuous Delivery Declaratively Defined

Chapter 6: Verified GitOps – Continuous Delivery Declaratively Defined

Verified GitOps basics

Test, governance, deploy, verify, and restore as code

One file, many files, or somewhere in-between

Benefits and drawbacks of verified GitOps

Common verified GitOps tools

Chapter 7: Best Practices for Delivery, Deployment, and GitOps

Chapter 7: Best Practices for Delivery, Deployment, and GitOps

The purpose of best practices

Continuous deployment considerations

Continuous delivery considerations

Where GitOps ties in

Section 3: Hands-On Practical GitOps

Section 3: Hands-On Practical GitOps

Chapter 8: Practicing the Basics – Declarative Language File Building

Chapter 8: Practicing the Basics – Declarative Language File Building

Nesting and flat files

XML file building

JSON file building

YAML file building

Templatization types

Chapter 9: Originalist Gitops in Practice – Continuous Deployment

Chapter 9: Originalist Gitops in Practice – Continuous Deployment

Setting up minikube

Setting up VSCode

Setting up Kubernetes and Helm

Local continuous deployment with GitOps file building

Originalist GitOps with Argo CD

Chapter 10: Verified GitOps Setup – Continuous Delivery GitOps with Harness

Chapter 10: Verified GitOps Setup – Continuous Delivery GitOps with Harness

Mapping out the process

One manifest or many

A manifest for integrations

A manifest for configuration

A manifest for execution

A manifest for delivery

Verified GitOps with Harness

Chapter 11: Pitfall Examples – Experiencing Issues with GitOps

Chapter 11: Pitfall Examples – Experiencing Issues with GitOps

Building and testing Kubernetes manifests

Failure strategies

Governance and approvals

Proprietary manifest building

Chapter 12: What's Next?

Chapter 12: What's Next?

Delivery versus deployment

GitOps – what and why

Continuous deployment GitOps – originalist and purist

Continuous delivery GitOps – verified

Best practices first, then GitOps

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

How did we get here?

It's 8 a.m. on a Saturday and the release party's post-mortem has finally been completed. Throughout the release, every encountered issue resulted in a Root Cause Analysis process. Once each of the RCAs were done, the release team would then create and assign tickets as needed, resulting in action items for the different teams in the Engineering organization. With the post-mortem being completed, the release team can hand-off the monitoring of the production application to the weekend support team and head home.

The final production servers were upgraded with the new application release at around 3 a.m. that morning with all of the application health checks successfully passing by 3:30 a.m. And yet, even with the early morning finishing time, this is a significant improvement when comparing it to the release parties of a few years ago. Previously, the applications were released every 6 to 12 months, rather than the quarterly release cadence that the company is currently on.

Their company had hired a consulting agency to advise them on how to improve their application's mean-time-to-market and reduce their production outages in order to meet business initiatives and demands. The outcome suggested by this consulting agency was to release the application more frequently than once or twice a year. As a result, the releases have been quicker and less prone to error, which the business has taken notice of. The release parties still require pulling an all-nighter, but the previous release parties were more like all-weekenders or longer.

The on-call engineering team still has to be brought in for every release, but at least they aren't required to be a part of the release party for the entire time. And the most recent release only required a conference bridge for about 4 hours to solve issues with the underlying code or provide quick fixes. Overall, the operations team, infrastructure team, network team, and security team were able to solve most of the issues that showed up, which accounted for significantly more confidence in the newer release cadence.

The different teams should be able to accomplish the backlog of issues before the next release. But the team with the largest issue backlog were the systems administrators , who build, integrate, administrate, and troubleshoot the many different tools used during the releases.

After 12 straight hours with over 15 members across a host of different teams, the release party was complete. When considering the time associated with the attempt to improve the process throughout the quarter, as well as the actual release itself, it is not difficult to run the mental math on the associated costs. The teams need to figure out a way to make the releases more reliable, repeatable, and scalable.

This analogy is all too familiar for many who have been involved in the engineering side of a business during the Waterfall software development life cycle days. When applications were first made available as a SaaS (Software as a Service) solution,the common release cadence was an annual release. Throughout the year a company would deploy small release, often called patches, which mainly consisted of hardware, software, or security updates.

Since the yearly update was essentially releasing a brand-new product, the release process required significant involvement from every team across the entire engineering organization. The release was a major event , often taking an entire weekend, or longer, from every team available. Many in the industry had dubbed this event a release party. Each release party included significant amounts of caffeine and food, which accompanied a host of people hunched over their laptops as they watched the output of the release on a massive projector screen. Yet the worst part of this whole scenario was that this was the expected release style for every company at the time.

The quarterly release cadence was a novel idea that revolutionized how companies would develop and test their code. The code changes were smaller in nature and the teams evolved their thinking from a new product every year to a new subversion every quarter. Some user experience changes may be introduced, but most of the user experience in the application would remain the same from release to release. Another major benefit to the increased release frequency was the significant reduction in lead time, which is the time it takes to go from a feature being requested to being available in production.

Alongside the release parties were two very important processes when issues would arise during the release:

Root-cause analysis (RCA)
Post-mortem

An RCA would occur anytime there was a significant issue in production that would halt or severely affect the functionality or availability of the application. Often, the RCA process would start with the teams analyzing what was wrong, fixing the issue, validating that the fix worked, and then documenting how the issue arose and what the root cause was. Every release party would result in at least one RCA taking place, and would exponentially increase in number relative to the total amount of production servers involved in the release party.

The post-mortem was a retrospective process after the release was completed and the teams were confident in production operating as expected. The release captain would gather any and all information related to RCAs, bugs, errors, and so on, and create the required documentation and tickets. At the end of the post-mortem, a weekend support team would be briefed on the release party outcome and any items needing to be monitored.

The desire to automate the release of the application had been a central focus of every engineering organizations for years Automation was seen as the best way to enforce reliability and repeatability into the release process, and most of the common tools in use today were created with the intention and purpose of release automation. These tools, and really the underlying processes they address, intend to solve two major concepts in the software development life cycle: deliveries and deployments.