Book Image

Accelerate DevOps with GitHub

By : Michael Kaufmann
Book Image

Accelerate DevOps with GitHub

By: Michael Kaufmann

Overview of this book

This practical guide to DevOps uses GitHub as the DevOps platform and shows how you can leverage the power of GitHub for collaboration, lean management, and secure and fast software delivery. The chapters provide simple solutions to common problems, thereby helping teams that are already on their DevOps journey to further advance into DevOps and speed up their software delivery performance. From finding the right metrics to measure your success to learning from other teams’ success stories without merely copying what they’ve done, this book has it all in one place. As you advance, you’ll find out how you can leverage the power of GitHub to accelerate your value delivery – by making work visible with GitHub Projects, measuring the right metrics with GitHub Insights, using solid and proven engineering practices with GitHub Actions and Advanced Security, and moving to event-based and loosely coupled software architecture. By the end of this GitHub book, you'll have understood what factors influence software delivery performance and how you can measure your capabilities, thus realizing where you stand in your journey and how you can move forward.
Table of Contents (31 chapters)
1
Part 1: Lean Management and Collaboration
7
Part 2: Engineering DevOps Practices
14
Part 3: Release with Confidence
19
Part 4: Software Architecture
22
Part 5: Lean Product Management
25
Part 6: GitHub for your Enterprise

Fault injection and chaos engineering

If you want to level up testing in production, you can practice fault injection – also known as chaos engineering. This means that you inject faults into your production system to see how it behaves under pressure and if your failover mechanisms and circuit breakers work. Possible faults could include high CPU load, high memory usage, disk I/O pressure, low disk space, or a service or entire machine being shut down or rebooted. Other possibilities include processes being killed, the system's time being changed, network traffic being dropped, latency being injected, and DNS servers being blocked.

Practicing chaos engineering makes your system resilient. You cannot compare this to classical load or performance testing!

Different tools can help you with chaos engineering. Gremlin (https://www.gremlin.com/), for example, is an agent-based SaaS offering that supports most cloud providers (Azure, AWS, and Google Cloud) and all operating...