SRE on AWS
As the world shifts toward cloud evolution, the reliability of all types of applications running on the cloud has become a critical business imperative. The cloud changes the way we think about managing systems. The focus has shifted from hardware to software and from manual processes to automated tasks. SRE is one such practice that focuses on building sturdy, flexible systems with continuous improvement and automation at its core.
What is SRE?
SRE is the process of automating IT infrastructure tasks to improve the reliability of scalable software systems. Tasks can range from change management, system management, incident response, and application monitoring. The SRE practice aims to improve the stability and quality of the service that is being made available to the end users. You can see a depiction of SRE in the following diagram:
Figure 12.5 – SRE
As shown in Figure 12.5, SRE sits right at the crossroads of operations, engineering...