Chapter 1: Observability 101

The need for observability in a distributed application environment

What is observability?

Building blocks of observability

Benefits of observability

Chapter 2: Overview of the Observability Landscape on AWS

Overview of observability tools in AWS

Overview of native observability services in AWS

Overview of AWS-managed open source observability services in AWS

Adoption of observability services in AWS

Chapter 3: Gathering Operational Data and Alerting Using Amazon CloudWatch

Overview of CloudWatch metrics and logs

Deployment and configuration of the CloudWatch agent in an EC2 instance

Overview of CloudWatch alarms and dashboards

Overview of Amazon EventBridge

Chapter 4: Implementing Distributed Tracing Using AWS X-Ray

Navigating the AWS X-Ray console

Overview of AWS X-Ray

End-to-end instrumentation of a sample application deployed in an EC2 instance

Part 2: Automated and Machine Learning-Powered Observability on AWS

Chapter 5: Insights into Operational Data with CloudWatch

Deriving operational intelligence from CloudWatch metrics

Exploring CloudWatch Application Insights

Exploring CloudWatch Logs Insights

Exploring CloudWatch Contributor Insights and its use cases

Chapter 6: Observability for Containerized Applications on AWS

Introduction to CloudWatch Container Insights

Implementing observability for a distributed application running on Amazon EKS

Implementing observability for a distributed application running on Amazon ECS

End-to-end visibility of containerized applications using AWS App Mesh

Understanding and troubleshooting performance bottlenecks in containers

Chapter 7: Observability for Serverless Applications on AWS

Deploying a basic serverless application running on AWS Lambda

CloudWatch Lambda Insights

End-to-end tracing of the Node.js application

Troubleshooting performance issues using X-Ray groups

Chapter 8: End User Experience Monitoring on AWS

End user experience monitoring

CloudWatch Synthetics

CloudWatch RUM

Part 3: Open Source Managed Services on AWS

Chapter 9: Collecting Metrics and Traces Using OpenTelemetry

An open standard to collect metrics and traces using AWS Distro for OpenTelemetry

How to instrument once for multiple monitoring destinations

Instrumenting a container application running on ECS using OpenTelemetry

Chapter 10: Deploying and Configuring an Amazon Managed Service for Prometheus

Prometheus and Grafana overview

Setting up Amazon Managed Service for Prometheus and Grafana

Ingesting telemetry data

Querying Prometheus metrics via API and Grafana

Implementing container monitoring

Chapter 11: Deploying the Elasticsearch, Logstash, and Kibana Stack Using Amazon OpenSearch Service

Amazon OpenSearch Service overview

Setup and configuration of Amazon OpenSearch Service

Observability of the application traces and logs using Amazon OpenSearch Service

Anomaly detection in Amazon OpenSearch Service

Security for Amazon OpenSearch Service

Part 4: Scaled Observability and Beyond

Chapter 12: Augmenting the Human Operator with Amazon DevOps Guru

Overview of Amazon DevOps Guru

Reviewing Amazon DevOps Guru insights for serverless applications in AWS

Understanding Relational Database Service (RDS) performance issues using DevOps Guru

AI and ML insights

Observability best practices at scale

Chapter 13: Observability Best Practices at Scale

Exploring cross-account cross-Region CloudWatch

Chapter 14: Be Well-Architected for Operational Excellence

An overview of the AWS Well-Architected Framework

Applying the Well-Architected Framework and exploring automated solutions

Understanding management and governance in the Well-Architected Framework

Overview of Cloud Adoption Framework 3.0

Chapter 15: The Role of Observability in the Cloud Adoption Framework

Cloud transformation journey

Developing an observability strategy for your organization

Role of observability in the CAF and the best practices for quicker adoption of the cloud

Beyond observability