Book Image

Modern Distributed Tracing in .NET

By : Liudmila Molkova

Book Image

Modern Distributed Tracing in .NET

By: Liudmila Molkova

Overview of this book

As distributed systems become more complex and dynamic, their observability needs to grow to aid the development of holistic solutions for performance or usage analysis and debugging. Distributed tracing brings structure, correlation, causation, and consistency to your telemetry, thus allowing you to answer arbitrary questions about your system and creating a foundation for observability vendors to build visualizations and analytics. Modern Distributed Tracing in .NET is your comprehensive guide to observability that focuses on tracing and performance analysis using a combination of telemetry signals and diagnostic tools. You'll begin by learning how to instrument your apps automatically as well as manually in a vendor-neutral way. Next, you’ll explore how to produce useful traces and metrics for typical cloud patterns and get insights into your system and investigate functional, configurational, and performance issues. The book is filled with instrumentation examples that help you grasp how to enrich auto-generated telemetry or produce your own to get the level of detail your system needs, along with controlling your costs with sampling, aggregation, and verbosity. By the end of this book, you'll be ready to adopt and leverage tracing and other observability signals and tools and tailor them to your needs as your system evolves.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Download a free PDF copy of this book

Part 1: Introducing Distributed Tracing

Part 1: Introducing Distributed Tracing

Free Chapter

Chapter 1: Observability Needs of Modern Applications

Chapter 1: Observability Needs of Modern Applications

Understanding why logs and counters are not enough

Introducing distributed tracing

Reviewing context propagation

Ensuring consistency and structure

Performance analysis overview

Further reading

Chapter 2: Native Monitoring in .NET

Chapter 2: Native Monitoring in .NET

Technical requirements

Building a sample application

Monitoring with runtime counters

Enabling auto-collection with OpenTelemetry

Exploring auto-generated telemetry

Chapter 3: The .NET Observability Ecosystem

Chapter 3: The .NET Observability Ecosystem

Technical requirements

Using instrumentations for popular libraries

Leveraging infrastructure

Instrumenting serverless environments

Chapter 4: Low-Level Performance Analysis with Diagnostic Tools

Chapter 4: Low-Level Performance Analysis with Diagnostic Tools

Technical requirements

Investigating common performance problems

Using diagnostics tools in production

Part 2: Instrumenting .NET Applications

Part 2: Instrumenting .NET Applications

Chapter 5: Configuration and Control Plane

Chapter 5: Configuration and Control Plane

Technical requirements

Controlling costs with sampling

Enriching and filtering telemetry

Customizing context propagation

Processing a pipeline with the OpenTelemetry Collector

Chapter 6: Tracing Your Code

Chapter 6: Tracing Your Code

Technical requirements

Tracing with System.Diagnostics or the OpenTelemetry API shim

Using ambient context

Recording events

Correlating spans with links

Testing your instrumentation

Chapter 7: Adding Custom Metrics

Chapter 7: Adding Custom Metrics

Technical requirements

Metrics in .NET – past and present

Using an asynchronous gauge

Using histograms

Chapter 8: Writing Structured and Correlated Logs

Chapter 8: Writing Structured and Correlated Logs

Technical requirements

Logging evolution in .NET

Logging with ILogger

Capturing logs with OpenTelemetry

Managing logging costs

Part 3: Observability for Common Cloud Scenarios

Part 3: Observability for Common Cloud Scenarios

Chapter 9: Best Practices

Chapter 9: Best Practices

Technical requirements

Choosing the right signal

Getting more with less

Staying consistent with semantic conventions

Chapter 10: Tracing Network Calls

Chapter 10: Tracing Network Calls

Technical requirements

Instrumenting client calls

Instrumenting server calls

Instrumenting streaming calls

Observability in action

Chapter 11: Instrumenting Messaging Scenarios

Chapter 11: Instrumenting Messaging Scenarios

Technical requirements

Observability in messaging scenarios

Instrumenting the producer

Instrumenting the consumer

Instrumenting batching scenarios

Performance analysis in messaging scenarios

Chapter 12: Instrumenting Database Calls

Chapter 12: Instrumenting Database Calls

Technical requirements

Instrumenting database calls

Tracing cache calls

Analyzing performance

Part 4: Implementing Distributed Tracing in Your Organization

Part 4: Implementing Distributed Tracing in Your Organization

Chapter 13: Driving Change

Chapter 13: Driving Change

Understanding the importance of observability

The onboarding process

Continuous observability

Further reading

Chapter 14: Creating Your Own Conventions

Chapter 14: Creating Your Own Conventions

Technical requirements

Defining custom conventions

Sharing common schema and code

Using OpenTelemetry schemas and tools

Chapter 15: Instrumenting Brownfield Applications

Chapter 15: Instrumenting Brownfield Applications

Technical requirements

Instrumenting legacy services

Propagating context

Consolidating telemetry from legacy monitoring tools

Assessments

Chapter 1 – Observability Needs of Modern Applications

Chapter 2 – Native Monitoring in .NET

Chapter 3 – The .NET Observability Ecosystem

Chapter 4 – Low-Level Performance Analysis with Diagnostic Tools

Chapter 5 – Configuration and Control Plane

Chapter 6 – Tracing Your Code

Chapter 7 – Adding Custom Metrics

Chapter 8 – Writing Structured and Correlated Logs

Chapter 9 – Best Practices

Chapter 10 – Tracing Network Calls

Chapter 11 – Instrumenting Messaging Scenarios

Chapter 12 – Instrumenting Database Calls

Chapter 13 – Driving Change

Chapter 14 – Creating Your Own Conventions

Chapter 15 – Instrumenting Brownfield Applications

Index

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Chapter 14 – Creating Your Own Conventions

A possible solution is to define and document the stability level for attributes.

For example, new conventions are always added at the alpha stability level. Once it’s fully implemented and deployed, and you’re mostly happy with the outcome, the convention can be graduated to beta.

Conventions should stay in beta until someone tries to use them for alerts, reports, or dashboards. If it works fine, or after feedback is addressed, the convention becomes stable. After that, it cannot be changed in a breaking manner.

It should be possible to validate actual telemetry to some extent.

For example, it should be possible to write a test processor (an in-process one or a custom collector component) that identifies specific spans, events, or metrics that should follow the convention and checks whether the conventions are applied consistently. This test processor could warn about issues found, flag unknown...