Book Image

Modern Distributed Tracing in .NET

By : Liudmila Molkova
Book Image

Modern Distributed Tracing in .NET

By: Liudmila Molkova

Overview of this book

As distributed systems become more complex and dynamic, their observability needs to grow to aid the development of holistic solutions for performance or usage analysis and debugging. Distributed tracing brings structure, correlation, causation, and consistency to your telemetry, thus allowing you to answer arbitrary questions about your system and creating a foundation for observability vendors to build visualizations and analytics. Modern Distributed Tracing in .NET is your comprehensive guide to observability that focuses on tracing and performance analysis using a combination of telemetry signals and diagnostic tools. You'll begin by learning how to instrument your apps automatically as well as manually in a vendor-neutral way. Next, you’ll explore how to produce useful traces and metrics for typical cloud patterns and get insights into your system and investigate functional, configurational, and performance issues. The book is filled with instrumentation examples that help you grasp how to enrich auto-generated telemetry or produce your own to get the level of detail your system needs, along with controlling your costs with sampling, aggregation, and verbosity. By the end of this book, you'll be ready to adopt and leverage tracing and other observability signals and tools and tailor them to your needs as your system evolves.
Table of Contents (23 chapters)
1
Part 1: Introducing Distributed Tracing
6
Part 2: Instrumenting .NET Applications
11
Part 3: Observability for Common Cloud Scenarios
16
Part 4: Implementing Distributed Tracing in Your Organization

Chapter 14 – Creating Your Own Conventions

  1. A possible solution is to define and document the stability level for attributes.

For example, new conventions are always added at the alpha stability level. Once it’s fully implemented and deployed, and you’re mostly happy with the outcome, the convention can be graduated to beta.

Conventions should stay in beta until someone tries to use them for alerts, reports, or dashboards. If it works fine, or after feedback is addressed, the convention becomes stable. After that, it cannot be changed in a breaking manner.

  1. It should be possible to validate actual telemetry to some extent.

For example, it should be possible to write a test processor (an in-process one or a custom collector component) that identifies specific spans, events, or metrics that should follow the convention and checks whether the conventions are applied consistently. This test processor could warn about issues found, flag unknown...