Book Image

Scalable Data Analytics with Azure Data Explorer

By : Jason Myerscough
Book Image

Scalable Data Analytics with Azure Data Explorer

By: Jason Myerscough

Overview of this book

Azure Data Explorer (ADX) enables developers and data scientists to make data-driven business decisions. This book will help you rapidly explore and query your data at scale and secure your ADX clusters. The book begins by introducing you to ADX, its architecture, core features, and benefits. You'll learn how to securely deploy ADX instances and navigate through the ADX Web UI, cover data ingestion, and discover how to query and visualize your data using the powerful Kusto Query Language (KQL). Next, you'll get to grips with KQL operators and functions to efficiently query and explore your data, as well as perform time series analysis and search for anomalies and trends in your data. As you progress through the chapters, you'll explore advanced ADX topics, including deploying your ADX instances using Infrastructure as Code (IaC). The book also shows you how to manage your cluster performance and monthly ADX costs by handling cluster scaling and data retention periods. Finally, you'll understand how to secure your ADX environment by restricting access with best practices for improving your KQL query performance. By the end of this Azure book, you'll be able to securely deploy your own ADX instance, ingest data from multiple sources, rapidly query your data, and produce reports with KQL and Power BI.
Table of Contents (18 chapters)
1
Section 1: Introduction to Azure Data Explorer
5
Section 2: Querying and Visualizing Your Data
11
Section 3: Advanced Azure Data Explorer Topics

Summary

Congratulations on completing your first steps in learning about Azure Data Explorer! In this chapter, you learned about the different stages of the data analytics pipeline. Understanding the stages of the pipeline helps simplify your ability to comprehend the workflow of taking raw data and performing analysis on the data and visualizing your findings.

We then introduced some of the popular Azure data analytics services and mapped them to the different stages of the data analytics pipeline. Some of the services, such as Event Hubs, will be used in later chapters to ingest data into our own ADX databases.

We then learned what ADX is, what the main features are, and briefly looked at the ADX architecture to understand how ADX provides excellent performance by using both column stores and row stores, and how ADX scales both vertically and horizontally efficiently by implementing one of the fundamental Azure design principles of decoupling compute and storage. We then discussed some of the use cases of ADX that we will use throughout this book, such as time series analysis.

Finally, we learned how to connect to ADX clusters and query databases using the ADX UI. In the next chapter, we will learn how to create and manage our own ADX clusters and databases using the Azure portal, PowerShell, and the Azure CLI.

Before moving on to the next chapter, try modifying ${HOME}/Scalable-Data-Analytics-with-Azure-Data-Explorer/Chapter01/first-query.kql and display an area chart. The solution can be found at ${HOME}/Scalable-Data-Analytics-with-Azure-Data-Explorer/Chapter01/population-areachart.kql. What other types of charts can you render?

Additionally, here is some information you should know. The Azure Data Explorer UI supports a feature known as IntelliSense, as shown in Figure 1.11. IntelliSense provides code completion and hints when you are writing your queries, so you do not need to worry about memorizing all the keywords:

Figure 1.11 – IntelliSense features

Figure 1.11 – IntelliSense features

We will be using IntelliSense throughout this book when using both Visual Studio Code and the Azure Data Explorer Web UI. Visual Studio Code will be used for editing our scripts and ARM templates, and the Azure Data Explorer Web UI is where we will execute most of our KQL queries.