Book Image

Azure Data and AI Architect Handbook

By : Olivier Mertens, Breght Van Baelen
Book Image

Azure Data and AI Architect Handbook

By: Olivier Mertens, Breght Van Baelen

Overview of this book

With data’s growing importance in businesses, the need for cloud data and AI architects has never been higher. The Azure Data and AI Architect Handbook is designed to assist any data professional or academic looking to advance their cloud data platform designing skills. This book will help you understand all the individual components of an end-to-end data architecture and how to piece them together into a scalable and robust solution. You’ll begin by getting to grips with core data architecture design concepts and Azure Data & AI services, before exploring cloud landing zones and best practices for building up an enterprise-scale data platform from scratch. Next, you’ll take a deep dive into various data domains such as data engineering, business intelligence, data science, and data governance. As you advance, you’ll cover topics ranging from learning different methods of ingesting data into the cloud to designing the right data warehousing solution, managing large-scale data transformations, extracting valuable insights, and learning how to leverage cloud computing to drive advanced analytical workloads. Finally, you’ll discover how to add data governance, compliance, and security to solutions. By the end of this book, you’ll have gained the expertise needed to become a well-rounded Azure Data & AI architect.
Table of Contents (18 chapters)
1
Part 1: Introduction to Azure Data Architect
4
Part 2: Data Engineering on Azure
8
Part 3: Data Warehousing and Analytics
13
Part 4: Data Security, Governance, and Compliance

Data protection

The core component in data protection, apart from discovery and classification covered in the last chapter, is data encryption. This can be done depending on the state of the data, as seen in Figure 11.1.

Figure 11.1 – Data is either at rest, in transit, or in use

Figure 11.1 – Data is either at rest, in transit, or in use

Data is either at rest (inside the database or storage), in transit (when moving the data from one place to another), or in use. While the data is at rest or in transit, the data should be encrypted to maximize security.

Encryption at rest

The first layer of protection is provided by Azure automatically, by encrypting data at rest using one of the strongest block ciphers in the world, 256-bit Advanced Encryption Standard (AES) encryption.

The key for this server-side encryption (SSE) can be managed either by the platform (Microsoft-managed key) or by the organization (customer-managed key).

A second layer of data encryption can be added for SQL databases: transparent...