Azure Data and AI Architect Handbook

By : Olivier Mertens, Breght Van Baelen

Azure Data and AI Architect Handbook

By: Olivier Mertens, Breght Van Baelen

Overview of this book

With data’s growing importance in businesses, the need for cloud data and AI architects has never been higher. The Azure Data and AI Architect Handbook is designed to assist any data professional or academic looking to advance their cloud data platform designing skills. This book will help you understand all the individual components of an end-to-end data architecture and how to piece them together into a scalable and robust solution. You’ll begin by getting to grips with core data architecture design concepts and Azure Data & AI services, before exploring cloud landing zones and best practices for building up an enterprise-scale data platform from scratch. Next, you’ll take a deep dive into various data domains such as data engineering, business intelligence, data science, and data governance. As you advance, you’ll cover topics ranging from learning different methods of ingesting data into the cloud to designing the right data warehousing solution, managing large-scale data transformations, extracting valuable insights, and learning how to leverage cloud computing to drive advanced analytical workloads. Finally, you’ll discover how to add data governance, compliance, and security to solutions. By the end of this book, you’ll have gained the expertise needed to become a well-rounded Azure Data & AI architect.

Preface

Who is this book for?

What this book covers

Conventions used

Get in touch

Reviews

Share Your Thoughts

Download a free PDF copy of this book

Part 1: Introduction to Azure Data Architect

Free Chapter

Chapter 1: Introduction to Data Architectures

Understanding the value of data

A data architecture reference diagram

Challenges of on-premises architectures

Summary

Chapter 2: Preparing for Cloud Adoption

The Azure WAF

Data landing zones

Summary

Part 2: Data Engineering on Azure

Chapter 3: Ingesting Data into the Cloud

Batch and streaming ingestion

ADLS for raw data ingestion

Batch ingestion architectures

Streaming ingestion architectures

Summary

Chapter 4: Transforming Data on Azure

Designing data pipelines on Azure

Transforming data on Azure

Data transformation architectures

Data transformations in data lake tiers

Operationalizing data pipelines on Azure

Summary

Chapter 5: Storing Data for Consumption

Classifying the data type

Determining how the data will be used

Choosing the right storage solution on Azure

Summary

Part 3: Data Warehousing and Analytics

Chapter 6: Data Warehousing

Fundamental concepts of data warehousing

Approaches to data warehousing

SCDs

Building a data warehouse in the cloud

Summary

Chapter 7: The Semantic Layer

Multidimensional versus tabular models

The VertiPaq engine for tabular models

Modes in tabular models

Tools for the semantic layer

Summary

Chapter 8: Visualizing Data Using Power BI

Learning how Power BI works

Choosing the right license and pricing

Practicing your skills

Moving to self-service BI

Summary

Chapter 9: Advanced Analytics Using AI

Knowing the roles in data science

Designing AI solutions

Understanding AI on Azure

AI architectures on Azure

Summary

Part 4: Data Security, Governance, and Compliance

Chapter 10: Enterprise-Level Data Governance and Compliance

The importance of data governance and compliance

Governing data with Microsoft Purview

Applying enterprise-level data governance

Summary

Chapter 11: Introduction to Data Security

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share your thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Batch and streaming ingestion

Regardless of the type (batch or streaming), data ingestion is located in the first layer of the data architecture, as seen in Figure 3.1:

Figure 3.1 – Reference diagram for cloud data architectures: the ingestion layer (on the left) forms the first layer of the architecture

The ingestion layer forms the front door for the solution. Here, we pull in data using data pipelines and, in enterprise-level solutions, commonly have it land in a massive-scale, unstructured storage service such as a data lake.

The type of ingestion plays a key role in the design of a cloud data architecture. Batch ingestion was, and in most cases still is, the norm for ingesting data into the cloud. A batch approach refers to the periodical ingestion or processing of (usually large) bulks of data. Streaming ingestion, as the name suggests, involves continuous streams of data.

In general, batch ingestion and processing have long been the...

Azure Data and AI Architect Handbook

By : Olivier Mertens, Breght Van Baelen

Azure Data and AI Architect Handbook

By: Olivier Mertens, Breght Van Baelen

Overview of this book

Related Content you might be interested in

Current Title:

Azure Data and AI Architect Handbook

Microsoft Certified Azure Data Fundamentals (Exam DP-900) Certification Guide

Engineering Data Mesh in Azure Cloud

Data Lakehouse in Action

Batch and streaming ingestion