Cloud Analytics with Microsoft Azure - Second Edition

By: Has Altaiar, Jack Lee, Michael Peña

Overview of this book

Cloud Analytics with Microsoft Azure serves as a comprehensive guide for big data analysis and processing using a range of Microsoft Azure features. This book covers everything you need to build your own data warehouse and learn numerous techniques to gain useful insights by analyzing big data. The book begins by introducing you to the power of data with big data analytics, the Internet of Things (IoT), machine learning, artificial intelligence, and DataOps. You will learn about cloud-scale analytics and the services Microsoft Azure offers to empower businesses to discover insights. You will also be introduced to the new features and functionalities added to the modern data warehouse. Finally, you will look at two real-world business use cases to demonstrate high-level solutions using Microsoft Azure. The aim of these use cases will be to illustrate how real-time data can be analyzed in Azure to derive meaningful insights and make business decisions. You will learn to build an end-to-end analytics pipeline on the cloud with machine learning and deep learning concepts. By the end of this book, you will be proficient in analyzing large amounts of data with Azure and using it effectively to benefit your organization.
Table of Contents (7 chapters)

Azure services

The following sections elaborate on each of the Azure services shown in the solution design in Figure 4.1. For each service, we first explain why the component is needed, then why the Azure service is a good fit for Coolies, and finally show a brief practical example of the core part of the implementation.

Azure Data Lake Storage Gen2

Role in the design

Azure Data Lake Storage Gen2 serves as Coolies' central data store, bringing massive amounts of data from varying sources together in one place. Coolies' datasets also vary significantly in type and format (structured, semi-structured, and unstructured), so a purely tabular store is not sufficient; this is the gap that Azure Data Lake Storage Gen2 fills. It can store schema-less data as blobs and can handle varying formats (for instance, text files, images, videos, social media feeds, and zipped files). The ability...
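To make the idea of a single store for mixed formats concrete, here is a minimal Python sketch. It is not the book's implementation. The `raw/<source>/<category>/` folder layout, the extension-to-category mapping, and the helper names `lake_path` and `upload_to_lake` are all assumptions made for illustration. The upload helper assumes the `azure-storage-file-datalake` package and a valid connection string, and it writes the bytes as-is without declaring any schema.

```python
from pathlib import PurePosixPath

# Hypothetical mapping from file extension to a broad data category.
# The three categories come from the text; the extensions are assumptions.
CATEGORY_BY_EXTENSION = {
    ".csv": "structured",
    ".parquet": "structured",
    ".json": "semi-structured",
    ".xml": "semi-structured",
    ".txt": "unstructured",
    ".jpg": "unstructured",
    ".mp4": "unstructured",
    ".zip": "unstructured",
}


def lake_path(source: str, filename: str) -> str:
    """Build a lake path of the form raw/<source>/<category>/<filename>."""
    ext = PurePosixPath(filename).suffix.lower()
    # Anything unrecognised is treated as unstructured (an assumption).
    category = CATEGORY_BY_EXTENSION.get(ext, "unstructured")
    return str(PurePosixPath("raw") / source / category / filename)


def upload_to_lake(conn_str: str, file_system: str, source: str,
                   filename: str, data: bytes) -> str:
    """Upload raw bytes unchanged and return the path they were written to."""
    # Imported here so the path logic above runs without the SDK installed.
    from azure.storage.filedatalake import DataLakeServiceClient

    service = DataLakeServiceClient.from_connection_string(conn_str)
    fs_client = service.get_file_system_client(file_system)
    path = lake_path(source, filename)
    fs_client.get_file_client(path).upload_data(data, overwrite=True)
    return path


if __name__ == "__main__":
    # Files of very different kinds land side by side in the same store.
    for name in ["sales.csv", "tweets.json", "store_cam.mp4"]:
        print(lake_path("pos", name))
```

Keeping the path logic separate from the upload call means the layout can be checked locally before any data is written to the lake.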