Cloud Scale Analytics with Azure Data Services

By: Patrik Borosch

Overview of this book

Azure Data Lake, the modern data warehouse architecture, and related data services on Azure enable organizations to build a customized analytical platform that fits their requirements for volume, speed, and quality. This book is your guide to the features and capabilities of Azure data services for storing, processing, and analyzing structured, semi-structured, and unstructured data of any size. You will explore key techniques for ingesting and storing data and for performing batch, streaming, and interactive analytics. The book also shows you how to overcome challenges and complexities relating to productivity and scaling. Next, you will develop and run massive data workloads. Using a cloud-based setup that combines big data, a modern data warehouse, and analytics, you will also be able to build secure, scalable data estates for enterprises. Finally, you will learn not only how to develop a data warehouse but also how to implement enterprise-grade security and auditing for big data programs. By the end of this book, you will have learned how to develop a powerful and efficient analytical platform to meet enterprise needs.

Table of Contents (20 chapters)

1. Section 1: Data Warehousing and Considerations Regarding Cloud Computing
4. Section 2: The Storage Layer
7. Section 3: Cloud-Scale Data Integration and Data Transformation
14. Section 4: Data Presentation, Dashboarding, and Distribution

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "You can now start entering an alias for your input connection. Name it something such as airdelaystreaminginput."

A block of code is set as follows:

SELECT
    t1.Cartype,
    SUM(t2.mgNOx/60) as SumNOx
FROM
    Cartraffic as t1 TIMESTAMP BY ObservedT
JOIN
    CarStats as t2
ON
    t1.Cartype = t2.Cartype
GROUP BY 
    t1.Cartype,
    TUMBLINGWINDOW(minute, 10)

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

SELECT 
    CensusStation,
    COUNT(*) as Amount
FROM
    Cartraffic 
TIMESTAMP BY 
    ObservedT
GROUP BY 
    CensusStation,
    System.Timestamp()

Bold: Indicates a new term, an important word, or words that you see onscreen. For example, words in menus or dialog boxes appear in the text like this. Here is an example: "Please click Create to start the provisioning of your configuration."

Tips or important notes

Appear like this.