Chapter 7: Using Databricks Spark Clusters

Book Overview & Buying
Table Of Contents

Cloud Scale Analytics with Azure Data Services

By : Borosch

4.9 (7)

Buy this Book

Cloud Scale Analytics with Azure Data Services

4.9 (7)

By: Borosch

Buy this Book

Overview of this book

Azure Data Lake, the modern data warehouse architecture, and related data services on Azure enable organizations to build their own customized analytical platform to fit any analytical requirements in terms of volume, speed, and quality. This book is your guide to learning all the features and capabilities of Azure data services for storing, processing, and analyzing data (structured, unstructured, and semi-structured) of any size. You will explore key techniques for ingesting and storing data and perform batch, streaming, and interactive analytics. The book also shows you how to overcome various challenges and complexities relating to productivity and scaling. Next, you will be able to develop and run massive data workloads to perform different actions. Using a cloud-based big data-modern data warehouse-analytics setup, you will also be able to build secure, scalable data estates for enterprises. Finally, you will not only learn how to develop a data warehouse but also understand how to create enterprise-grade security and auditing big data programs. By the end of this Azure book, you will have learned how to develop a powerful and efficient analytical platform to meet enterprise needs.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Share Your Thoughts

Section 1: Data Warehousing and Considerations Regarding Cloud Computing

Free Chapter

Chapter 1: Balancing the Benefits of Data Lakes Over Data Warehouses

Distinguishing between Data Warehouses and Data Lakes

Understanding the opportunities of modern cloud computing

Exploring the benefits of AI and ML

Answering the question

Summary

Chapter 2: Connecting Requirements and Technology

Formulating your requirements

Understanding basic architecture patterns

Finding the right Azure tool for the right purpose

Understanding Industry Data Models

Thinking about different sizes

Understanding the supporting services

Summary

Questions

Section 2: The Storage Layer

Chapter 3: Understanding the Data Lake Storage Layer

Technical requirements

Setting up your Cloud Big Data Storage

Organizing your data lake

Implementing a data model in your Data Lake

Monitoring your storage account

Talking about backups

Implementing access control in your Data Lake

Setting the networking options

Discovering additional knowledge

Summary

Further reading

Chapter 4: Understanding Synapse SQL Pools and SQL Options

Uncovering MPP in the cloud – the power of 60

Provisioning a Synapse dedicated SQL pool

Talking about partitioning

Implementing workload management

Scaling the database

Loading data

Understanding other SQL options in Azure

Summary

Further reading

Section 3: Cloud-Scale Data Integration and Data Transformation

Chapter 5: Integrating Data into Your Modern Data Warehouse

Technical requirements

Setting up Azure Data Factory

Examining the authoring environment

Using wizards

Adding data transformation logic

Understanding integration runtimes

Integrating with DevOps

Summary

Further reading

Chapter 6: Using Synapse Spark Pools

Technical requirements

Setting up a Synapse Spark pool

Examining the Synapse Spark architecture

Programming with Synapse Spark pools

Using additional libraries with your Spark pool

Handling security

Monitoring your Synapse Spark pools

Summary

Further reading

Chapter 7: Using Databricks Spark Clusters

Technical requirements

Provisioning Databricks

Examining the Databricks workspace

Understanding the Databricks components

Setting up security

Monitoring Databricks

Summary

Further reading

Chapter 8: Streaming Data into Your MDWH

Technical requirements

Provisioning ASA

Implementing an ASA job

Understanding ASA SQL

Using Structured Streaming with Spark

Security in your streaming solution

Monitoring your streaming solution

Summary

Further reading

Chapter 9: Integrating Azure Cognitive Services and Machine Learning

Technical requirements

Understanding Azure Cognitive Services

Using Cognitive Services with your data

Examining Azure Machine Learning

Using Azure Machine Learning with your modern data warehouse

Summary

Further reading

Chapter 10: Loading the Presentation Layer

Technical requirements

Understanding the loading strategy with Synapse-dedicated SQL pools

Loading data into Synapse-dedicated SQL pools

Using Synapse serverless SQL pools

Integrating data with Synapse Spark pools

Exchanging metadata between computes

Summary

Further reading

Section 4: Data Presentation, Dashboarding, and Distribution

Chapter 11: Developing and Maintaining the Presentation Layer

Developing with Synapse Studio

Backing up and DR in Azure Synapse

Monitoring your MDWH

Understanding security in your MDWH

Summary

Further reading

Chapter 12: Distributing Data

Technical requirements

Building data marts with Power BI

Creating data models with Azure Analysis Services

Distributing data using Azure Data Share

Summary

Further reading

Chapter 13: Introducing Industry Data Models

Understanding Common Data Model

Examining and leveraging predefined entities

Discovering Azure Industry Data Workbench

Summary

Further reading

Chapter 14: Establishing Data Governance

Technical requirements

Discovering Azure Purview

Classifying data

Integrating with Azure services

Using data lineage

Discovering Insights

Discovering more Purview

Summary

Further reading

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Cloud Scale Analytics with Azure Data Services

By : Borosch

Cloud Scale Analytics with Azure Data Services

By: Borosch

Overview of this book

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access