-
Book Overview & Buying
-
Table Of Contents
Mastering Azure Databricks for Data Engineers
By :
Mastering Azure Databricks for Data Engineers
By:
Overview of this book
Embark on a transformative journey through the Azure Databricks platform with this expertly crafted course. We start by laying a solid foundation, guiding you through the course prerequisites and familiarizing you with the resources at your disposal. Our introduction section covers the essentials of data engineering and how Apache Spark integrates with Databricks, setting the stage for a deep dive into the platform.
As you progress, you’ll create an Azure cloud account and Databricks workspace, gaining insights into the platform's architecture. Hands-on sessions will enable you to create Spark clusters, work with Databricks notebooks, and utilize magic commands and utilities effectively. We then delve into the Databricks File System (DBFS), teaching you how to manage and mount data storage efficiently.
The course further explores Unity Catalog for secure data management, Delta Lake for robust data processing, and incremental ingestion tools for real-time data handling. You'll also master Databricks Delta Live Tables (DLT), enhancing your skills in building scalable data pipelines. Our final sections cover automation features, including working with Databricks Repos, Workflows, REST API, and CLI, ensuring you can automate and streamline your data projects.
Table of Contents (12 chapters)
Before you start
Introduction
Getting Started
Working in Databricks Workspace
Working with Databricks File System - DBFS
Working with Unity Catalog
Working with Delta Lake and Delta Tables
Working with Databricks Incremental Ingestion Tools
Working with Databricks Delta Live Tables (DLT)
Databricks Project and Automation Features
Capstone Project
Final Word