Hands-On Data Warehousing with Azure Data Factory

By: Christian Cote, Michelle Gutzait, Giuseppe Ciaburro

Overview of this book

ETL is one of the essential techniques in data processing. Given that data is everywhere, ETL will always be a vital process for handling data from different sources. Hands-On Data Warehousing with Azure Data Factory starts with the basic concepts of data warehousing and the ETL process. You will learn how to use Azure Data Factory (ADF) and SSIS, and come to understand the key components of an ETL solution. With the help of practical examples, you will go through different Azure services that can be used by ADF and SSIS, such as Azure Data Lake Analytics, Machine Learning, and Spark on Databricks. You will explore how to design and implement hybrid ETL solutions using different integration services with a step-by-step approach. Once you get to grips with all this, you will use Power BI to interact with data coming from different sources in order to reveal valuable insights. By the end of this book, you will not only have learned how to build your own ETL solutions but also how to address the key challenges faced while building them.

Databricks notebook


We are now ready to consume and transform data from the Sales.txt file we created in the previous section. We'll go back to the Databricks workspace and create a new notebook. The easiest way to do this is to use a template. As shown in the following screenshot, we select the Data icon in the toolbar on the left of our workspace, then go to Tables | Spark Data Sources | Azure Blob Storage. We click on Create Table in Notebook to create the new notebook:

The notebook opens with several sections already filled in. We'll first rename it: right-click on the notebook name and select Rename.

Enter ADFV2Notebook and click on Rename to rename it.

We'll now move it to the ADFCalls folder. Right-click again on the newly renamed notebook and select Move from the submenu, as shown next:

The following dialog box appears. Select the folder ADFCalls and click on Select to move the notebook to its desired location:

Now that the notebook is named and located properly, we'll modify its content.
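To give a rough idea of the kind of code such a notebook holds, the sketch below shows one common way to read a text file from Azure Blob Storage with PySpark. It is not the template's actual content. The helper names, the storage account and container names, and the assumption that Sales.txt is comma-delimited with a header row are all hypothetical. The `wasbs://` URL scheme and the `fs.azure.account.key.<account>.blob.core.windows.net` setting are the usual Hadoop conventions for Azure Blob Storage, and `spark` is the session object that a Databricks notebook provides.

```python
def blob_account_key_setting(storage_account: str) -> str:
    """Name of the Spark/Hadoop setting that holds the access key of a Blob Storage account."""
    return f"fs.azure.account.key.{storage_account}.blob.core.windows.net"


def blob_file_url(container: str, storage_account: str, path: str) -> str:
    """Build the wasbs:// URL for a file stored in a Blob Storage container."""
    return f"wasbs://{container}@{storage_account}.blob.core.windows.net/{path.lstrip('/')}"


def read_sales(spark, storage_account: str, container: str, access_key: str,
               path: str = "Sales.txt", delimiter: str = ","):
    """Read the sales file into a DataFrame.

    Assumptions (not confirmed by the text): the file is delimited text
    with a header row, and the account is accessed with its access key.
    """
    # Register the storage account key so Spark can authenticate.
    spark.conf.set(blob_account_key_setting(storage_account), access_key)

    # Load the file as delimited text and let Spark infer the column types.
    return (
        spark.read.format("csv")
        .option("header", "true")
        .option("inferSchema", "true")
        .option("sep", delimiter)
        .load(blob_file_url(container, storage_account, path))
    )
```

In a notebook cell, this could be called as `df = read_sales(spark, "mystorageaccount", "sales", "<access key>")`, where the account and container names are placeholders. In practice, the access key would usually be kept in a secret store rather than typed into the notebook.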

A notebook is used...