Book Image

Hands-On Data Warehousing with Azure Data Factory

By : Christian Cote, Michelle Gutzait, Giuseppe Ciaburro
Book Image

Hands-On Data Warehousing with Azure Data Factory

By: Christian Cote, Michelle Gutzait, Giuseppe Ciaburro

Overview of this book

ETL is one of the essential techniques in data processing. Given data is everywhere, ETL will always be the vital process to handle data from different sources. Hands-On Data Warehousing with Azure Data Factory starts with the basic concepts of data warehousing and ETL process. You will learn how Azure Data Factory and SSIS can be used to understand the key components of an ETL solution. You will go through different services offered by Azure that can be used by ADF and SSIS, such as Azure Data Lake Analytics, Machine Learning and Databrick’s Spark with the help of practical examples. You will explore how to design and implement ETL hybrid solutions using different integration services with a step-by-step approach. Once you get to grips with all this, you will use Power BI to interact with data coming from different sources in order to reveal valuable insights. By the end of this book, you will not only learn how to build your own ETL solutions but also address the key challenges that are faced while building them.
Table of Contents (12 chapters)

Creating and configuring Data Lake Store


We will first create and configure the Data Lake Store:

  1. Open the Azure Portal. If you are just starting, you will not see any resource configured under the All resources and ALL SUBSCRIPTIONS section:
  1. On the top left, click on Create a resource; enter the words Data Lake in Search the Marketplace:

  1. Select Data Lake Store from the list (third option in the image) if you have no Data Lake stores yet; the following screen will open up:
  1. Select Create.
  2. Enter the details of the Data Lake. Note that the name has to be all lowercase and with no special characters. You will get a message as you type if you've entered any incorrect character. In this case, we are not using any encryption, for simplicity. Note that the default is encryption enabled. For more information about the encryption options, see Encryption of data in Azure Data Lake Store (https://docs.microsoft.com/en-us/azure/data-lake-store/data-lake-store-get-started-portal).
  1. Select Create.

 

  1. Once the Data...