Creating a datastore and ingesting data
After having a look through the options for storing data in Azure for ML processing, we will now create a storage account, which we will use throughout the book for our raw data and ML datasets. In addition, we will have a look at how to transfer some data into our storage account manually and how to perform this task automatically by utilizing integration engines available in Azure.
Creating Blob Storage and connecting it with the Azure Machine Learning workspace
Let's start by creating a storage account. Any storage account will come with a file share, a queue, and table storage for you to utilize in other scenarios. In addition to those three, you can either end up with Blob Storage or a Data Lake, depending on the settings you provide at creation time. By default, a Blob storage account will be created. If we instead want a Data Lake account, we must set the enable-hierarchical-namespace
setting to True
, as Data Lake offers an...