Limitless Analytics with Azure Synapse

By: Prashant Kumar Mishra

Overview of this book

Azure Synapse Analytics, which Microsoft describes as the next evolution of Azure SQL Data Warehouse, is a limitless analytics service that brings enterprise data warehousing and big data analytics together. With this book, you'll learn how to discover insights from your data effectively using this platform. The book starts with an overview of Azure Synapse Analytics, its architecture, and how it can be used to improve business intelligence and machine learning capabilities. Next, you'll go on to choose and set up the correct environment for your business problem. You'll also learn a variety of ways to ingest data from various sources and orchestrate the data using transformation techniques offered by Azure Synapse. Later, you'll explore how to handle both relational and non-relational data using the SQL language. As you progress, you'll perform real-time streaming and execute data analysis operations on your data using various languages, before going on to apply ML techniques to derive accurate and granular insights from data. Finally, you'll discover how to protect sensitive data in real time by using security and privacy features. By the end of this Azure book, you'll be able to build end-to-end analytics solutions while focusing on data prep, data management, data warehousing, and AI tasks.
Table of Contents (20 chapters)
Section 1: The Basics and Key Concepts
Section 2: Data Ingestion and Orchestration
Section 3: Azure Synapse for Data Scientists and Business Analysts
Section 4: Best Practices

Creating a Synapse workspace

A Synapse workspace provides an integrated console to manage, monitor, and administer all the components and services of Azure Synapse Analytics. To get started with Azure Synapse Analytics, we first need to create a workspace, which gives us access to the different features of the service.

You can create a Synapse workspace in the Azure portal just by providing some basic details. Follow these steps to create your first Azure Synapse workspace:

  1. Go to https://portal.azure.com and sign in with your credentials.
  2. Click on Create a resource:
    Figure 1.2 – A screenshot of the Azure portal

  3. Search for Azure Synapse using the search bar.
  4. Select Azure Synapse Analytics (Workspaces preview) from the search drop-down and click on Create:
    Figure 1.3 – A screenshot of the Azure Synapse Analytics page in Azure Marketplace

  5. You need to provide basic details to create your Synapse Analytics workspace:
    • Subscription: Select the subscription under which the Synapse workspace will be created. If your Azure account has several subscriptions, pick the specific one you intend to use for this workspace.

      Important note

      All resources in a subscription are billed together.

    • Resource group: A resource group is a container that holds related resources, either everything for a solution or only those resources that you want to manage as one group. Select a Resource group for the Synapse workspace. If you do not already have one, click on Create new right below the Resource group text field:
Figure 1.4 – A screenshot highlighting the field to provide a Resource group name

  • Workspace name: Provide an appropriate name for the workspace that you are going to create.

    Important note

    This name must be unique, so it is best to make it specific to your team or project.

  • Region: You can see many options in the dropdown. Select the most appropriate region for your Synapse Analytics workspace:
Figure 1.5 – A screenshot of regions appearing in a drop-down list

  • Select Data Lake Storage Gen2: This will be the primary storage account for the workspace, holding catalog data and metadata associated with the workspace:
Figure 1.6 – A screenshot highlighting fields of Select Data Lake Storage Gen2

  • Account name: You can select an existing account from the dropdown or create a new one. Only Data Lake Storage Gen2 accounts with a hierarchical namespace enabled appear in the dropdown. If you click on Create new instead, a Data Lake Storage Gen2 account is created with the hierarchical namespace enabled.

    Important note

    A storage account name must be between 3 and 24 characters in length and use numbers and lowercase letters only.

  • File system name: Again, you can select an existing file system from the dropdown or create a new one. To create a new file system, click on Create new and provide an appropriate name for it. A file system name must contain only lowercase letters, numbers, or hyphens:
Figure 1.7 – A screenshot highlighting assignment of the Storage Blob Data Contributor role

  6. Click on Security + networking to configure security options and networking settings for your workspace, as seen in Figure 1.8.

    Provide SQL administrator credentials that can be used for administrator access to the workspace's SQL pools. We will talk about SQL pools in future chapters:

    Figure 1.8 – A screenshot of the Security + networking form for Azure Synapse

  7. Click on Tags to assign name-value pairs to this resource.
  8. Go to the next page to review the summary, and click on Create after verifying all the details.
  9. In your Azure Synapse workspace in the Azure portal, click Open Synapse Studio:
    Figure 1.9 – A screenshot highlighting the link for launching Synapse Studio

The deployment takes just a couple of minutes and creates a workspace that brings analytics, ETL, reporting, modeling, and analysis together under one umbrella. Now you are ready to build your enterprise-level solution!
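
The naming rules quoted in the steps above are easy to trip over when filling in the form, so it can help to check your chosen names before opening the portal. The following is a minimal sketch, not part of any Azure SDK: the WorkspaceForm class, the check_form function, and the sample names are all hypothetical, and only the rules stated in this section are checked (Azure may enforce further constraints that are not covered here).

```python
import re
from dataclasses import dataclass
from typing import List

# Rules as stated in the steps above (assumption: no further rules are checked here).
# Storage account name: 3 to 24 characters, lowercase letters and numbers only.
STORAGE_ACCOUNT_RE = re.compile(r"[a-z0-9]{3,24}")
# File system name: lowercase letters, numbers, or hyphens only.
FILE_SYSTEM_RE = re.compile(r"[a-z0-9-]+")


@dataclass
class WorkspaceForm:
    """Hypothetical container for the values entered in the portal form."""
    workspace_name: str
    region: str
    storage_account: str
    file_system: str


def check_form(form: WorkspaceForm) -> List[str]:
    """Return a list of problems; an empty list means none were found."""
    problems = []
    if not form.workspace_name.strip():
        problems.append("workspace name is empty")
    if not STORAGE_ACCOUNT_RE.fullmatch(form.storage_account):
        problems.append(
            "storage account name must be 3-24 lowercase letters or numbers"
        )
    if not FILE_SYSTEM_RE.fullmatch(form.file_system):
        problems.append(
            "file system name may contain only lowercase letters, numbers, or hyphens"
        )
    return problems


# Sample values are made up for illustration only.
good = WorkspaceForm("contoso-synapse-demo", "eastus", "contosodatalake01", "synapse-fs")
bad = WorkspaceForm("contoso-synapse-demo", "eastus", "Contoso_Lake", "Synapse FS")

print(check_form(good))
for problem in check_form(bad):
    print("-", problem)
```

Passing this check does not guarantee that the portal will accept the names. Uniqueness of the workspace name, for example, can only be confirmed by Azure itself.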