Book Image

Azure Data Scientist Associate Certification Guide

By : Andreas Botsikas, Michael Hlobil
Book Image

Azure Data Scientist Associate Certification Guide

By: Andreas Botsikas, Michael Hlobil

Overview of this book

The Azure Data Scientist Associate Certification Guide helps you acquire practical knowledge for machine learning experimentation on Azure. It covers everything you need to pass the DP-100 exam and become a certified Azure Data Scientist Associate. Starting with an introduction to data science, you'll learn the terminology that will be used throughout the book and then move on to the Azure Machine Learning (Azure ML) workspace. You'll discover the studio interface and manage various components, such as data stores and compute clusters. Next, the book focuses on no-code and low-code experimentation, and shows you how to use the Automated ML wizard to locate and deploy optimal models for your dataset. You'll also learn how to run end-to-end data science experiments using the designer provided in Azure ML Studio. You'll then explore the Azure ML Software Development Kit (SDK) for Python and advance to creating experiments and publishing models using code. The book also guides you in optimizing your model's hyperparameters using Hyperdrive before demonstrating how to use responsible AI tools to interpret and debug your models. Once you have a trained model, you'll learn to operationalize it for batch or real-time inferences and monitor it in production. By the end of this Azure certification study guide, you'll have gained the knowledge and the practical skills required to pass the DP-100 exam.
Table of Contents (17 chapters)
1
Section 1: Starting your cloud-based data science journey
6
Section 2: No code data science experimentation
9
Section 3: Advanced data science tooling and capabilities

To get the most out of this book

This book tries to provide you with everything you need to learn. The Further reading section of each chapter contains links to pages that will help you deep dive, into topics that are peripheral to the contents of this book. It will help if you have some basic familiarity with the Azure portal and have read some Python code in the past.

In this book, we guide you to use the Notebooks experience available within the AzureML studio. If you want to execute the same code on your workstation instead of the cloud-based experience, you will need a Python environment to run Jupyter notebooks. The easiest way to run Jupyter notebooks on your workstation is through VSCode, a free cross-platform editor with fantastic Python support. You will also need to install Git in your workstation to clone the book’s GitHub repository.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book's GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

If you face any issue executing the code, ensure that you have cloned the latest version from the GitHub repository. If the problem persists, feel free to open a GitHub issue to describe the issue you are facing and help you solve it.