Mastering Azure Machine Learning - Second Edition

By: Christoph Körner, Marcel Alsdorf

Overview of this book

Azure Machine Learning is a cloud service for accelerating and managing the machine learning (ML) project life cycle that ML professionals, data scientists, and engineers can use in their day-to-day workflows. This book covers the end-to-end ML process using Microsoft Azure Machine Learning, including data preparation, performing and logging ML training runs, designing training and deployment pipelines, and managing these pipelines via MLOps. The first section shows you how to set up an Azure Machine Learning workspace; ingest and version datasets; and preprocess, label, and enrich these datasets for training. In the next two sections, you'll discover how to enrich and train ML models for embedding, classification, and regression. You'll explore advanced NLP techniques, traditional ML models such as boosted trees, modern deep neural networks, recommendation systems, reinforcement learning, and complex distributed ML training techniques, all using Azure Machine Learning. The last section will teach you how to deploy the trained models as a batch pipeline or real-time scoring service using Docker, Azure Machine Learning clusters, Azure Kubernetes Service, and alternative deployment targets. By the end of this book, you'll be able to combine all the steps you've learned by building an MLOps pipeline.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Share Your Thoughts

Section 1: Introduction to Azure Machine Learning

Chapter 1: Understanding the End-to-End Machine Learning Process

Grasping the idea behind ML

Understanding the mathematical basis for statistical analysis and ML modeling

Discovering the end-to-end ML process

Chapter 2: Choosing the Right Machine Learning Service in Azure

Choosing an Azure service for ML

Managed ML services

Custom ML services

Custom compute services for ML

Chapter 3: Preparing the Azure Machine Learning Workspace

Technical requirements

Deploying an Azure Machine Learning workspace

Exploring the Azure Machine Learning service

Running ML experiments with Azure Machine Learning

Section 2: Data Ingestion, Preparation, Feature Engineering, and Pipelining

Chapter 4: Ingesting Data and Managing Datasets

Technical requirements

Choosing data storage solutions for Azure Machine Learning

Creating a datastore and ingesting data

Using datasets in Azure Machine Learning

Chapter 5: Performing Data Analysis and Visualization

Technical requirements

Understanding data exploration techniques

Performing data analysis on a tabular dataset

Understanding dimensional reduction techniques

Chapter 6: Feature Engineering and Labeling

Technical requirements

Understanding and applying feature engineering

Handling data labeling

Chapter 7: Advanced Feature Extraction with NLP

Technical requirements

Understanding categorical data

Building a simple bag-of-words model

Leveraging term importance and semantics

Implementing end-to-end language models

Chapter 8: Azure Machine Learning Pipelines

Technical requirements

Using pipelines in ML workflows

Building and publishing an ML pipeline

Integrating pipelines with other Azure services

Section 3: The Training and Optimization of Machine Learning Models

Chapter 9: Building ML Models Using Azure Machine Learning

Technical requirements

Working with tree-based ensemble classifiers

Training an ensemble classifier model using LightGBM

Chapter 10: Training Deep Neural Networks on Azure

Technical requirements

Introduction to Deep Learning

Training a CNN for image classification

Chapter 11: Hyperparameter Tuning and Automated Machine Learning

Technical requirements

Finding the optimal model parameters with HyperDrive

Finding the optimal model with Automated Machine Learning

Chapter 12: Distributed Machine Learning on Azure

Technical requirements

Exploring methods for distributed ML

Using distributed ML in Azure

Chapter 13: Building a Recommendation Engine in Azure

Technical requirements

Introduction to recommendation engines

A content-based recommender system

Collaborative filtering – a rating-based recommender system

Combining content and ratings in hybrid recommendation engines

Automatic optimization through reinforcement learning

Section 4: Machine Learning Model Deployment and Operations

Chapter 14: Model Deployment, Endpoints, and Operations

Technical requirements

Preparations for model deployments

Deploying ML models in Azure

ML operations in Azure

Chapter 15: Model Interoperability, Hardware Optimization, and Integrations

Technical requirements

Model interoperability with ONNX

Hardware optimization with FPGAs

Integrating ML models and endpoints with Azure services

Chapter 16: Bringing Models into Production with MLOps

Technical requirements

Ensuring reproducible builds and deployments

Validating the code, data, and models

Building an end-to-end MLOps pipeline

Chapter 17: Preparing for a Successful ML Journey

Remembering the importance of data

Starting with a thoughtful infrastructure

Automating recurrent tasks

Expecting constant change

Thinking about your responsibility

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Summary

Distributed ML is an effective way to scale out your training infrastructure and speed up the training process. It is applied in many real-world scenarios, and Horovod together with Azure Machine Learning makes it straightforward to set up.

Parallel execution is similar to hyperparameter searching, where the individual runs do not need to exchange information, while distributed execution is similar to Bayesian optimization, which we discussed in detail in the previous chapter and where each run builds on the results of others. Distributed executions therefore need methods to perform communication (such as one-to-one, one-to-many, many-to-one, and many-to-many) and synchronization (such as barrier synchronization) efficiently. These so-called collective algorithms are provided by communication backends (MPI, Gloo, and NCCL) and allow efficient GPU-to-GPU communication.
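The communication patterns and barrier synchronization named above can be illustrated with a small simulation. This is a minimal sketch, not how MPI, Gloo, or NCCL are implemented: threads stand in for workers, a shared list stands in for the network, and the names `WORLD_SIZE`, `mailbox`, `all_reduce_sum`, and `broadcast` are hypothetical helpers chosen for this example.

```python
import threading

WORLD_SIZE = 4  # hypothetical number of workers

# Shared list standing in for the network. A real backend such as MPI,
# Gloo, or NCCL would move these values between processes or GPUs.
mailbox = [None] * WORLD_SIZE
barrier = threading.Barrier(WORLD_SIZE)
results = [None] * WORLD_SIZE


def all_reduce_sum(rank, value):
    """Many-to-many: every worker contributes a value and receives the sum."""
    mailbox[rank] = value
    barrier.wait()  # barrier synchronization: wait until all workers have written
    total = sum(mailbox)
    barrier.wait()  # wait until all workers have read before the mailbox is reused
    return total


def broadcast(rank, value, root=0):
    """One-to-many: the root's value is delivered to every worker."""
    if rank == root:
        mailbox[root] = value
    barrier.wait()  # wait until the root has written
    received = mailbox[root]
    barrier.wait()  # wait until all workers have read
    return received


def worker(rank):
    local = rank + 1  # the workers hold 1, 2, 3, and 4
    total = all_reduce_sum(rank, local)
    seed = broadcast(rank, 42 if rank == 0 else None)
    results[rank] = (total, seed)


threads = [threading.Thread(target=worker, args=(r,)) for r in range(WORLD_SIZE)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(results)
```

After the run, every worker holds the same sum and the same broadcast value, which is the property that collective operations guarantee. Real backends reach the same result with far less data movement, for example by arranging workers in a ring or a tree.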

DL frameworks build higher-level abstractions on top of communication backends to perform model-parallel and data-parallel training. In data-parallel training, we partition the input data to compute multiple independent parts of the...
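As a rough illustration of the data-parallel idea, the sketch below partitions a toy dataset across workers, lets each worker compute a partial result on its own shard, and then combines the partial results. Because the passage above is cut off, the choice of a gradient as the quantity computed per shard is an assumption, as are the one-parameter linear model, the helper names `gradient` and `partition`, and the values of `num_workers` and `lr`.

```python
def gradient(w, shard):
    """Sum of per-example gradients of 0.5 * (w * x - y) ** 2 with respect to w."""
    return sum((w * x - y) * x for x, y in shard)


def partition(data, num_workers):
    """Split the data into num_workers shards in round-robin order."""
    return [data[i::num_workers] for i in range(num_workers)]


data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]  # toy data with y = 2x
num_workers = 2  # hypothetical worker count
lr = 0.01        # hypothetical learning rate
w = 0.0

shards = partition(data, num_workers)

for step in range(200):
    # Each worker would compute this independently on its own shard.
    local_grads = [gradient(w, shard) for shard in shards]
    # Stand-in for an allreduce: combine the partial results so that
    # every worker applies the same update.
    global_grad = sum(local_grads) / len(data)
    w -= lr * global_grad

print(round(w, 3))  # approaches the true slope of 2
```

The combined gradient equals the gradient computed over the full dataset, so the sharded run follows the same trajectory as a single-worker run. This is why the combine step maps naturally onto an allreduce from the communication backend.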