Practical Deep Learning at Scale with MLflow

By: Yong Liu
Overview of this book

The book starts with an overview of the deep learning (DL) life cycle and the emerging machine learning operations (MLOps) field, providing a clear picture of the four pillars of deep learning (data, model, code, and explainability) and the role of MLflow in these areas. From there onward, it guides you step by step in understanding the concept of MLflow experiments and usage patterns, using MLflow as a unified framework to track DL data, code and pipelines, models, parameters, and metrics at scale. You'll also tackle running DL pipelines in a distributed execution environment with reproducibility and provenance tracking, and tuning DL models through hyperparameter optimization (HPO) with Ray Tune, Optuna, and HyperBand. As you progress, you'll learn how to build a multi-step DL inference pipeline with preprocessing and postprocessing steps, deploy a DL inference pipeline to production using Ray Serve and AWS SageMaker, and finally create a DL explanation as a service (EaaS) using the popular SHapley Additive exPlanations (SHAP) toolbox. By the end of this book, you'll have built the foundation and gained the hands-on experience you need to develop a DL pipeline solution from initial offline experimentation to final deployment and production, all within a reproducible and open source framework.
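To give a flavor of the MLflow tracking pattern the book builds on, here is a minimal, illustrative sketch; the experiment name, parameter values, and metric numbers below are placeholders of our own, not examples taken from the book:

import mlflow

# Group runs under a named experiment (name is a placeholder)
mlflow.set_experiment("dl_sentiment_classifier")

with mlflow.start_run():
    # Log hyperparameters for this run (values are illustrative)
    mlflow.log_param("learning_rate", 1e-4)
    mlflow.log_param("backbone", "prajjwal1/bert-tiny")

    # Log a metric per epoch so runs can be compared in the MLflow UI
    for epoch, val_acc in enumerate([0.71, 0.83, 0.88]):  # placeholder values
        mlflow.log_metric("val_accuracy", val_acc, step=epoch)

Every run logged this way becomes comparable and reproducible in the MLflow tracking UI, which is the thread that ties the book's chapters together.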
Table of Contents (17 chapters)

Section 1 – Deep Learning Challenges and MLflow Prime
Section 2 – Tracking a Deep Learning Pipeline at Scale
Section 3 – Running Deep Learning Pipelines at Scale
Section 4 – Deploying a Deep Learning Pipeline at Scale
Section 5 – Deep Learning Model Explainability at Scale

Preface

From AlexNet, which won the large-scale ImageNet competition in 2012, to the BERT pre-trained language model, which topped many natural language processing (NLP) leaderboards in 2018, the revolution of modern deep learning (DL) in the broader artificial intelligence (AI) and machine learning (ML) community continues. Yet, the challenges of moving these DL models from offline experimentation to a production environment remain, largely due to the complexity of the DL life cycle and the lack of a unified open source framework to support its full development. This book will help you understand the big picture of DL full life cycle development and implement DL pipelines that can scale from a local offline experiment to a distributed environment and online production clouds, with an emphasis on hands-on, project-based learning that supports the end-to-end DL process using the popular open source MLflow framework.

The book starts with an overview of the DL full life cycle and the emerging machine learning operations (MLOps) field, providing a clear picture of the four pillars of DL (data, model, code, and explainability) and the role of MLflow in these areas. In the first chapter, you'll build a basic transfer learning-based NLP sentiment model using PyTorch Lightning Flash, which is then further developed, tuned, and deployed to production throughout the rest of the book. From there onward, the book guides you step by step in understanding the concept of MLflow experiments and usage patterns, using MLflow as a unified framework to track DL data, code and pipelines, models, parameters, and metrics at scale. We'll run DL pipelines in a distributed execution environment with reproducibility and provenance tracking, and tune DL models through hyperparameter optimization (HPO) with Ray Tune, Optuna, and HyperBand. We'll also build a multi-step DL inference pipeline with preprocessing and postprocessing steps, deploy a DL inference pipeline to production using Ray Serve and AWS SageMaker, and finally, provide DL explanation as a service (EaaS) using SHapley Additive exPlanations (SHAP) and MLflow integration.
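As a preview of the HPO workflow, here is a rough Ray Tune sketch using the pre-2.0 tune.run API; the objective function is a toy stand-in for a real DL training loop, and all names and values are illustrative rather than taken from the book:

from ray import tune

def train_fn(config):
    # Toy objective standing in for DL training; in practice this would
    # train the sentiment model and report a validation metric
    loss = (config["lr"] - 1e-3) ** 2
    tune.report(val_loss=loss)  # report the metric back to Ray Tune

analysis = tune.run(
    train_fn,
    config={"lr": tune.loguniform(1e-5, 1e-1)},  # search space
    num_samples=10,  # number of trials to sample
    metric="val_loss",
    mode="min",
)
print(analysis.best_config)  # best hyperparameters found

The same pattern extends to Optuna and HyperBand by plugging a search algorithm or trial scheduler into tune.run, and each trial's parameters and metrics can be logged to MLflow for side-by-side comparison.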

By the end of this book, you'll have the foundation and hands-on experience to build a DL pipeline from initial offline experimentation to final deployment and production, all within a reproducible and open source framework. Along the way, you'll also learn about the unique challenges of DL pipelines and how to overcome them with practical and scalable solutions, such as multi-core CPUs, graphics processing units (GPUs), distributed and parallel computing frameworks, and the cloud.