Engineering MLOps

By: Emmanuel Raj

Overview of this book

Engineering MLOps presents comprehensive insights into MLOps coupled with real-world examples in Azure to help you write programs, train robust and scalable ML models, and build ML pipelines to train and deploy models securely in production. The book begins by familiarizing you with the MLOps workflow so you can start writing programs to train ML models. You’ll then move on to explore options for serializing and packaging ML models post-training so that they can be deployed to facilitate machine learning inference, model interoperability, and end-to-end model traceability. You’ll learn how to build ML pipelines, continuous integration and continuous delivery (CI/CD) pipelines, and monitoring pipelines to systematically build, deploy, monitor, and govern ML solutions for businesses and industries. Finally, you’ll apply the knowledge you’ve gained to build real-world projects. By the end of this ML book, you’ll have a 360-degree view of MLOps and be ready to implement MLOps in your organization.

What is good data for ML?

Good ML models are the result of training on good-quality data, so good-quality data is a prerequisite for ML training. We therefore need to determine the quality of the data first and then process the data to improve it. Five characteristics will enable us to discern the quality of data, as follows:

Accuracy: Inaccurate data can lead to poor ML model performance and real-life consequences, which makes accuracy a crucial characteristic of data quality. To check the accuracy of the data, confirm whether the information represents a real-life situation.
Completeness: In most cases, incomplete information is unusable and can lead to incorrect outcomes if an ML model is trained on it. It is vital to check how complete the data is.
Reliability: Contradictions or duplications can make the data unreliable. Reliability is a vital characteristic; trusting the data is essential...
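
The first three characteristics can be approximated with simple automated checks. The following is a minimal sketch in plain Python, not the book's own code: the records, the field names, and the plausible value ranges in VALID_RANGES are hypothetical and chosen only for illustration. A range check is only a rough proxy for accuracy, the share of non-missing values stands in for completeness, and exact duplicate rows serve as one signal of unreliability.

```python
from collections import Counter

# Hypothetical sensor-style records; field names are illustrative only.
records = [
    {"timestamp": "2021-01-01T00:00", "temperature_c": 4.5, "humidity": 81},
    {"timestamp": "2021-01-01T01:00", "temperature_c": None, "humidity": 79},
    {"timestamp": "2021-01-01T01:00", "temperature_c": None, "humidity": 79},
    {"timestamp": "2021-01-01T02:00", "temperature_c": 3.9, "humidity": 140},
]

# Assumed plausible ranges, used as a rough proxy for accuracy.
VALID_RANGES = {"temperature_c": (-60.0, 60.0), "humidity": (0, 100)}


def completeness(rows):
    """Return the fraction of non-missing values for each field."""
    fields = rows[0].keys()
    return {f: sum(r[f] is not None for r in rows) / len(rows) for f in fields}


def out_of_range(rows, ranges):
    """Return indices of rows holding a value outside its assumed range."""
    bad = []
    for i, row in enumerate(rows):
        for field, (low, high) in ranges.items():
            value = row[field]
            if value is not None and not (low <= value <= high):
                bad.append(i)
                break  # one violation is enough to flag the row
    return bad


def duplicate_count(rows):
    """Return how many rows exactly repeat an earlier row."""
    counts = Counter(tuple(sorted(r.items())) for r in rows)
    return sum(n - 1 for n in counts.values())


report = {
    "completeness": completeness(records),
    "out_of_range_rows": out_of_range(records, VALID_RANGES),
    "duplicate_rows": duplicate_count(records),
}
print(report)
```

Checks like these only flag candidate problems. Whether a flagged value is genuinely wrong, or a duplicate is genuinely redundant, still has to be judged against the real-life situation the data is meant to represent.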