MLOps with Red Hat OpenShift

By: Ross Brigoli, Faisal Masood

Overview of this book

MLOps with OpenShift offers practical insights for implementing MLOps workflows on the dynamic OpenShift platform. As organizations worldwide seek to harness the power of machine learning operations, this book lays the foundation for your MLOps success. Starting with an exploration of key MLOps concepts, including data preparation, model training, and deployment, you’ll prepare to unleash OpenShift capabilities, kicking off with a primer on containers, pods, operators, and more. With the groundwork in place, you’ll be guided through MLOps workflows, uncovering the applications of popular machine learning frameworks for training and testing models on the platform. As you advance through the chapters, you’ll focus on the open-source data science and machine learning platform, Red Hat OpenShift Data Science, and its partner components, such as Pachyderm and Intel OpenVINO, to understand their role in building and managing data pipelines, as well as deploying and monitoring machine learning models. Armed with this comprehensive knowledge, you’ll be able to implement MLOps workflows on the OpenShift platform proficiently.
Table of Contents (13 chapters)

Part 1: Introduction
Part 2: Provisioning and Configuration
Part 3: Operating ML Workloads

Autoscaling the deployed models

When creating a model server, you will be presented with an option to set the number of replicas. This corresponds to the number of model server instances to be created, and it allows you to increase or decrease the serving capacity of your model server. Figure 5.12 shows this option as Model server replicas:
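Under the hood, the replica count maps to the standard Kubernetes `replicas` field on the workload that backs the model server. As a rough sketch only, here is what a backing Deployment with a fixed replica count looks like; the resource names and image are hypothetical, and the resources the dashboard actually creates may differ:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: model-server            # hypothetical name of the serving deployment
spec:
  replicas: 3                   # number of serving instances, fixed at creation time
  selector:
    matchLabels:
      app: model-server
  template:
    metadata:
      labels:
        app: model-server
    spec:
      containers:
      - name: server
        image: model-server-image:latest   # hypothetical serving image
```

With this approach, changing capacity later means manually editing the replica count, for example with `oc scale deployment/model-server --replicas=5`.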

Figure 5.12 – Add model server

However, with this approach, you need to decide on the number of serving instances, or replicas, at the time of the model server’s creation. OpenShift provides another construct, the horizontal pod autoscaler, which automatically increases or decreases the number of replicas of the model server based on the memory or CPU utilization of its instances. This allows you to scale workloads automatically to match demand.
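The autoscaling behavior can be expressed declaratively as a HorizontalPodAutoscaler resource that targets the model server’s deployment. The following is a minimal sketch assuming a hypothetical deployment named model-server; adjust the target name and thresholds to your environment:

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: model-server-hpa        # hypothetical autoscaler name
spec:
  scaleTargetRef:               # the workload whose replica count the HPA manages
    apiVersion: apps/v1
    kind: Deployment
    name: model-server          # hypothetical deployment backing the model server
  minReplicas: 1                # never scale below one serving instance
  maxReplicas: 5                # upper bound on serving instances
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 80  # add replicas when average CPU exceeds 80%
```

An equivalent autoscaler can also be created from the command line with `oc autoscale deployment/model-server --min=1 --max=5 --cpu-percent=80`.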

Let’s see how the model server that we defined with the data science project is deployed...