Book Image

MLOps with Red Hat OpenShift

By : Ross Brigoli, Faisal Masood
Book Image

MLOps with Red Hat OpenShift

By: Ross Brigoli, Faisal Masood

Overview of this book

MLOps with OpenShift offers practical insights for implementing MLOps workflows on the dynamic OpenShift platform. As organizations worldwide seek to harness the power of machine learning operations, this book lays the foundation for your MLOps success. Starting with an exploration of key MLOps concepts, including data preparation, model training, and deployment, you’ll prepare to unleash OpenShift capabilities, kicking off with a primer on containers, pods, operators, and more. With the groundwork in place, you’ll be guided to MLOps workflows, uncovering the applications of popular machine learning frameworks for training and testing models on the platform. As you advance through the chapters, you’ll focus on the open-source data science and machine learning platform, Red Hat OpenShift Data Science, and its partner components, such as Pachyderm and Intel OpenVino, to understand their role in building and managing data pipelines, as well as deploying and monitoring machine learning models. Armed with this comprehensive knowledge, you’ll be able to implement MLOps workflows on the OpenShift platform proficiently.
Table of Contents (13 chapters)
Free Chapter
1
Part 1: Introduction
3
Part 2: Provisioning and Configuration
6
Part 3: Operating ML Workloads

Installing partner software on RedHat ODS

In order to complete our MLOps platform, we will need to install additional tools to OpenShift to complement the features of ODS. Several tools are considered partner software of the ODS platform. These software products are listed in the ODS console and can be viewed by clicking the Explore menu item in the ODS console, as shown in Figure 2.34:

Figure 2.34 – ODS console showing partner software

Figure 2.34 – ODS console showing partner software

One of the things that you will need to complete the MLOps platform is data versioning, and you will need data lineage too. We will use Pachyderm for this need.

Pachyderm is a powerful OSS tool designed to manage data in modern data pipelines. It serves as a data versioning and lineage system, enabling efficient tracking and control of data changes throughout the data processing life cycle.

With Pachyderm, you can easily keep track of modifications made to your data, similar to how version control systems ...