Book Image

MLOps with Red Hat OpenShift

By : Ross Brigoli, Faisal Masood
Book Image

MLOps with Red Hat OpenShift

By: Ross Brigoli, Faisal Masood

Overview of this book

MLOps with OpenShift offers practical insights for implementing MLOps workflows on the dynamic OpenShift platform. As organizations worldwide seek to harness the power of machine learning operations, this book lays the foundation for your MLOps success. Starting with an exploration of key MLOps concepts, including data preparation, model training, and deployment, you’ll prepare to unleash OpenShift capabilities, kicking off with a primer on containers, pods, operators, and more. With the groundwork in place, you’ll be guided to MLOps workflows, uncovering the applications of popular machine learning frameworks for training and testing models on the platform. As you advance through the chapters, you’ll focus on the open-source data science and machine learning platform, Red Hat OpenShift Data Science, and its partner components, such as Pachyderm and Intel OpenVino, to understand their role in building and managing data pipelines, as well as deploying and monitoring machine learning models. Armed with this comprehensive knowledge, you’ll be able to implement MLOps workflows on the OpenShift platform proficiently.
Table of Contents (13 chapters)
Free Chapter
1
Part 1: Introduction
3
Part 2: Provisioning and Configuration
6
Part 3: Operating ML Workloads

Configuring Pachyderm

Let’s start by configuring Pachyderm. Pachyderm is a platform that assists data scientists in creating complete ML workflows covering all the stages from data ingestion and model training up to deploying into production. Think of it as a version control system (VCS) for your model development workflow.

In traditional software engineering, you may use Git to version control your code. In ML projects, you need to version control your data, and you want a reproducible flow for training your model. Pachyderm provides such capabilities for you. You will see how Red Hat OpenShift enables you to use Pachyderm. Refer to Chapter 3 for instructions on installing the Pachyderm operator.

Follow these steps to configure Pachyderm. Pachyderm needs a relational database management system (RDBMS) to store metadata, and the operator takes care of the Pachyderm and related database components. Pachyderm requires Simple Storage Service (S3) storage to store the Pachyderm...