Book Image

The Definitive Guide to Google Vertex AI

By : Jasmeet Bhatia, Kartik Chaudhary

4 (1)

Book Image

The Definitive Guide to Google Vertex AI

4 (1)

By: Jasmeet Bhatia, Kartik Chaudhary

Overview of this book

While AI has become an integral part of every organization today, the development of large-scale ML solutions and management of complex ML workflows in production continue to pose challenges for many. Google’s unified data and AI platform, Vertex AI, directly addresses these challenges with its array of MLOPs tools designed for overall workflow management. This book is a comprehensive guide that lets you explore Google Vertex AI’s easy-to-advanced level features for end-to-end ML solution development. Throughout this book, you’ll discover how Vertex AI empowers you by providing essential tools for critical tasks, including data management, model building, large-scale experimentations, metadata logging, model deployments, and monitoring. You’ll learn how to harness the full potential of Vertex AI for developing and deploying no-code, low-code, or fully customized ML solutions. This book takes a hands-on approach to developing u deploying some real-world ML solutions on Google Cloud, leveraging key technologies such as Vision, NLP, generative AI, and recommendation systems. Additionally, this book covers pre-built and turnkey solution offerings as well as guidance on seamlessly integrating them into your ML workflows. By the end of this book, you’ll have the confidence to develop and deploy large-scale production-grade ML solutions using the MLOps tooling and best practices from Google.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Share Your Thoughts

Download a free PDF copy of this book

Part 1:The Importance of MLOps in a Real-World ML Deployment

Part 1:The Importance of MLOps in a Real-World ML Deployment

Free Chapter

Chapter 1: Machine Learning Project Life Cycle and Challenges

Chapter 1: Machine Learning Project Life Cycle and Challenges

ML project life cycle

Common challenges in developing real-world ML solutions

Limitations of ML

Chapter 2: What Is MLOps, and Why Is It So Important for Every ML Team?

Chapter 2: What Is MLOps, and Why Is It So Important for Every ML Team?

Why is MLOps important?

Implementing different MLOps maturity levels

How can Vertex AI help with implementing MLOps?

Part 2: Machine Learning Tools for Custom Models on Google Cloud

Part 2: Machine Learning Tools for Custom Models on Google Cloud

Chapter 3: It’s All About Data – Options to Store and Transform ML Datasets

Chapter 3: It’s All About Data – Options to Store and Transform ML Datasets

Moving data to Google Cloud

Where to store data

Transforming data

Chapter 4: Vertex AI Workbench – a One-Stop Tool for AI/ML Development Needs

Chapter 4: Vertex AI Workbench – a One-Stop Tool for AI/ML Development Needs

What is Jupyter Notebook?

Vertex AI Workbench

Custom containers for Vertex AI Workbench

Scheduling notebooks in Vertex AI

Chapter 5: No-Code Options for Building ML Models

Chapter 5: No-Code Options for Building ML Models

ML modeling options in Google Cloud

What is AutoML?

Vertex AI AutoML

Importing data to use with Vertex AI AutoML

Generating predictions using the recently trained model

Deploying a model in Vertex AI

Generating predictions

Chapter 6: Low-Code Options for Building ML Models

Chapter 6: Low-Code Options for Building ML Models

Getting started with BigQuery

Using BQML for feature transformations

Building ML models with BQML

Creating BQML models

Hyperparameter tuning with BQML

Evaluating trained models

Doing inference with BQML

Chapter 7: Training Fully Custom ML Models with Vertex AI

Chapter 7: Training Fully Custom ML Models with Vertex AI

Technical requirements

Building a basic deep learning model with TensorFlow

Packaging a model to submit it to Vertex AI as a training job

Monitoring model training progress

Evaluating trained models

Chapter 8: ML Model Explainability

Chapter 8: ML Model Explainability

What is Explainable AI and why is it important for MLOps practitioners?

Explainable AI techniques

Explainable AI features available in Google Cloud Vertex AI

Chapter 9: Model Optimizations – Hyperparameter Tuning and NAS

Chapter 9: Model Optimizations – Hyperparameter Tuning and NAS

Technical requirements

What is HPT and why is it important?

Setting up HPT jobs on Vertex AI

What is NAS and how is it different from HPT?

NAS on Vertex AI overview

Chapter 10: Vertex AI Deployment and Automation Tools – Orchestration through Managed Kubeflow Pipelines

Chapter 10: Vertex AI Deployment and Automation Tools – Orchestration through Managed Kubeflow Pipelines

Technical requirements

Orchestrating ML workflows using Vertex AI Pipelines (managed Kubeflow pipelines)

Orchestrating ML workflows using Cloud Composer (managed Airflow)

Vertex AI Pipelines versus Cloud Composer

Getting predictions on Vertex AI

Managing deployed models on Vertex AI

Chapter 11: MLOps Governance with Vertex AI

Chapter 11: MLOps Governance with Vertex AI

What is MLOps governance and what are its key components?

Enterprise scenarios that highlight the importance of MLOps governance

Tools in Vertex AI that can help with governance

Part 3: Prebuilt/Turnkey ML Solutions Available in GCP

Part 3: Prebuilt/Turnkey ML Solutions Available in GCP

Chapter 12: Vertex AI – Generative AI Tools

Chapter 12: Vertex AI – Generative AI Tools

GenAI fundamentals

GenAI with Vertex AI

Building and deploying GenAI applications with Vertex AI

Enhancing GenAI performance with model tuning in Vertex AI

Chapter 13: Document AI – An End-to-End Solution for Processing Documents

Chapter 13: Document AI – An End-to-End Solution for Processing Documents

Technical requirements

What is Document AI?

Overview of existing Document AI processors

Creating custom Document AI processors

Chapter 14: ML APIs for Vision, NLP, and Speech

Chapter 14: ML APIs for Vision, NLP, and Speech

Vision AI on Google Cloud

Translation AI on Google Cloud

Natural Language AI on Google Cloud

Speech AI on Google Cloud

Part 4: Building Real-World ML Solutions with Google Cloud

Part 4: Building Real-World ML Solutions with Google Cloud

Chapter 15: Recommender Systems – Predict What Movies a User Would Like to Watch

Chapter 15: Recommender Systems – Predict What Movies a User Would Like to Watch

Different types of recommender systems

Deploying a movie recommender system on Vertex AI

Chapter 16: Vision-Based Defect Detection System – Machines Can See Now!

Chapter 16: Vision-Based Defect Detection System – Machines Can See Now!

Technical requirements

Vision-based defect detection

Deploying a vision model to a Vertex AI endpoint

Getting online predictions from a vision model

Chapter 17: Natural Language Models – Detecting Fake News Articles!

Chapter 17: Natural Language Models – Detecting Fake News Articles!

Technical requirements

Detecting fake news using NLP

Launching model training on Vertex AI

BERT-based fake news classification

Index

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

4 (1)

5 star

0

4 star

100%

3 star

0

2 star

0

1 star

0

Deploying a model in Vertex AI

Now, let us walk you through the steps of deploying the trained model on Vertex AI to enable real-time predictions:

Go to Model Registry, click on the model and then the model version you want to deploy, and on the DEPLOY & TEST tab, click DEPLOY TO ENDPOINT.

Figure 5.18 – Initiating model deployment

Figure 5.18 – Initiating model deployment

Type in the desired name of the API endpoint being created and click CONTINUE.

Figure 5.19 – Creating a model endpoint

Figure 5.19 – Creating a model endpoint

You can leave all default options unchanged for quick test deployment, but these are the settings you need to understand:
- Traffic split: If multiple versions of the model are deployed on the same API endpoint, this option allows users to define what percentage of total traffic is allocated to a specific version. For example, when deploying a new model, you might want only 2% of the overall incoming data to be routed to the new model so that...