Chapter 3: Understanding Data Processing | Practical Automated Machine Learning Using H2O.ai

Practical Automated Machine Learning Using H2O.ai

By: Salil Ajgaonkar

4.6 (5)

Overview of this book

With the huge amount of data being generated over the internet and the benefits that Machine Learning (ML) predictions bring to businesses, ML implementation has become low-hanging fruit that everyone is striving for. The complex mathematics behind it, however, can be discouraging for many users. This is where H2O comes in: it automates various repetitive steps, and this encapsulation helps developers focus on results rather than on handling complexities. You’ll begin by understanding how H2O’s AutoML simplifies the implementation of ML by providing a simple, easy-to-use interface to train and use ML models. Next, you’ll see how AutoML automates the entire process of training multiple models, optimizing their hyperparameters, and explaining their performance. As you advance, you’ll find out how to leverage a Plain Old Java Object (POJO) and a Model Object, Optimized (MOJO) to deploy your models to production. Throughout this book, you’ll take a hands-on approach to implementation using H2O that’ll enable you to set up your ML systems in no time. By the end of this H2O book, you’ll be able to train and use your ML models using H2O AutoML, from experimentation all the way to production, without needing to understand complex statistics or data science.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Share Your Thoughts

Part 1 H2O AutoML Basics

Chapter 1: Understanding H2O AutoML Basics

Technical requirements

Understanding AutoML and H2O AutoML

Minimum system requirements to use H2O AutoML

Installing Java

Basic implementation of H2O using Python

Basic implementation of H2O using R

Training your first ML model using H2O AutoML

Summary

Chapter 2: Working with H2O Flow (H2O’s Web UI)

Technical requirements

Understanding the basics of H2O Flow

Working with data functions in H2O Flow

Working with model training functions in H2O Flow

Working with prediction functions in H2O Flow

Summary

Part 2 H2O AutoML Deep Dive

Chapter 3: Understanding Data Processing

Technical requirements

Reframing your dataframe

Handling missing values in the dataframe

Manipulating feature columns of the dataframe

Tokenization of textual data

Encoding data using target encoding

Summary

Chapter 4: Understanding H2O AutoML Architecture and Training

Observing the high-level architecture of H2O

Learning about the flow of interaction between the client and the H2O service

Understanding how H2O AutoML performs hyperparameter optimization and training

Summary

Chapter 5: Understanding AutoML Algorithms

Understanding the different types of ML algorithms

Understanding the Generalized Linear Model algorithm

Understanding the Distributed Random Forest algorithm

Understanding the Gradient Boosting Machine algorithm

Understanding what is Deep Learning

Summary

Chapter 6: Understanding H2O AutoML Leaderboard and Other Performance Metrics

Exploring the H2O AutoML leaderboard performance metrics

Exploring other model performance metrics

Summary

Chapter 7: Working with Model Explainability

Technical requirements

Working with the model explainability interface

Exploring the various explainability features

Summary

Part 3 H2O AutoML Advanced Implementation and Productization

Chapter 8: Exploring Optional Parameters for H2O AutoML

Technical requirements

Experimenting with parameters that support imbalanced classes

Experimenting with parameters that support early stopping

Experimenting with parameters that support cross-validation

Summary

Chapter 9: Exploring Miscellaneous Features in H2O AutoML

Technical requirements

Understanding H2O AutoML integration in scikit-learn

Understanding H2O AutoML event logging

Summary

Chapter 10: Working with Plain Old Java Objects (POJOs)

Technical requirements

Introduction to POJOs

Extracting H2O models as POJOs

Using a H2O model as a POJO

Summary

Chapter 11: Working with Model Object, Optimized (MOJO)

Technical requirements

Understanding what a MOJO is

Extracting H2O models as MOJOs

Viewing model MOJOs

Using H2O AutoML model MOJOs to make predictions

Summary

Chapter 12: Working with H2O AutoML and Apache Spark

Technical requirements

Exploring Apache Spark

Exploring H2O Sparkling Water

Summary

Chapter 13: Using H2O AutoML with Other Technologies

Technical requirements

Using H2O AutoML and Spring Boot

Using H2O AutoML and Apache Storm

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts
