Book Image

Practical Automated Machine Learning Using H2O.ai

By : Salil Ajgaonkar
Book Image

Practical Automated Machine Learning Using H2O.ai

By: Salil Ajgaonkar

Overview of this book

With the huge amount of data being generated over the internet and the benefits that Machine Learning (ML) predictions bring to businesses, ML implementation has become a low-hanging fruit that everyone is striving for. The complex mathematics behind it, however, can be discouraging for a lot of users. This is where H2O comes in – it automates various repetitive steps, and this encapsulation helps developers focus on results rather than handling complexities. You’ll begin by understanding how H2O’s AutoML simplifies the implementation of ML by providing a simple, easy-to-use interface to train and use ML models. Next, you’ll see how AutoML automates the entire process of training multiple models, optimizing their hyperparameters, as well as explaining their performance. As you advance, you’ll find out how to leverage a Plain Old Java Object (POJO) and Model Object, Optimized (MOJO) to deploy your models to production. Throughout this book, you’ll take a hands-on approach to implementation using H2O that’ll enable you to set up your ML systems in no time. By the end of this H2O book, you’ll be able to train and use your ML models using H2O AutoML, right from experimentation all the way to production without a single need to understand complex statistics or data science.
Table of Contents (19 chapters)
1
Part 1 H2O AutoML Basics
4
Part 2 H2O AutoML Deep Dive
10
Part 3 H2O AutoML Advanced Implementation and Productization

Extracting H2O models as MOJOs

Just like POJOs, you can extract models trained using H2O’s AutoML using any of the H2O-supported languages.

In the following sub-sections, we shall learn how to extract the model MOJOs using the Python and R programming languages, as well as see how we can extract model MOJOs using H2O Flow.

Extracting H2O models as MOJOs in Python

Let’s see how we can extract models as MOJOs using Python. We shall use the same Iris flower dataset for running AutoML.

Follow these steps to train models using H2O AutoML. Then, we shall extract the leader model and download it as a MOJO:

  1. Import the h2o module and spin up your H2O server:
    import h2o
    h2o.init()
  2. Import the Iris dataset by passing the appropriate location of the dataset in your system. Execute the following command:
    data_frame = h2o.import_file("Dataset/iris.data")
  3. Set the feature and label names by executing the following command:
    features = data_frame.columns...