Chapter 5: Advanced Model Building – Part I | Machine Learning at Scale with H2O

Book Overview & Buying
Table Of Contents

Machine Learning at Scale with H2O

By : Gregory Keys, David Whiting

5 (2)

Buy this Book

Machine Learning at Scale with H2O

5 (2)

By: Gregory Keys, David Whiting

Buy this Book

Overview of this book

H2O is an open source, fast, and scalable machine learning framework that allows you to build models using big data and then easily productionalize them in diverse enterprise environments. Machine Learning at Scale with H2O begins with an overview of the challenges faced in building machine learning models on large enterprise systems, and then addresses how H2O helps you to overcome them. You’ll start by exploring H2O’s in-memory distributed architecture and find out how it enables you to build highly accurate and explainable models on massive datasets using your favorite ML algorithms, language, and IDE. You’ll also get to grips with the seamless integration of H2O model building and deployment with Spark using H2O Sparkling Water. You’ll then learn how to easily deploy models with H2O MOJO. Next, the book shows you how H2O Enterprise Steam handles admin configurations and user management, and then helps you to identify different stakeholder perspectives that a data scientist must understand in order to succeed in an enterprise setting. Finally, you’ll be introduced to the H2O AI Cloud platform and explore the entire machine learning life cycle using multiple advanced AI capabilities. By the end of this book, you’ll be able to build and deploy advanced, state-of-the-art machine learning models for your business needs.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Share Your Thoughts

Section 1 – Introduction to the H2O Machine Learning Platform for Data at Scale

Free Chapter

Chapter 1: Opportunities and Challenges

ML at scale

The ML life cycle and three challenge areas for ML at scale

H2O.ai's answer to these challenges

Summary

Chapter 2: Platform Components and Key Concepts

Technical requirements

Hello World – the H2O machine learning code

The components of H2O machine learning at scale

The workflow using H2O components

H2O key concepts

Summary

Chapter 3: Fundamental Workflow – Data to Deployable Model

Technical requirements

Use case and data overview

The fundamental workflow

Variation points – alternatives and extensions to the fundamental workflow

Summary

Section 2 – Building State-of-the-Art Models on Large Data Volumes Using H2O

Chapter 4: H2O Model Building at Scale – Capability Articulation

H2O data capabilities during model building

H2O machine learning algorithms

H2O modeling capabilities

Summary

Chapter 5: Advanced Model Building – Part I

Technical requirements

Splitting data for validation or cross-validation and testing

Algorithm considerations

Model optimization with grid search

H2O AutoML

Feature engineering options

Leveraging H2O Flow to enhance your IDE workflow

Putting it all together – algorithms, feature engineering, grid search, and AutoML

Summary

Chapter 6: Advanced Model Building – Part II

Technical requirements

Modeling in Sparkling Water

UL methods in H2O

Best practices for updating H2O models

Ensuring H2O model reproducibility

Summary

Chapter 7: Understanding ML Models

Selecting model performance metrics

Explaining models built in H2O

Automated model documentation (H2O AutoDoc)

Summary

Chapter 8: Putting It All Together

Technical requirements

Data wrangling

Feature engineering

Model building and evaluation

Preparation for model pipeline deployment

Summary

Section 3 – Deploying Your Models to Production Environments

Chapter 9: Production Scoring and the H2O MOJO

Technical requirements

The model building and model scoring contexts

H2O production scoring

H2O MOJO deep dive

Wrapping MOJOs using the H2O MOJO API

Other things to know about MOJOs

Summary

Chapter 10: H2O Model Deployment Patterns

Technical requirements

Surveying a sample of MOJO deployment patterns

Exploring examples of MOJO scoring with H2O software

Exploring examples of MOJO scoring with third-party software

Exploring examples of MOJO scoring with your target-system software

Exploring examples of accelerators based on H2O Driverless AI integrations

Summary

Section 4 – Enterprise Stakeholder Perspectives

Chapter 11: The Administrator and Operations Views

A model building and deployment view – the personas on the ground

View 1 – Enterprise Steam administrator

View 2 – The operations team

View 3 – The data scientist

Summary

Chapter 12: The Enterprise Architect and Security Views

Technical requirements

The enterprise and security architect view

H2O at Scale enterprise architecture

H2O at Scale security

The data scientist's view of enterprise and security architecture

Summary

Section 5 – Broadening the View – Data to AI Applications with the H2O AI Cloud Platform

Chapter 13: Introducing H2O AI Cloud

Technical requirements

An H2O AI Cloud overview

H2O AI Cloud component breakdown

H2O AI Cloud architecture

Summary

Chapter 14: H2O at Scale in a Larger Platform Context

Technical requirements

A quick recap of H2O AI Cloud

Exploring a baseline reference solution for H2O at scale

Exploring new possibilities for H2O at scale

A Reference H2O Wave app as an enterprise AI integration fabric

Summary

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Appendix : Alternative Methods to Launch H2O Clusters

Local H2O-3 cluster

Local Sparkling Water cluster

H2O-3 cluster in the 90-day free trial environment for H2O AI Cloud

Why subscribe?

Machine Learning at Scale with H2O

By : Gregory Keys, David Whiting

Machine Learning at Scale with H2O

By: Gregory Keys, David Whiting

Overview of this book

Summary

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access