Machine Learning Automation with TPOT

By: Dario Radečić

Overview of this book

The automation of machine learning tasks allows developers more time to focus on the usability and reactivity of the software powered by machine learning models. TPOT is a Python automated machine learning tool used for optimizing machine learning pipelines using genetic programming. Automating machine learning with TPOT enables individuals and companies to develop production-ready machine learning models cheaper and faster than with traditional methods. With this practical guide to AutoML, developers working with Python on machine learning tasks will be able to put their knowledge to work and become productive quickly. You'll adopt a hands-on approach to learning the implementation of AutoML and associated methodologies. Complete with step-by-step explanations of essential concepts, practical examples, and self-assessment questions, this book will show you how to build automated classification and regression models and compare their performance to custom-built models. As you advance, you'll also develop state-of-the-art models using only a couple of lines of code and see how those models outperform all of your previous models on the same datasets. By the end of this book, you'll have gained the confidence to implement AutoML techniques in your organization on a production level.

Applying automated regression modeling to the insurance dataset

This section applies an automated machine learning solution to a slightly more complicated dataset. You will use the medical insurance cost dataset (https://www.kaggle.com/mirichoi0218/insurance) to predict how much insurance will cost based on a handful of predictor variables. You will learn how to load the dataset, perform exploratory data analysis, prepare the data, and find the best machine learning pipeline with TPOT:

As with the previous example, the first step is to load the libraries and the dataset. The analysis starts with numpy, pandas, matplotlib, and seaborn. Here's how to import the libraries and load the dataset:

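# Numerical and tabular data handling
import numpy as np
import pandas as pd
# Plotting libraries for exploratory data analysis
import matplotlib.pyplot as plt
import seaborn as sns
from matplotlib import rcParams
# Hide the top and right axes borders on all subsequent plots
rcParams['axes.spines.top'] = False
rcParams['axes.spines.right'] = False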
df = pd.read_csv(...
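The excerpt above is cut off before the dataset path, so the following is only a minimal sketch of the preparation and TPOT steps the section describes, not the author's code. It assumes the column layout of the Kaggle insurance dataset (age, sex, bmi, children, smoker, region, charges) and uses a few made-up rows in place of the real file. The helper names `prepare` and `fit_tpot` are hypothetical, and `fit_tpot` assumes the classic TPOT API (`TPOTRegressor` with `generations` and `population_size`), which may differ in newer releases.

```python
import pandas as pd

# Hypothetical stand-in rows: the values are invented for illustration and
# only mirror the column layout of the Kaggle insurance dataset.
sample = pd.DataFrame({
    "age": [19, 33, 45, 52, 28, 61],
    "sex": ["female", "male", "male", "female", "male", "female"],
    "bmi": [27.9, 22.7, 30.1, 34.5, 25.3, 29.0],
    "children": [0, 1, 2, 3, 0, 1],
    "smoker": ["yes", "no", "no", "yes", "no", "no"],
    "region": ["southwest", "northwest", "southeast",
               "northeast", "southwest", "southeast"],
    "charges": [16000.0, 4500.0, 8200.0, 41000.0, 3900.0, 13500.0],
})


def prepare(df):
    """Return (X, y): one-hot encoded predictors and the charges target."""
    encoded = pd.get_dummies(
        df,
        columns=["sex", "smoker", "region"],
        drop_first=True,  # drop one level per category to avoid redundancy
        dtype=int,
    )
    X = encoded.drop(columns="charges")
    y = encoded["charges"]
    return X, y


def fit_tpot(X, y, generations=5, population_size=20):
    """Fit a TPOT regression pipeline (assumes the classic TPOT API)."""
    # Imported lazily so the preparation step works without TPOT installed.
    from sklearn.model_selection import train_test_split
    from tpot import TPOTRegressor

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=42
    )
    model = TPOTRegressor(
        generations=generations,
        population_size=population_size,
        scoring="neg_mean_squared_error",
        random_state=42,
        verbosity=2,
    )
    model.fit(X_train, y_train)
    # score() reports the chosen scoring metric on the held-out rows
    print("Held-out score:", model.score(X_test, y_test))
    return model


X, y = prepare(sample)
print(X.columns.tolist())
```

With the real dataset loaded into `df`, the same flow would be `X, y = prepare(df)` followed by `fit_tpot(X, y)`. The small `generations` and `population_size` values are chosen only to keep a first run short; they are not tuned settings.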
