Python Machine Learning Cookbook - Second Edition

By: Giuseppe Ciaburro, Prateek Joshi

Building a ridge regressor

One of the main problems of linear regression is that it's sensitive to outliers. During data collection in the real world, it's quite common for output values to be measured incorrectly. Linear regression uses ordinary least squares, which tries to minimize the sum of squared errors. Outliers cause problems because they contribute heavily to the overall error, and this can distort the entire model.

Let's deepen our understanding of outliers: outliers are values that are particularly extreme compared to the others (values that are clearly distant from the other observations). Outliers are an issue because they can distort the results of data analysis, in particular descriptive statistics and correlations. We should identify them during the data cleaning phase, but we can also deal with them in later stages of the analysis. Outliers can be univariate, when they have an extreme value for a single variable, or multivariate, when they have an unusual combination of values across several variables. Let's consider the following diagram:

The two points on the bottom right are clearly outliers, but the model tries to fit all the points, so the overall fit tends to be inaccurate. Outliers are extreme values of a distribution, either much higher or much lower than the rest, and they therefore represent isolated cases with respect to the other observations. By visual inspection, we can see that the following is a better model:

Ordinary least squares considers every single data point when it's building the model. Hence, the actual model ends up looking like the dotted line shown in the preceding graph. We can clearly see that this model is suboptimal.
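To see this sensitivity numerically, here is a minimal sketch (using made-up data rather than the book's dataset) that fits ordinary least squares with and without a single extreme point and prints the resulting slopes:

import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data that roughly follows y = 2x
rng = np.random.RandomState(0)
X = np.arange(10, dtype=float).reshape(-1, 1)
y = 2 * X.ravel() + rng.normal(0, 0.5, 10)

clean_fit = LinearRegression().fit(X, y)

# Add one clearly extreme response value and refit
X_out = np.vstack([X, [[9.5]]])
y_out = np.append(y, -30.0)
outlier_fit = LinearRegression().fit(X_out, y_out)

print("Slope without the outlier:", round(clean_fit.coef_[0], 2))
print("Slope with the outlier:", round(outlier_fit.coef_[0], 2))

A single bad measurement is enough to pull the fitted slope noticeably away from the true value of 2.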

The regularization approach involves modifying the performance function, normally the sum of squared regression errors on the training set. When a large number of variables is available, the least squares estimates of a linear model often have low bias but high variance compared with models that use fewer variables; under these conditions, the model overfits. To improve prediction accuracy by accepting slightly more bias in exchange for smaller variance, we can use variable selection methods or dimensionality reduction, but these may be unattractive: the former because of its computational burden, the latter because the resulting model is harder to interpret.

Another way to address the problem of overfitting is to modify the estimation method by neglecting the requirement of an unbiased parameter estimator and instead considering the possibility of using a biased estimator, which may have smaller variance. There are several biased estimators, most of which are based on regularization: Ridge, Lasso, and ElasticNet are the most popular methods.
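All three estimators are available in scikit-learn's linear_model module; the class names and parameter names below are the library's own, while the alpha values are arbitrary illustrative choices:

from sklearn import linear_model

ridge = linear_model.Ridge(alpha=0.01)                        # L2 penalty on the coefficients
lasso = linear_model.Lasso(alpha=0.01)                        # L1 penalty on the coefficients
elastic = linear_model.ElasticNet(alpha=0.01, l1_ratio=0.5)   # mix of L1 and L2 penalties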

Getting ready

Ridge regression is a regularization method where a penalty is imposed on the size of the coefficients. As we said in the Building a linear regressor section, in the ordinary least squares method, the coefficients are estimated by determining numerical values that minimize the sum of the squared deviations between the observed responses and the fitted responses, according to the following equation:
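In the usual notation, with β0 as the intercept and βj as the coefficient of the j-th predictor, this criterion is the residual sum of squares:

\mathrm{RSS}(\beta) = \sum_{i=1}^{n} \left( y_i - \beta_0 - \sum_{j=1}^{p} \beta_j x_{ij} \right)^2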

Ridge regression, in order to estimate the β coefficients, starts from the basic formula of the residual sum of squares (RSS) and adds a penalty term. λ (≥ 0) is the tuning parameter, which multiplies the sum of the squared β coefficients (excluding the intercept) to define the penalty term, as shown in the following equation:
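In the same notation, the quantity minimized by ridge regression is the RSS plus the penalty term:

\sum_{i=1}^{n} \left( y_i - \beta_0 - \sum_{j=1}^{p} \beta_j x_{ij} \right)^2 + \lambda \sum_{j=1}^{p} \beta_j^2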

It is evident that λ = 0 means no penalty in the model; that is, we would produce the same estimates as least squares. On the other hand, a λ tending toward infinity means a strong penalty effect, which will push many coefficients close to zero, although it will not exclude them from the model. The small sketch below illustrates this shrinking behavior as alpha (scikit-learn's name for this tuning parameter) grows.
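This is only an illustrative sketch with made-up data, not the book's dataset; it prints the fitted coefficients for a small, a moderate, and a very large alpha:

import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.RandomState(0)
X = rng.normal(size=(50, 3))
y = X @ np.array([3.0, -2.0, 0.5]) + rng.normal(0, 0.1, 50)

for alpha in (0.001, 1.0, 1000.0):
    model = Ridge(alpha=alpha).fit(X, y)
    print("alpha =", alpha, "coefficients =", np.round(model.coef_, 3))
# A small alpha reproduces the least squares coefficients almost exactly;
# a very large alpha shrinks every coefficient toward (but not exactly to) zero.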

How to do it...

Let's see how to build a ridge regressor in Python:

  1. You can use the data already used in the previous example: Building a linear regressor (VehiclesItaly.txt). This file contains two values in each line. The first value is the explanatory variable, and the second is the response variable.
  2. Add the following lines to regressor.py. Let's initialize a ridge regressor with some parameters:
from sklearn import linear_model
ridge_regressor = linear_model.Ridge(alpha=0.01, fit_intercept=True, max_iter=10000)
  3. The alpha parameter controls the complexity. As alpha gets closer to 0, the ridge regressor tends to become more like a linear regressor with ordinary least squares. So, if you want to make it robust against outliers, you need to assign a higher value to alpha. We considered a value of 0.01, which is moderate.
  4. Let's train this regressor, as follows:
# sm is sklearn.metrics (import sklearn.metrics as sm); X_train, y_train, X_test,
# and y_test are the splits created in the Building a linear regressor recipe
ridge_regressor.fit(X_train, y_train)
y_test_pred_ridge = ridge_regressor.predict(X_test)
print("Mean absolute error =", round(sm.mean_absolute_error(y_test, y_test_pred_ridge), 2))
print("Mean squared error =", round(sm.mean_squared_error(y_test, y_test_pred_ridge), 2))
print("Median absolute error =", round(sm.median_absolute_error(y_test, y_test_pred_ridge), 2))
print("Explained variance score =", round(sm.explained_variance_score(y_test, y_test_pred_ridge), 2))
print("R2 score =", round(sm.r2_score(y_test, y_test_pred_ridge), 2))

Run this code to view the error metrics. You can build a linear regressor on the same data to compare and contrast the results and see the effect of introducing regularization into the model; one possible end-to-end sketch of that comparison follows.
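In this sketch, the file name and the alpha of 0.01 come from this recipe and the previous one, while the comma delimiter, the 80/20 split, and the random_state are illustrative assumptions:

import numpy as np
import sklearn.metrics as sm
from sklearn import linear_model
from sklearn.model_selection import train_test_split

# Each line of VehiclesItaly.txt holds an explanatory value and a response value
# (assumed here to be comma-separated; adjust the delimiter if needed)
data = np.loadtxt('VehiclesItaly.txt', delimiter=',')
X, y = data[:, :1], data[:, 1]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

for name, model in [("Linear", linear_model.LinearRegression()),
                    ("Ridge", linear_model.Ridge(alpha=0.01))]:
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    print(name, "MSE =", round(sm.mean_squared_error(y_test, y_pred), 2),
          "R2 =", round(sm.r2_score(y_test, y_pred), 2))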

How it works...

Ridge regression is a regularization method where a penalty is imposed on the size of the coefficients. Ridge regression is identical to least squares, except that the ridge coefficients are computed by minimizing a slightly different quantity. In ridge regression, a scale transformation has a substantial effect, so to avoid obtaining different results depending on the scale of measurement of the predictors, it is advisable to standardize all predictors before estimating the model. To standardize the variables, we subtract their means and divide by their standard deviations.
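As a minimal sketch of that advice, the following uses scikit-learn's StandardScaler in a pipeline, which is one convenient way to do the standardization; subtracting the means and dividing by the standard deviations by hand works just as well. X_train, y_train, X_test, and y_test are the splits used earlier in the recipe:

from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# The scaler subtracts each predictor's mean and divides by its standard
# deviation before the ridge penalty is applied
scaled_ridge = make_pipeline(StandardScaler(), Ridge(alpha=0.01))
scaled_ridge.fit(X_train, y_train)
print("R2 on the test set:", round(scaled_ridge.score(X_test, y_test), 2))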

See also