Chapter 4: Regularization with Tree-Based Models

Book Overview & Buying
Table Of Contents

The Regularization Cookbook

By : Vincent Vandenbussche

4.3 (7)

Buy this Book

The Regularization Cookbook

4.3 (7)

By: Vincent Vandenbussche

Buy this Book

Overview of this book

Regularization is an infallible way to produce accurate results with unseen data, however, applying regularization is challenging as it is available in multiple forms and applying the appropriate technique to every model is a must. The Regularization Cookbook provides you with the appropriate tools and methods to handle any case, with ready-to-use working codes as well as theoretical explanations. After an introduction to regularization and methods to diagnose when to use it, you’ll start implementing regularization techniques on linear models, such as linear and logistic regression, and tree-based models, such as random forest and gradient boosting. You’ll then be introduced to specific regularization methods based on data, high cardinality features, and imbalanced datasets. In the last five chapters, you’ll discover regularization for deep learning models. After reviewing general methods that apply to any type of neural network, you’ll dive into more NLP-specific methods for RNNs and transformers, as well as using BERT or GPT-3. By the end, you’ll explore regularization for computer vision, covering CNN specifics, along with the use of generative models such as stable diffusion and Dall-E. By the end of this book, you’ll be armed with different regularization techniques to apply to your ML and DL models.

Preface

Who this book is for

What this book covers

To get the most out of this book

Conventions used

Sections

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Chapter 1: An Overview of Regularization

Technical requirements

Introducing regularization

Key concepts of regularization

Regularization – a multi-dimensional problem

Summary

Free Chapter

Chapter 2: Machine Learning Refresher

Technical requirements

Loading data

Splitting data

Preparing quantitative data

Preparing qualitative data

Training a model

Evaluating a model

Performing hyperparameter optimization

Chapter 3: Regularization with Linear Models

Technical requirements

Training a linear regression model with scikit-learn

Regularizing with ridge regression

Regularizing with lasso regression

Regularizing with elastic net regression

Training a logistic regression model

Regularizing a logistic regression model

Choosing the right regularization

Chapter 4: Regularization with Tree-Based Models

Technical requirements

Building a classification tree

Building regression trees

Regularizing a decision tree

Training the Random Forest algorithm

Regularization of Random Forest

Training a boosting model with XGBoost

Regularization with XGBoost

Chapter 5: Regularization with Data

Technical requirements

Hashing high cardinality features

Aggregating features

Undersampling an imbalanced dataset

Oversampling an imbalanced dataset

Resampling imbalanced data with SMOTE

Chapter 6: Deep Learning Reminders

Technical requirements

Training a perceptron

Training a neural network for regression

Training a neural network for binary classification

Training a multiclass classification neural network

Chapter 7: Deep Learning Regularization

Technical requirements

Regularizing a neural network with L2 regularization

Regularizing a neural network with early stopping

Regularization with network architecture

Regularizing with dropout

Chapter 8: Regularization with Recurrent Neural Networks

Technical requirements

Training an RNN

Training a GRU

Regularizing with dropout

Regularizing with the maximum sequence length

Chapter 9: Advanced Regularization in Natural Language Processing

Technical requirements

Regularization using a word2vec embedding

Data augmentation using word2vec

Zero-shot inference with pre-trained models

Regularization with BERT embeddings

Data augmentation using GPT-3

Chapter 10: Regularization in Computer Vision

Technical requirements

Training a CNN

Regularizing a CNN with vanilla NN methods

Regularizing a CNN with transfer learning for object detection

Semantic segmentation using transfer learning

Chapter 11: Regularization in Computer Vision – Synthetic Image Generation

Technical requirements

Applying image augmentation with Albumentations

Creating synthetic images for object detection

Implementing real-time style transfer

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

The Regularization Cookbook

By : Vincent Vandenbussche

The Regularization Cookbook

By: Vincent Vandenbussche

Overview of this book

Training the Random Forest algorithm

Getting ready

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access