Accelerate Model Training with PyTorch 2.X

By : Maicon Melo Alves

Accelerate Model Training with PyTorch 2.X

By: Maicon Melo Alves

Overview of this book

Penned by an expert in High-Performance Computing (HPC) with over 25 years of experience, this book is your guide to enhancing the performance of model training using PyTorch, one of the most widely adopted machine learning frameworks. You’ll start by understanding how model complexity impacts training time before discovering distinct levels of performance tuning to expedite the training process. You’ll also learn how to use a new PyTorch feature to compile the model and train it faster, alongside learning how to benefit from specialized libraries to optimize the training process on the CPU. As you progress, you’ll gain insights into building an efficient data pipeline to keep accelerators occupied during the entire training execution and explore strategies for reducing model complexity and adopting mixed precision to minimize computing time and memory consumption. The book will get you acquainted with distributed training and show you how to use PyTorch to harness the computing power of multicore systems and multi-GPU environments available on single or multiple machines. By the end of this book, you’ll be equipped with a suite of techniques, approaches, and strategies to speed up training , so you can focus on what really matters—building stunning models!

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Free Chapter

Part 1: Paving the Way

Chapter 1: Deconstructing the Training Process

Technical requirements

Remembering the training process

Understanding the computational burden of the model training phase

Quiz time!

Summary

Chapter 2: Training Models Faster

Technical requirements

What options do we have?

Modifying the application layer

Modifying the environment layer

Quiz time!

Summary

Part 2: Going Faster

Chapter 3: Compiling the Model

Technical requirements

What do you mean by compiling?

Using the Compile API

How does the Compile API work under the hood?

Quiz time!

Summary

Chapter 4: Using Specialized Libraries

Technical requirements

Multithreading with OpenMP

Optimizing Intel CPU with IPEX

Quiz time!

Summary

Chapter 5: Building an Efficient Data Pipeline

Technical requirements

Why do we need an efficient data pipeline?

Accelerating data loading

Quiz time!

Summary

Chapter 6: Simplifying the Model

Technical requirements

Knowing the model simplifying process

Using Microsoft NNI to simplify a model

Quiz time!

Summary

Chapter 7: Adopting Mixed Precision

Technical requirements

Remembering numeric precision

Understanding the mixed precision strategy

Enabling AMP

Quiz time!

Summary

Part 3: Going Distributed

Chapter 8: Distributed Training at a Glance

Technical requirements

A first look at distributed training

Learning the fundamentals of parallelism strategies

Distributed training on PyTorch

Quiz time!

Summary

Chapter 9: Training with Multiple CPUs

Technical requirements

Why distribute the training on multiple CPUs?

Implementing distributed training on multiple CPUs

Getting faster with Intel oneCCL

Quiz time!

Summary

Chapter 10: Training with Multiple GPUs

Technical requirements

Demystifying the multi-GPU environment

Implementing distributed training on multiple GPUs

Quiz time!

Summary

Chapter 11: Training with Multiple Machines

Technical requirements

What is a computing cluster?

Implementing distributed training on multiple machines

Quiz time!

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Understanding the computational burden of the model training phase

Now that we’ve brushed up on how the training process works, let’s understand the computational cost required to train a model. By using the terms computational cost or burden, we mean the computing power needed to execute the training process. The higher the computational cost, the higher the time taken to train the model. In the same way, the higher the computational burden, the higher the computing resources required to train the model.

Essentially, we can say the computational burden to train a model is defined by a three-fold factor, as illustrated in Figure 1.6:

Figure 1.6 – Factors that influence the training computational burden

Each one of these factors contributes (to some degree) to the computational complexity imposed by the training process. Let’s talk about each one of them.

Hyperparameters

Hyperparameters define two aspects of neural networks...

Accelerate Model Training with PyTorch 2.X

By : Maicon Melo Alves

Accelerate Model Training with PyTorch 2.X

By: Maicon Melo Alves

Overview of this book

Related Content you might be interested in

Current Title:

Accelerate Model Training with PyTorch 2.X

Distributed Machine Learning with Python

Accelerate Deep Learning Workloads with Amazon SageMaker

Learn CUDA Programming

Understanding the computational burden of the model training phase

Hyperparameters