Mastering PyTorch

By : Ashish Ranjan Jha

Mastering PyTorch

By: Ashish Ranjan Jha

Overview of this book

Deep learning is driving the AI revolution, and PyTorch is making it easier than ever before for anyone to build deep learning applications. This PyTorch book will help you uncover expert techniques to get the most out of your data and build complex neural network models. The book starts with a quick overview of PyTorch and explores using convolutional neural network (CNN) architectures for image classification. You'll then work with recurrent neural network (RNN) architectures and transformers for sentiment analysis. As you advance, you'll apply deep learning across different domains, such as music, text, and image generation using generative models and explore the world of generative adversarial networks (GANs). You'll not only build and train your own deep reinforcement learning models in PyTorch but also deploy PyTorch models to production using expert tips and techniques. Finally, you'll get to grips with training large models efficiently in a distributed manner, searching neural architectures effectively with AutoML, and rapidly prototyping models using PyTorch and fast.ai. By the end of this PyTorch book, you'll be able to perform complex deep learning tasks using PyTorch to build smart artificial intelligence models.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Section 1: PyTorch Overview

Free Chapter

Chapter 1: Overview of Deep Learning using PyTorch

Technical requirements

A refresher on deep learning

Exploring the PyTorch library

Training a neural network using PyTorch

Summary

Chapter 2: Combining CNNs and LSTMs

Technical requirements

Building a neural network with CNNs and LSTMs

Building an image caption generator using PyTorch

Summary

Section 2: Working with Advanced Neural Network Architectures

Chapter 3: Deep CNN Architectures

Technical requirements

Why are CNNs so powerful?

Evolution of CNN architectures

Developing LeNet from scratch

Fine-tuning the AlexNet model

Running a pre-trained VGG model

Exploring GoogLeNet and Inception v3

Discussing ResNet and DenseNet architectures

Understanding EfficientNets and the future of CNN architectures

Summary

Chapter 4: Deep Recurrent Model Architectures

Technical requirements

Exploring the evolution of recurrent networks

Training RNNs for sentiment analysis

Building a bidirectional LSTM

Discussing GRUs and attention-based models

Summary

Chapter 5: Hybrid Advanced Models

Technical requirements

Building a transformer model for language modeling

Developing a RandWireNN model from scratch

Summary

Section 3: Generative Models and Deep Reinforcement Learning

Chapter 6: Music and Text Generation with PyTorch

Technical requirements

Building a transformer-based text generator with PyTorch

Using a pre-trained GPT-2 model as a text generator

Generating MIDI music with LSTMs using PyTorch

Summary

Chapter 7: Neural Style Transfer

Technical requirements

Understanding how to transfer style between images

Implementing neural style transfer using PyTorch

Summary

Chapter 8: Deep Convolutional GANs

Technical requirements

Defining the generator and discriminator networks

Training a DCGAN using PyTorch

Using GANs for style transfer

Summary

Chapter 9: Deep Reinforcement Learning

Technical requirements

Reviewing reinforcement learning concepts

Discussing Q-learning

Understanding deep Q-learning

Building a DQN model in PyTorch

Summary

Section 4: PyTorch in Production Systems

Chapter 10: Operationalizing PyTorch Models into Production

Technical requirements

Model serving in PyTorch

Serving a PyTorch model using TorchServe

Exporting universal PyTorch models using TorchScript and ONNX

Serving PyTorch models in the cloud

Summary

References

Chapter 11: Distributed Training

Technical requirements

Distributed training with PyTorch

Distributed training on GPUs with CUDA

Summary

Chapter 12: PyTorch and AutoML

Technical requirements

Finding the best neural architectures with AutoML

Using Optuna for hyperparameter search

Defining the model architecture and loading dataset

Summary

Chapter 13: PyTorch and Explainable AI

Technical requirements

Model interpretability in PyTorch

Using Captum to interpret models

Summary

Chapter 14: Rapid Prototyping with PyTorch

Technical requirements

Using fast.ai to set up model training in a few minutes

Training models on any hardware using PyTorch Lightning

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Chapter 1: Overview of Deep Learning using PyTorch

Deep learning is a class of machine learning methods that has revolutionized the way computers/machines are used to perform cognitive tasks in real life. Based on the mathematical concept of deep neural networks, deep learning uses large amounts of data to learn non-trivial relationships between inputs and outputs in the form of complex nonlinear functions. Some of the inputs and outputs, as demonstrated in Figure 1.1, could be the following:

Input: An image of a text; output: Text
Input: Text; output: A natural voice speaking the text
Input: A natural voice speaking the text; output: Transcribed text

And so on. Here is a figure to support the preceding explanation:

Figure 1.1 – Deep learning model examples

Deep neural networks involve a lot of mathematical computations, linear algebraic equations, complex nonlinear functions, and various optimization algorithms. In order to build and train a deep neural network from scratch using a programming language such as Python, it would require us to write all the necessary equations, functions, and optimization schedules. Furthermore, the code would need to be written such that large amounts of data can be loaded efficiently, and training can be performed in a reasonable amount of time. This amounts to implementing several lower-level details each time we build a deep learning application.

Deep learning libraries such as Theano and TensorFlow, among various others, have been developed over the years to abstract these details out. PyTorch is one such Python-based deep learning library that can be used to build deep learning models.

TensorFlow was introduced as an open source deep learning Python (and C++) library by Google in late 2015, which revolutionized the field of applied deep learning. Facebook, in 2016, responded with its own open source deep learning library and called it Torch. Torch was initially used with a scripting language called Lua, and soon enough, the Python equivalent emerged called PyTorch. Around the same time, Microsoft released its own library – CNTK. Amidst the hot competition, PyTorch has been growing fast to become one of the most used deep learning libraries.

This book is meant to be a hands-on resource on some of the most advanced deep learning problems, how they are solved using complex deep learning architectures, and how PyTorch can be effectively used to build, train, and evaluate these complex models. While the book keeps PyTorch at the center, it also includes comprehensive coverage of some of the most recent and advanced deep learning models. The book is intended for data scientists, machine learning engineers, or researchers who have a working knowledge of Python and who, preferably, have used PyTorch before.

Due to the hands-on nature of this book, it is highly recommended to try the examples in each chapter by yourself on your computer to become proficient in writing PyTorch code. We begin with this introductory chapter and subsequently explore various deep learning problems and model architectures that will expose the various functionalities PyTorch has to offer.

This chapter will review some of the concepts behind deep learning and will provide a brief overview of the PyTorch library. We conclude this chapter with a hands-on exercise where we train a deep learning model using PyTorch.

The following topics will be covered in this chapter:

A refresher on deep learning
Exploring the PyTorch library
Training a neural network using PyTorch

Mastering PyTorch

By : Ashish Ranjan Jha

Mastering PyTorch

By: Ashish Ranjan Jha

Overview of this book

Related Content you might be interested in

Current Title:

Mastering PyTorch

PyTorch Artificial Intelligence Fundamentals

Deep Learning with PyTorch Lightning

Deep Learning with PyTorch 1.x

Chapter 1: Overview of Deep Learning using PyTorch