Mastering PyTorch

By : Ashish Ranjan Jha

Mastering PyTorch

By: Ashish Ranjan Jha

Overview of this book

Deep learning is driving the AI revolution, and PyTorch is making it easier than ever before for anyone to build deep learning applications. This PyTorch book will help you uncover expert techniques to get the most out of your data and build complex neural network models. The book starts with a quick overview of PyTorch and explores using convolutional neural network (CNN) architectures for image classification. You'll then work with recurrent neural network (RNN) architectures and transformers for sentiment analysis. As you advance, you'll apply deep learning across different domains, such as music, text, and image generation using generative models and explore the world of generative adversarial networks (GANs). You'll not only build and train your own deep reinforcement learning models in PyTorch but also deploy PyTorch models to production using expert tips and techniques. Finally, you'll get to grips with training large models efficiently in a distributed manner, searching neural architectures effectively with AutoML, and rapidly prototyping models using PyTorch and fast.ai. By the end of this PyTorch book, you'll be able to perform complex deep learning tasks using PyTorch to build smart artificial intelligence models.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Section 1: PyTorch Overview

Free Chapter

Chapter 1: Overview of Deep Learning using PyTorch

Technical requirements

A refresher on deep learning

Exploring the PyTorch library

Training a neural network using PyTorch

Summary

Chapter 2: Combining CNNs and LSTMs

Technical requirements

Building a neural network with CNNs and LSTMs

Building an image caption generator using PyTorch

Summary

Section 2: Working with Advanced Neural Network Architectures

Chapter 3: Deep CNN Architectures

Technical requirements

Why are CNNs so powerful?

Evolution of CNN architectures

Developing LeNet from scratch

Fine-tuning the AlexNet model

Running a pre-trained VGG model

Exploring GoogLeNet and Inception v3

Discussing ResNet and DenseNet architectures

Understanding EfficientNets and the future of CNN architectures

Summary

Chapter 4: Deep Recurrent Model Architectures

Technical requirements

Exploring the evolution of recurrent networks

Training RNNs for sentiment analysis

Building a bidirectional LSTM

Discussing GRUs and attention-based models

Summary

Chapter 5: Hybrid Advanced Models

Technical requirements

Building a transformer model for language modeling

Developing a RandWireNN model from scratch

Summary

Section 3: Generative Models and Deep Reinforcement Learning

Chapter 6: Music and Text Generation with PyTorch

Technical requirements

Building a transformer-based text generator with PyTorch

Using a pre-trained GPT-2 model as a text generator

Generating MIDI music with LSTMs using PyTorch

Summary

Chapter 7: Neural Style Transfer

Technical requirements

Understanding how to transfer style between images

Implementing neural style transfer using PyTorch

Summary

Chapter 8: Deep Convolutional GANs

Technical requirements

Defining the generator and discriminator networks

Training a DCGAN using PyTorch

Using GANs for style transfer

Summary

Chapter 9: Deep Reinforcement Learning

Technical requirements

Reviewing reinforcement learning concepts

Discussing Q-learning

Understanding deep Q-learning

Building a DQN model in PyTorch

Summary

Section 4: PyTorch in Production Systems

Chapter 10: Operationalizing PyTorch Models into Production

Technical requirements

Model serving in PyTorch

Serving a PyTorch model using TorchServe

Exporting universal PyTorch models using TorchScript and ONNX

Serving PyTorch models in the cloud

Summary

References

Chapter 11: Distributed Training

Technical requirements

Distributed training with PyTorch

Distributed training on GPUs with CUDA

Summary

Chapter 12: PyTorch and AutoML

Technical requirements

Finding the best neural architectures with AutoML

Using Optuna for hyperparameter search

Defining the model architecture and loading dataset

Summary

Chapter 13: PyTorch and Explainable AI

Technical requirements

Model interpretability in PyTorch

Using Captum to interpret models

Summary

Chapter 14: Rapid Prototyping with PyTorch

Technical requirements

Using fast.ai to set up model training in a few minutes

Training models on any hardware using PyTorch Lightning

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Discussing ResNet and DenseNet architectures

In the previous section, we explored the Inception models, which had a reduced number of model parameters as the number of layers increased, thanks to the 1x1 convolutions and global average pooling. Furthermore, auxiliary classifiers were used to combat the vanishing gradient problem.

ResNet introduced the concept of skip connections. This simple yet effective trick overcomes the problem of both parameter overflow and vanishing gradients. The idea, as shown in the following diagram, is quite simple. The input is first passed through a non-linear transformation (convolutions followed by non-linear activations) and then the output of this transformation (referred to as the residual) is added to the original input. Each block of such computation is called a residual block, hence the name of the model – residual network or ResNet.

Figure 3.26 – Skip connections

Using these skip (or shortcut) connections...

Mastering PyTorch

By : Ashish Ranjan Jha

Mastering PyTorch

By: Ashish Ranjan Jha

Overview of this book

Related Content you might be interested in

Current Title:

Mastering PyTorch

PyTorch Artificial Intelligence Fundamentals

Deep Learning with PyTorch Lightning

Deep Learning with PyTorch 1.x

Discussing ResNet and DenseNet architectures