Deep Learning with PyTorch Lightning

By : Kunal Sawarkar

3.5 (2)

Buy this Book

Deep Learning with PyTorch Lightning

3.5 (2)

By: Kunal Sawarkar

Buy this Book

Overview of this book

Building and implementing deep learning (DL) is becoming a key skill for those who want to be at the forefront of progress.But with so much information and complex study materials out there, getting started with DL can feel quite overwhelming. Written by an AI thought leader, Deep Learning with PyTorch Lightning helps researchers build their first DL models quickly and easily without getting stuck on the complexities. With its help, you’ll be able to maximize productivity for DL projects while ensuring full flexibility – from model formulation to implementation. Throughout this book, you’ll learn how to configure PyTorch Lightning on a cloud platform, understand the architectural components, and explore how they are configured to build various industry solutions. You’ll build a neural network architecture, deploy an application from scratch, and see how you can expand it based on your specific needs, beyond what the framework can provide. In the later chapters, you’ll also learn how to implement capabilities to build and train various models like Convolutional Neural Nets (CNN), Natural Language Processing (NLP), Time Series, Self-Supervised Learning, Semi-Supervised Learning, Generative Adversarial Network (GAN) using PyTorch Lightning. By the end of this book, you’ll be able to build and deploy DL models with confidence.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the colour images

Conventions used

Get in touch

Share Your Thoughts

Section 1: Kickstarting with PyTorch Lightning

Free Chapter

Chapter 1: PyTorch Lightning Adventure

What makes PyTorch Lightning so special?

<pip install> – My Lightning adventure

Understanding the key components of PyTorch Lightning

Crafting AI applications using PyTorch Lightning

Further reading

Summary

Chapter 2: Getting off the Ground with the First Deep Learning Model

Technical requirements

Getting started with Neural Networks

Building a Hello World MLP model

Building our first Deep Learning model

Building a CNN model for image recognition

Summary

Chapter 3: Transfer Learning Using Pre-Trained Models

Technical requirements

Getting started with transfer learning

An image classifier using a pre-trained ResNet-50 architecture

Text classification using BERT transformers

Summary

Chapter 4: Ready-to-Cook Models from Lightning Flash

Technical requirements

Getting started with Lightning Flash

Flash is as simple as 1-2-3

Video classification using Flash

Automatic speech recognition using Flash

Further learning

Summary

Section 2: Solving using PyTorch Lightning

Chapter 5: Time Series Models

Technical requirements

Introduction to time series

Getting started with time series models

Traffic volume forecasting using the LSTM time series model

Summary

Chapter 6: Deep Generative Models

Technical requirements

Getting started with GAN models

Creating new food items using a GAN

Creating new butterfly species using a GAN

GAN training challenges

Creating images using DCGAN

Summary

Chapter 7: Semi-Supervised Learning

Technical requirements

Getting started with semi-supervised learning

Going through the CNN–RNN architecture

Generating captions for images

Summary

Chapter 8: Self-Supervised Learning

Technical requirements

Getting started with Self-Supervised Learning

What is Contrastive Learning?

SimCLR architecture

SimCLR model for image recognition

Summary

Section 3: Advanced Topics

Chapter 9: Deploying and Scoring Models

Technical requirements

Deploying and scoring a Deep Learning model natively

Deploying and scoring inter-portable models

Next steps

Understanding the key components of PyTorch Lightning

Before we jump into building DL models, let's revise a typical pipeline that a Deep Learning project follows.

DL pipeline

Let's revise a typical ML pipeline for a DL network architecture. This is what it looks like:

Figure 1.7 – DL pipeline

A DL pipeline typically involves the following steps. We will continue to see them throughout the book, utilizing them for each aspect of problem-solving:

Defining the problem:
- Set a clear task and objective of what is expected.
Data preparation:
- This step involves finding the right dataset to solve this problem, ingest it, and clean it. For most DL projects, this involves the data engineer working in images, videos, or text corpora to acquire datasets (sometimes by scraping the web), and then cataloging them into sizes.
- Most DL models require huge amounts of data, while models also need to be resilient to minor changes in images such as cropping. For this purpose, engineers augment the dataset by creating crops of original images or black and white (B/W) versions, or invert them, and so on.
Modeling:
- This would first involve FE and defining what kind of network architecture we want to build.
- For example, in the case of a data scientist creating new image recognition models, this would involve defining a CNN architecture with three layers of convolution, a step size, slide window, gradient descent optimization, a loss function, and suchlike can be defined.
- For ML researchers, this step could involve defining new loss functions that measure accuracy in a more useful way or perform some magic by making a model train with a less dense network that gives the same accuracy, or defining a new gradient optimization that distributes well or converges faster.
Training:
- Now comes the fun step. After data scientists have defined all the configurations for a DL network architecture, they need to train a model and keep tweaking it until it achieves convergence.
- For massive datasets (which are the norm in DL), this can be a nightmarish exercise. A data scientist must double up as an ML engineer by writing code to distribute it to the underlying GPU or central processing unit (CPU) or TPU, manage memory and epochs, and keep iterating the code that fully utilizes compute power. A lower 16-bit precision may help train the model faster, and so data scientists may attempt this.
- Alternatively, a distributed downpour gradient descent can be used to optimize faster. If you are finding yourself out of breath with some of these terms, then don't worry. Many data scientists experience this, as it has less to do with statistics and more to do with engineering (and this is where we will see how PyTorch Lightning comes to the rescue).
- Another major challenge in distributed computing is being able to fully utilize all the hardware and accurately compute losses that are distributed in various GPUs. It's not simple either to do data parallelism, (distribute data to different GPUs in batches) or do model parallelism (distribute models to different GPUs).
Deployment engineering:
- After the model has been trained, we need to take it to production. ML operations (MLOps) engineers work by creating deployment-ready format files that can work in their environment.
- This step also involves creating an Application Programming Interface (API) to be integrated with the end application for consumption. Occasionally, it can also involve creating infrastructure to score models for incoming traffic sizes if the model is expected to have a massive workload.

PyTorch Lightning abstraction layers

PyTorch Lightning frameworks make it easy to construct entire DL models to aid data scientists. Here's how this is achieved:

The LightningModule class is used to define the model structure, inference logic, optimizer and scheduler details, training and validation logic, and so on.
A Lightning Trainer abstracts the logic needed for loops, hardware interactions, fitting and evaluating the model, and so on.
You can pass a PyTorch DataLoader to the trainer directly, or you can choose to define a LightningDataModule for improved shareability and reuse.

Deep Learning with PyTorch Lightning

By : Kunal Sawarkar

Deep Learning with PyTorch Lightning

By: Kunal Sawarkar

Overview of this book

Related Content you might be interested in

Current Title:

Deep Learning with PyTorch Lightning

Mastering PyTorch

PyTorch Computer Vision Cookbook

Deep Learning with PyTorch Quick Start Guide

Understanding the key components of PyTorch Lightning

DL pipeline

PyTorch Lightning abstraction layers