Book Image

Deep Learning with PyTorch Lightning

By : Kunal Sawarkar
3.5 (2)
Book Image

Deep Learning with PyTorch Lightning

3.5 (2)
By: Kunal Sawarkar

Overview of this book

Building and implementing deep learning (DL) is becoming a key skill for those who want to be at the forefront of progress.But with so much information and complex study materials out there, getting started with DL can feel quite overwhelming. Written by an AI thought leader, Deep Learning with PyTorch Lightning helps researchers build their first DL models quickly and easily without getting stuck on the complexities. With its help, you’ll be able to maximize productivity for DL projects while ensuring full flexibility – from model formulation to implementation. Throughout this book, you’ll learn how to configure PyTorch Lightning on a cloud platform, understand the architectural components, and explore how they are configured to build various industry solutions. You’ll build a neural network architecture, deploy an application from scratch, and see how you can expand it based on your specific needs, beyond what the framework can provide. In the later chapters, you’ll also learn how to implement capabilities to build and train various models like Convolutional Neural Nets (CNN), Natural Language Processing (NLP), Time Series, Self-Supervised Learning, Semi-Supervised Learning, Generative Adversarial Network (GAN) using PyTorch Lightning. By the end of this book, you’ll be able to build and deploy DL models with confidence.
Table of Contents (15 chapters)
1
Section 1: Kickstarting with PyTorch Lightning
6
Section 2: Solving using PyTorch Lightning
11
Section 3: Advanced Topics

Video classification using Flash

Video classification is one of the most interesting yet challenging problems in DL. Simply speaking, it tries to classify an action in a video clip and recognize it (such as walking, bowling, or golfing):

Figure 4.1 – The Kinetics human action video dataset released by DeepMind is comprised of annotated ~10-second video clips sourced from YouTube

Training such a DL model is a challenging problem because of the sheer amount of compute power it takes to train the model, given the large size of video files compared to tabular or image data. Using a pre-trained model and architecture is a great way to start your experiments for video classification.

PyTorch Lightning Flash relies internally on the PyTorchVideo library for its backbone. PyTorchVideo caters to the ecosystem of video understanding. Lightning Flash makes it easy by creating the predefined and configurable hooks into the underlying framework. There are hooks...