Book Image

Deep Learning with fastai Cookbook

By : Mark Ryan
Book Image

Deep Learning with fastai Cookbook

By: Mark Ryan

Overview of this book

fastai is an easy-to-use deep learning framework built on top of PyTorch that lets you rapidly create complete deep learning solutions with as few as 10 lines of code. Both predominant low-level deep learning frameworks, TensorFlow and PyTorch, require a lot of code, even for straightforward applications. In contrast, fastai handles the messy details for you and lets you focus on applying deep learning to actually solve problems. The book begins by summarizing the value of fastai and showing you how to create a simple 'hello world' deep learning application with fastai. You'll then learn how to use fastai for all four application areas that the framework explicitly supports: tabular data, text data (NLP), recommender systems, and vision data. As you advance, you'll work through a series of practical examples that illustrate how to create real-world applications of each type. Next, you'll learn how to deploy fastai models, including creating a simple web application that predicts what object is depicted in an image. The book wraps up with an overview of the advanced features of fastai. By the end of this fastai book, you'll be able to create your own deep learning applications using fastai. You'll also have learned how to use fastai to prepare raw datasets, explore datasets, train deep learning models, and deploy trained models.
Table of Contents (10 chapters)

Training a deep learning language model with a standalone text dataset

In the previous sections, we trained a language model and a text classifier using the curated text dataset IMDb. In this section and the next section, we will train a language model and a text classifier using a standalone text dataset, the Kaggle Coronavirus tweets NLP – Text Classification dataset described here: https://www.kaggle.com/datatattle/covid-19-nlp-text-classification. This dataset includes a selection of tweets related to the Covid-19 pandemic, along with categorization for the tweets according to the following five categories:

  • Extremely negative
  • Negative
  • Neutral
  • Positive
  • Extremely positive

The goal of the language model trained on this dataset is to predict the subsequent words in a Covid-related tweet given a starting phrase. The goal of the text classification model trained on this dataset, as described in the Training a deep learning text classifier with a...