Understanding Long Short-Term Memory Networks
In this chapter, we will discuss the fundamentals behind a more advanced RNN variant known as Long Short-Term Memory Networks (LSTMs). Here, we will focus on understanding the theory behind LSTMs, so we can discuss their implementation in the next chapter. LSTMs are widely used in many sequential tasks (including stock market prediction, language modeling, and machine translation) and have proven to perform better than older sequential models (for example, standard RNNs), especially given the availability of large amounts of data. LSTMs are designed to avoid the problem of the vanishing gradient that we discussed in the previous chapter.
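To make the vanishing gradient concrete before moving on, the following is a minimal numerical sketch. It uses a hypothetical toy model, not code from this book: a scalar linear recurrence h_t = w * h_{t-1} + x_t, in which the gradient of h_T with respect to h_0 is simply w^T, so for |w| < 1 it shrinks exponentially as the sequence grows.

```python
def gradient_through_time(w: float, num_steps: int) -> float:
    """Return d h_T / d h_0 for the toy recurrence h_t = w * h_{t-1} + x_t.

    Each unrolled time step contributes one factor of w to the gradient,
    so the result is w raised to the power num_steps.
    """
    grad = 1.0
    for _ in range(num_steps):
        grad *= w  # one factor of the recurrent weight per time step
    return grad


# With a recurrent weight below 1, the gradient decays exponentially:
for steps in (10, 100, 500):
    print(steps, gradient_through_time(0.9, steps))
```

After a few hundred steps the gradient is numerically indistinguishable from zero, which is why a standard RNN trained by backpropagation through time receives almost no learning signal from distant inputs; the LSTM's design counteracts exactly this decay.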
The main practical limitation posed by the vanishing gradient is that it prevents the model from learning long-term dependencies. However, by avoiding the vanishing gradient problem, LSTMs can retain memory far longer than ordinary RNNs, for hundreds of time steps. In contrast to RNNs...