Natural Language Processing with TensorFlow - Second Edition

By: Thushan Ganegedara
Overview of this book

Learning how to solve natural language processing (NLP) problems is an important skill to master due to the explosive growth of data combined with the demand for machine learning solutions in production. Natural Language Processing with TensorFlow, Second Edition, will teach you how to solve common real-world NLP problems with a variety of deep learning model architectures. The book starts by getting you familiar with NLP and the basics of TensorFlow. Then, it gradually teaches you different facets of TensorFlow 2.x. In the following chapters, you will learn how to generate powerful word vectors, classify text, generate new text, and generate image captions, among other exciting real-world NLP use cases. TensorFlow has evolved into an ecosystem that supports a machine learning workflow through ingesting and transforming data, building models, monitoring, and productionization. We will read text directly from files and perform the required transformations through a TensorFlow data pipeline. We will also see how to use a versatile visualization tool known as TensorBoard to visualize our models. By the end of this NLP book, you will be comfortable using TensorFlow to build deep learning models with many different architectures and to efficiently ingest data. Additionally, you’ll be able to confidently use TensorFlow throughout your machine learning workflow.

Transformer architecture

A Transformer is a type of Seq2Seq model (discussed in the previous chapter). Transformer models can work with both image and text data. The Transformer model takes in a sequence of inputs and maps it to a sequence of outputs.

The Transformer model was initially proposed in the paper Attention Is All You Need by Vaswani et al. (https://arxiv.org/pdf/1706.03762.pdf). Just like a Seq2Seq model, the Transformer consists of an encoder and a decoder (Figure 10.1):

Figure 10.1: The encoder-decoder architecture
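To make this architecture concrete, here is a minimal sketch of one encoder layer and one decoder layer built with tf.keras.layers.MultiHeadAttention. It is an illustration, not the book's implementation: the names (EncoderLayer, DecoderLayer) and the hyperparameters (d_model, num_heads, dff) are chosen arbitrarily for the example, and it omits embeddings, positional encodings, dropout, and layer stacking. The use_causal_mask argument assumes a recent TensorFlow release (2.10 or later).

```python
import tensorflow as tf

# Hypothetical hyperparameters, for illustration only.
d_model, num_heads, dff = 128, 4, 512

class EncoderLayer(tf.keras.layers.Layer):
    """Simplified encoder layer: self-attention + feed-forward,
    each with a residual connection and layer normalization."""
    def __init__(self):
        super().__init__()
        self.mha = tf.keras.layers.MultiHeadAttention(num_heads, key_dim=d_model // num_heads)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(dff, activation="relu"),
            tf.keras.layers.Dense(d_model),
        ])
        self.norm1 = tf.keras.layers.LayerNormalization()
        self.norm2 = tf.keras.layers.LayerNormalization()

    def call(self, x):
        x = self.norm1(x + self.mha(x, x))   # self-attention over the source sequence
        return self.norm2(x + self.ffn(x))   # position-wise feed-forward network

class DecoderLayer(tf.keras.layers.Layer):
    """Simplified decoder layer: masked self-attention over the target,
    cross-attention over the encoder's interim outputs, then feed-forward."""
    def __init__(self):
        super().__init__()
        self.self_mha = tf.keras.layers.MultiHeadAttention(num_heads, key_dim=d_model // num_heads)
        self.cross_mha = tf.keras.layers.MultiHeadAttention(num_heads, key_dim=d_model // num_heads)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(dff, activation="relu"),
            tf.keras.layers.Dense(d_model),
        ])
        self.norm1 = tf.keras.layers.LayerNormalization()
        self.norm2 = tf.keras.layers.LayerNormalization()
        self.norm3 = tf.keras.layers.LayerNormalization()

    def call(self, y, enc_out):
        # The causal mask stops each position from attending to future tokens.
        y = self.norm1(y + self.self_mha(y, y, use_causal_mask=True))
        y = self.norm2(y + self.cross_mha(y, enc_out))  # attend to encoder outputs
        return self.norm3(y + self.ffn(y))

# Usage with already-embedded dummy sequences:
src = tf.random.uniform((2, 7, d_model))   # batch of embedded source tokens
tgt = tf.random.uniform((2, 5, d_model))   # batch of embedded target tokens
enc_out = EncoderLayer()(src)
dec_out = DecoderLayer()(tgt, enc_out)     # shape: (2, 5, d_model)
```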

Let’s understand how the Transformer model works using the machine translation task we studied previously. The encoder takes in a sequence of source-language tokens and produces a sequence of interim outputs. The decoder then takes in a sequence of target-language tokens and predicts the next token at each time step (a technique known as teacher forcing). Both the encoder and the decoder use attention mechanisms to improve performance. For...
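In practice, teacher forcing means the decoder is fed the ground-truth target sequence shifted right, and is trained to predict the same sequence shifted left. The snippet below is a minimal sketch of that data preparation, using hypothetical token IDs (1 and 2 standing in for start and end tokens):

```python
import tensorflow as tf

# Hypothetical token IDs for one target sentence: 1 = <start>, 2 = <end>.
target = tf.constant([[1, 45, 12, 98, 2]])

# Teacher forcing: the decoder sees the ground truth up to step t
# and must predict the token at step t + 1.
decoder_input = target[:, :-1]   # [<start>, 45, 12, 98]
decoder_label = target[:, 1:]    # [45, 12, 98, <end>]

print(decoder_input.numpy())     # [[ 1 45 12 98]]
print(decoder_label.numpy())     # [[45 12 98  2]]
```

Feeding the ground truth instead of the model's own (possibly wrong) previous predictions keeps training stable and lets all time steps be computed in parallel.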