Natural Language Processing with TensorFlow

Natural Language Processing with TensorFlow

By : Motaz Saad, Thushan Ganegedara

Buy this Book

Natural Language Processing with TensorFlow

By: Motaz Saad, Thushan Ganegedara

Buy this Book

Overview of this book

Natural language processing (NLP) supplies the majority of data available to deep learning applications, while TensorFlow is the most important deep learning framework currently available. Natural Language Processing with TensorFlow brings TensorFlow and NLP together to give you invaluable tools to work with the immense volume of unstructured data in today’s data streams, and apply these tools to specific NLP tasks. Thushan Ganegedara starts by giving you a grounding in NLP and TensorFlow basics. You'll then learn how to use Word2vec, including advanced extensions, to create word embeddings that turn sequences of words into vectors accessible to deep learning algorithms. Chapters on classical deep learning algorithms, like convolutional neural networks (CNN) and recurrent neural networks (RNN), demonstrate important NLP tasks as sentence classification and language generation. You will learn how to apply high-performance RNN models, like long short-term memory (LSTM) cells, to NLP tasks. You will also explore neural machine translation and implement a neural machine translator. After reading this book, you will gain an understanding of NLP and you'll have the skills to apply TensorFlow in deep learning NLP applications, and how to perform specific NLP tasks.

Natural Language Processing with TensorFlow

Contributors

Preface

Free Chapter

Introduction to Natural Language Processing

What is Natural Language Processing?

Tasks of Natural Language Processing

The traditional approach to Natural Language Processing

The deep learning approach to Natural Language Processing

The roadmap – beyond this chapter

Introduction to the technical tools

Summary

Understanding TensorFlow

What is TensorFlow?

Inputs, variables, outputs, and operations

Reusing variables with scoping

Implementing our first neural network

Summary

Word2vec – Learning Word Embeddings

What is a word representation or meaning?

Classical approaches to learning word representation

Word2vec – a neural network-based approach to learning word representation

The skip-gram algorithm

The Continuous Bag-of-Words algorithm

Summary

Advanced Word2vec

The original skip-gram algorithm

Comparing skip-gram with CBOW

Extensions to the word embeddings algorithms

More recent algorithms extending skip-gram and CBOW

GloVe – Global Vectors representation

Document classification with Word2vec

Summary

Sentence Classification with Convolutional Neural Networks

Introducing Convolution Neural Networks

Understanding Convolution Neural Networks

Exercise – image classification on MNIST with CNN

Using CNNs for sentence classification

Summary

Recurrent Neural Networks

Understanding Recurrent Neural Networks

Backpropagation Through Time

Applications of RNNs

Generating text with RNNs

Evaluating text results output from the RNN

Perplexity – measuring the quality of the text result

Recurrent Neural Networks with Context Features – RNNs with longer memory

Summary

Long Short-Term Memory Networks

Understanding Long Short-Term Memory Networks

How LSTMs solve the vanishing gradient problem

Other variants of LSTMs

Summary

Applications of LSTM – Generating Text

Our data

Implementing an LSTM

Comparing LSTMs to LSTMs with peephole connections and GRUs

Improving LSTMs – beam search

Improving LSTMs – generating text with words instead of n-grams

Using the TensorFlow RNN API

Summary

Applications of LSTM – Image Caption Generation

Getting to know the data

The machine learning pipeline for image caption generation

Extracting image features with CNNs

Implementation – loading weights and inferencing with VGG-

Learning word embeddings

Preparing captions for feeding into LSTMs

Generating data for LSTMs

Defining the LSTM

Evaluating the results quantitatively

Captions generated for test images

Using TensorFlow RNN API with pretrained GloVe word vectors

Summary

Sequence-to-Sequence Learning – Neural Machine Translation

Machine translation

A brief historical tour of machine translation

Understanding Neural Machine Translation

Preparing data for the NMT system

Training the NMT

Inference with NMT

The BLEU score – evaluating the machine translation systems

Implementing an NMT from scratch – a German to English translator

Training an NMT jointly with word embeddings

Improving NMTs

Attention

Other applications of Seq2Seq models – chatbots

Summary

Current Trends and the Future of Natural Language Processing

Current trends in NLP

Penetration into other research fields

Towards Artificial General Intelligence

NLP for social media

New tasks emerging

Newer machine learning models

Summary

References

Mathematical Foundations and Advanced TensorFlow

Basic data structures

Special types of matrices

Tensor/matrix operations

Probability

Introduction to Keras

Introduction to the TensorFlow seq2seq library

Visualizing word embeddings with TensorBoard

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Tasks of Natural Language Processing

NLP has a multitude of real-world applications. A good NLP system is that which performs many NLP tasks. When you search for today's weather on Google or use Google Translate to find out how to say, "How are you?" in French, you rely on a subset of such tasks in NLP. We will list some of the most ubiquitous tasks here, and this book covers most of these tasks:

Tokenization: Tokenization is the task of separating a text corpus into atomic units (for example, words). Although it may seem trivial, tokenization is an important task. For example, in the Japanese language, words are not delimited by spaces nor punctuation marks.
Word-sense Disambiguation (WSD): WSD is the task of identifying the correct meaning of a word. For example, in the sentences, The dog barked at the mailman, and Tree bark is sometimes used as a medicine, the word bark has two different meanings. WSD is critical for tasks such as question answering.
Named Entity Recognition (NER): NER attempts to extract entities (for example, person, location, and organization) from a given body of text or a text corpus. For example, the sentence, John gave Mary two apples at school on Monday will be transformed to [John] _name gave [Mary] _name [two] _number apples at [school] _organization on [Monday.] _time. NER is an imperative topic in fields such as information retrieval and knowledge representation.
Part-of-Speech (PoS) tagging: PoS tagging is the task of assigning words to their respective parts of speech. It can either be basic tags such as noun, verb, adjective, adverb, and preposition, or it can be granular such as proper noun, common noun, phrasal verb, verb, and so on.
Sentence/Synopsis classification: Sentence or synopsis (for example, movie reviews) classification has many use cases such as spam detection, news article classification (for example, political, technology, and sport), and product review ratings (that is, positive or negative). This is achieved by training a classification model with labeled data (that is, reviews annotated by humans, with either a positive or negative label).
Language generation: In language generation, a learning model (for example, neural network) is trained with text corpora (a large collection of textual documents), which predict new text that follows. For example, language generation can output an entirely new science fiction story by using existing science fiction stories for training.
Question Answering (QA): QA techniques possess a high commercial value, and such techniques are found at the foundation of chatbots and VA (for example, Google Assistant and Apple Siri). Chatbots have been adopted by many companies for customer support. Chatbots can be used to answer and resolve straightforward customer concerns (for example, changing a customer's monthly mobile plan), which can be solved without human intervention. QA touches upon many other aspects of NLP such as information retrieval, and knowledge representation. Consequently, all this makes developing a QA system very difficult.
Machine Translation (MT): MT is the task of transforming a sentence/phrase from a source language (for example, German) to a target language (for example, English). This is a very challenging task as, different languages have highly different morphological structures, which means that it is not a one-to-one transformation. Furthermore, word-to-word relationships between languages can be one-to-many, one-to-one, many-to-one, or many-to-many. This is known as the word alignment problem in MT literature.

Finally, to develop a system that can assist a human in day-to-day tasks (for example, VA or a chatbot) many of these tasks need to be performed together. As we saw in the previous example where the user asks, "Can you show me a good Italian restaurant nearby?" several different NLP tasks, such as speech-to-text conversion, semantic and sentiment analyses, question answering, and machine translation, need to be completed. In Figure 1.1, we provide a hierarchical taxonomy of different NLP tasks categorized into several different types. We first have two broad categories: analysis (analyzing existing text) and generation (generating new text) tasks. Then we divide analysis into three different categories: syntactic (language structure-based tasks), semantic (meaning-based tasks), and pragmatic (open problems difficult to solve):

Figure 1.1: A taxonomy of the popular tasks of NLP categorized under broader categories

Having understood the various tasks in NLP, let us now move on to understand how we can solve these tasks with the help of machines.

Natural Language Processing with TensorFlow

By : Motaz Saad, Thushan Ganegedara

Natural Language Processing with TensorFlow

By: Motaz Saad, Thushan Ganegedara

Overview of this book

Related Content you might be interested in

Current Title:

Natural Language Processing with TensorFlow

Deep Learning Essentials

Hands-On Natural Language Processing with PyTorch 1.x

Recurrent Neural Networks with Python Quick Start Guide

Tasks of Natural Language Processing