Hands-On Python Natural Language Processing

By : Aman Kedia, Mayank Rasu

4 (1)

Buy this Book

Hands-On Python Natural Language Processing

4 (1)

By: Aman Kedia, Mayank Rasu

Buy this Book

Overview of this book

Natural Language Processing (NLP) is the subfield in computational linguistics that enables computers to understand, process, and analyze text. This book caters to the unmet demand for hands-on training of NLP concepts and provides exposure to real-world applications along with a solid theoretical grounding. This book starts by introducing you to the field of NLP and its applications, along with the modern Python libraries that you'll use to build your NLP-powered apps. With the help of practical examples, you’ll learn how to build reasonably sophisticated NLP applications, and cover various methodologies and challenges in deploying NLP applications in the real world. You'll cover key NLP tasks such as text classification, semantic embedding, sentiment analysis, machine translation, and developing a chatbot using machine learning and deep learning techniques. The book will also help you discover how machine learning techniques play a vital role in making your linguistic apps smart. Every chapter is accompanied by examples of real-world applications to help you build impressive NLP applications of your own. By the end of this NLP book, you’ll be able to work with language data, use machine learning to identify patterns in text, and get acquainted with the advancements in NLP.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Section 1: Introduction

Free Chapter

Understanding the Basics of NLP

Programming languages versus natural languages

Why should I learn NLP?

Current applications of NLP

Summary

NLP Using Python

Technical requirements

Understanding Python with NLP

Important Python libraries

Web scraping libraries and methodology

Overview of Jupyter Notebook

Summary

Section 2: Natural Language Representation and Mathematics

Building Your NLP Vocabulary

Technical requirements

Lexicons

Phonemes, graphemes, and morphemes

Tokenization

Understanding word normalization

Summary

Transforming Text into Data Structures

Technical requirements

Understanding vectors and matrices

Exploring the Bag-of-Words architecture

TF-IDF vectors

One-hot vectorization

Building a basic chatbot

Summary

Word Embeddings and Distance Measurements for Text

Technical requirements

Understanding word embeddings

Demystifying Word2vec

Training a Word2vec model

Word mover’s distance

Summary

Exploring Sentence-, Document-, and Character-Level Embeddings

Technical requirements

Venturing into Doc2Vec

Exploring fastText

Understanding Sent2Vec and the Universal Sentence Encoder</span>

Summary

Section 3: NLP and Learning

Identifying Patterns in Text Using Machine Learning

Technical requirements

Introduction to ML

Data preprocessing

The Naive Bayes algorithm

The SVM algorithm

Productionizing a trained sentiment analyzer

Summary

From Human Neurons to Artificial Neurons for Understanding Text

Technical requirements

Exploring the biology behind neural networks

How does a neural network learn?

Understanding regularization

Let's talk Keras

Building a question classifier using neural networks

Summary

Applying Convolutions to Text

Technical requirements

What is a CNN?

Detecting sarcasm in text using CNNs

Summary

Capturing Temporal Relationships in Text

Technical requirements

Baby steps toward understanding RNNs

Vanishing and exploding gradients

Architectural forms of RNNs

Giving memory to our networks – LSTMs

Building a text generator using LSTMs

Exploring memory-based variants of the RNN architecture

Summary

State of the Art in NLP

Technical requirements

Seq2Seq modeling

Translating between languages using Seq2Seq modeling

Let's pay some attention

Transformers

BERT

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

4 (1)

5 star

4 star

100%

3 star

2 star

1 star

One-hot vectorization

In general, a one-hot vector is used to represent categorical variables that take in values from a predefined list of values. These help in representing tokens as vectors that are required in certain use cases. In such vectors, all values are 0 except the one where the token is present, and this entry is marked 1. As you may have guessed, these are binary vectors.

For example, weather can be represented as a categorical variable with the values hot and cold. In this scenario, the one-hot vectors would be as follows:

vec(hot)  = <0, 1>
vec(cold) = <1, 0>

There are two bits in here—the second bit is 1, to denote hot, and the first bit is 1, to denote cold. The size of the vector is 2 since there are only two possibilities available in terms of hot and cold.

Hey! Where does this work similarly in NLP?

In NLP, each of the terms present in the vocabulary can be thought of as a category, just as we had two categories to represent weather conditions. Now...

Hands-On Python Natural Language Processing

By : Aman Kedia, Mayank Rasu

Hands-On Python Natural Language Processing

By: Aman Kedia, Mayank Rasu

Overview of this book

Related Content you might be interested in

Current Title:

Hands-On Python Natural Language Processing

Hands-On Natural Language Processing with PyTorch 1.x

Deep Learning for Natural Language Processing

fastText Quick Start Guide

One-hot vectorization