Deep Learning for Natural Language Processing

By: Karthiek Reddy Bokka, Shubhangi Hora, Tanuj Jain, Monicah Wambugu

Overview of this book

Applying deep learning approaches to various NLP tasks can take your computational algorithms to a completely new level in terms of speed and accuracy. Deep Learning for Natural Language Processing starts by highlighting the basic building blocks of the natural language processing domain. The book goes on to introduce the problems that you can solve using state-of-the-art neural network models. After this, delving into the various neural network architectures and their specific areas of application will help you to understand how to select the best model to suit your needs. As you advance through this deep learning book, you'll study convolutional, recurrent, and recursive neural networks, in addition to covering long short-term memory (LSTM) networks. Understanding these networks will help you to implement their models using Keras. In later chapters, you will be able to develop a trigger word detection application using NLP techniques such as attention models and beam search. By the end of this book, you will not only have sound knowledge of natural language processing, but also be able to select the best text preprocessing and neural network models to solve a number of NLP problems.

Chapter 9: A practical NLP project workflow in an organisation

Code for LSTM model

  1. Check whether a GPU is detected.

    import tensorflow as tf
    tf.test.gpu_device_name()
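
    If a GPU runtime is attached, this returns a device string such as '/device:GPU:0'; an empty string means the notebook is running on CPU only.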

  2. Set up the Colab notebook.

    from google.colab import drive
    drive.mount('/content/gdrive')

    # Run the below command in a new cell
    cd /content/gdrive/My Drive/Lesson-9/

    # Run the below command in a new cell
    !unzip data.csv.zip

  3. Import necessary Python packages and classes.

    import os
    import re
    import pickle

    import pandas as pd
    from keras.preprocessing.text import Tokenizer
    from keras.preprocessing.sequence import pad_sequences
    from keras.models import Sequential
    from keras.layers import Dense, Embedding, LSTM

  4. Load and preprocess the data file.

    def preprocess_data(data_file_path):
        data = pd.read_csv(data_file_path, header=None)  # read the CSV (no header row)
        data.columns = ['rating', 'title', 'review']  # add column names
        data['review'] = data['review'].apply(lambda x: x.lower())  # lowercase all text
        data['review'] = data['review'].apply(lambda x: re.sub(r'[^a-zA-Z0-9\s]', '', x))  # keep only letters, digits, and whitespace
        return data

    df = preprocess_data('data.csv')
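
    A quick sanity check here can catch loading problems early. A minimal sketch (the printed values depend on your copy of data.csv):

    print(df.head())  # first few rows: rating, title, review
    print(df['rating'].value_counts())  # class balance across the five ratings
    print(df['review'].str.len().describe())  # review length distribution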

  5. Initialize tokenization.

    max_features = 2000
    maxlength = 250
    tokenizer = Tokenizer(num_words=max_features, split=' ')
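
    With num_words=2000, the tokenizer will later keep only the most frequent words (strictly, word indices below num_words) when converting text to sequences; rarer words are dropped. maxlength is used in step 7 to pad every review to 250 tokens.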

  6. Fit tokenizer.

    tokenizer.fit_on_texts(df['review'].values)
    X = tokenizer.texts_to_sequences(df['review'].values)

  7. Pad sequences.

    X = pad_sequences(X, maxlen=maxlength)
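
    To see what steps 6 and 7 do, here is a toy sketch reusing the classes imported in step 3 (the example sentences are made up): a fitted tokenizer maps each known word to an integer index, unseen words are dropped, and pad_sequences zero-pads on the left to a fixed length.

    toy = Tokenizer(num_words=50, split=' ')
    toy.fit_on_texts(['great product', 'terrible product'])
    seqs = toy.texts_to_sequences(['great product', 'a great buy'])
    print(seqs)  # [[2, 1], [2]] -- 'a' and 'buy' were never seen, so they are dropped
    print(pad_sequences(seqs, maxlen=4))  # [[0 0 2 1] [0 0 0 2]] -- zero-padded on the left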

  8. Get the target variable and define the model.

    y_train = pd.get_dummies(df.rating).values

    embed_dim = 128
    hidden_units = 100
    n_classes = 5

    model = Sequential()
    model.add(Embedding(max_features, embed_dim, input_length=X.shape[1]))
    model.add(LSTM(hidden_units))
    model.add(Dense(n_classes, activation='softmax'))
    model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
    print(model.summary())
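
    As a sanity check on the summary: the embedding layer holds 2,000 × 128 = 256,000 weights, the LSTM holds 4 × ((128 + 100) × 100 + 100) = 91,600 (four gates, each with input, recurrent, and bias weights), and the dense layer holds 100 × 5 + 5 = 505, for 348,105 trainable parameters in total.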

  9. Fit the model.

    model.fit(X[:100000, :], y_train[:100000, :], batch_size = 128, epochs=15, validation_split=0.2)
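
    Note that validation_split=0.2 reserves the last 20,000 of these 100,000 examples for validation; Keras takes the validation set from the end of the arrays before shuffling, so the split stays fixed across epochs.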

  10. Save the model and tokenizer.

    model.save('trained_model.h5')  # creates an HDF5 file, 'trained_model.h5'
    with open('trained_tokenizer.pkl', 'wb') as f:  # creates a pickle file, 'trained_tokenizer.pkl'
        pickle.dump(tokenizer, f)

    from google.colab import files
    files.download('trained_model.h5')
    files.download('trained_tokenizer.pkl')

Code for Flask

  1. Import the necessary Python packages and classes.

    import re
    import pickle

    import numpy as np
    from flask import Flask, request, jsonify
    from keras.models import load_model
    from keras.preprocessing.sequence import pad_sequences

  2. Load the trained model and tokenizer.

    def load_variables():
        global model, tokenizer
        model = load_model('trained_model.h5')
        model._make_predict_function()  # https://github.com/keras-team/keras/issues/6462
        with open('trained_tokenizer.pkl', 'rb') as f:
            tokenizer = pickle.load(f)
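
    The _make_predict_function() call (see the linked issue) forces Keras to build its prediction function on the thread that loads the model; without it, the first predict() triggered from a Flask request thread can fail with a graph-related error.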

  3. Define preprocessing functions similar to the training code:

    def do_preprocessing(reviews):
        processed_reviews = []
        for review in reviews:
            review = review.lower()
            processed_reviews.append(re.sub(r'[^a-zA-Z0-9\s]', '', review))  # same cleaning as at training time
        processed_reviews = tokenizer.texts_to_sequences(np.array(processed_reviews))
        processed_reviews = pad_sequences(processed_reviews, maxlen=250)
        return processed_reviews
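
    This function must mirror the training-time preprocessing exactly (lowercasing and the same regular expression); if the two drift apart, the tokenizer will encounter tokens it was never fitted on and prediction quality will silently degrade.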

  4. Define a Flask app instance:

    app = Flask(__name__)

  5. Define an endpoint that displays a fixed message:

    @app.route('/')
    def home_routine():
        return 'Hello World!'

  6. We'll add a prediction endpoint to which we can send our review strings; it accepts HTTP 'POST' requests:

    @app.route('/prediction', methods=['POST'])
    def get_prediction():
        # get the incoming text and run the model on it
        if request.method == 'POST':
            data = request.get_json()
            data = do_preprocessing(data)
            predicted_sentiment_prob = model.predict(data)
            predicted_sentiment = np.argmax(predicted_sentiment_prob, axis=-1)
        return str(predicted_sentiment)
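
    The response is the index of the most probable class for each review, 0 through 4, which corresponds to ratings 1 through 5 in the column order produced by pd.get_dummies during training.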

  7. Start the web server.

    if __name__ == '__main__':
        # load model and tokenizer before starting the server
        load_variables()
        app.run(debug=True)

  8. Save this file as app.py (any name can be used). Run it from the terminal:

    python app.py

    The output is as follows:

Figure 9.31: Output for Flask
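
With the server running, the endpoint can be exercised from a separate Python session. A minimal sketch, assuming the default Flask port 5000 and a made-up review string:

    import requests

    # send a JSON list of raw review strings to the /prediction endpoint
    response = requests.post('http://127.0.0.1:5000/prediction',
                             json=['This product exceeded my expectations!'])
    print(response.text)  # e.g. '[4]' -- class index 4 corresponds to a 5-star rating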