Neural Networks with Keras Cookbook

By: V Kishore Ayyadevara

Overview of this book

This book will take you from the basics of neural networks to advanced implementations of architectures using a recipe-based approach. We will learn how neural networks work and the impact of various hyperparameters on a network's accuracy, along with leveraging neural networks for structured and unstructured data. Later, we will learn how to classify and detect objects in images. We will also learn to use transfer learning for multiple applications, including a self-driving car using Convolutional Neural Networks. We will generate images while leveraging GANs and also by performing image encoding. Additionally, we will perform text analysis using word vector based techniques. Later, we will use Recurrent Neural Networks and LSTMs to implement chatbots and Machine Translation systems. Finally, you will learn about transcribing images and audio, generating captions, and using Deep Q-learning to build an agent that plays the Space Invaders game. By the end of this book, you will have developed the skills to choose and customize multiple neural network architectures for various deep learning problems you might encounter.

Building a neural network in Keras

In the previous section, we built a neural network from scratch; that is, we wrote functions that perform forward propagation and backpropagation.

How to do it...

We will be building a neural network using the Keras library, which provides utilities that make the process of building a complex neural network much easier.

Installing Keras

TensorFlow and Keras can be installed on Ubuntu using the following command:

$pip install --no-cache-dir tensorflow-gpu==1.7

Note that it is preferable to install a GPU-compatible version, as neural networks work considerably faster when they are run on top of a GPU. Keras is a high-level neural network API, written in Python, and capable of running on top of TensorFlow, CNTK, or Theano.

It was developed with a focus on enabling fast experimentation, and it can be installed as follows:

$pip install keras
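
To verify that the installation succeeded, you can import Keras and print its version (a quick sanity check; the exact version string depends on what pip resolved):

$python -c "import keras; print(keras.__version__)"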

Building our first model in Keras

In this section, let's understand the process of building a model in Keras by using the same toy dataset that we worked on in the previous sections (the code file is available as Neural_networks_multiple_layers.ipynb on GitHub):
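
Before building the model, make sure the toy dataset is in memory. As a minimal sketch, we assume here the same input-output pairs used in the previous sections, which follow y = 2x (consistent with the expected predictions of 10 and 12 for the inputs 5 and 6 later in this recipe):

import numpy as np

# Assumed toy dataset from the previous sections: y = 2 * x
x = [[1], [2], [3], [4]]
y = [[2], [4], [6], [8]]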

  1. Instantiate a model that can be called sequentially to add further layers on top of it. The Sequential class enables us to perform the model initialization:
from keras.models import Sequential
model = Sequential()
  2. Add a dense layer to the model. A dense layer ensures the connection between various layers in a model. In the following code, we are connecting the input layer to the hidden layer:
from keras.layers import Dense
model.add(Dense(3, activation='relu', input_shape=(1,)))

In the dense layer initialized with the preceding code, we provided the input shape to the model (we need to specify the shape of the data that the model should expect, as this is the first dense layer).

Additionally, we specified that there will be three connections made to each input (three units in the hidden layer) and that the activation to be performed in the hidden layer is the ReLU activation.

  3. Connect the hidden layer to the output layer:
model.add(Dense(1, activation='linear'))

Note that in this dense layer, we don't need to specify the input shape, as the model infers it from the previous layer.

Also, given that each output is one-dimensional, our output layer has one unit and the activation that we are performing is the linear activation.

The model summary can now be visualized as follows:

model.summary()

A summary of the model is as follows:
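
With the two layers defined above, the summary looks as follows (the layer names may differ between runs, but the shapes and parameter counts are fixed by the architecture):

_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
dense_1 (Dense)              (None, 3)                 6
_________________________________________________________________
dense_2 (Dense)              (None, 1)                 4
=================================================================
Total params: 10
Trainable params: 10
Non-trainable params: 0
_________________________________________________________________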

The preceding output confirms our discussion in the previous section: there are a total of six parameters in the connection from the input layer to the hidden layer, that is, three weights and three bias terms corresponding to the three hidden units. In addition, three weights and one bias term connect the hidden layer to the output layer, for a total of ten parameters.

  4. Compile the model. This is where we define the loss function and the optimizer that reduces the loss function, along with the learning rate corresponding to the optimizer (we will look at different optimizers and loss functions in the next chapter):
from keras.optimizers import SGD
sgd = SGD(lr=0.01)

In the preceding step, we specified that the optimizer is the stochastic gradient descent that we learned about in the previous section, with a learning rate of 0.01. We then pass the predefined optimizer as a parameter and minimize the mean squared error value:

model.compile(optimizer=sgd,loss='mean_squared_error')
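
Equivalently, because 0.01 is the default learning rate of SGD in Keras, the optimizer can also be passed by name:

model.compile(optimizer='sgd', loss='mean_squared_error')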
  5. Fit the model. Update the weights so that the model is a better fit:
model.fit(np.array(x), np.array(y), epochs=1, batch_size = 4, verbose=1)

The fit method expects two NumPy arrays: an input array and the corresponding output array. Note that epochs represents the number of times the total dataset is traversed, and batch_size represents the number of data points considered in one iteration of updating the weights (with four data points and a batch_size of 4, each epoch performs a single weight update). Furthermore, verbose=1 makes the output more detailed, printing the loss value and the progress of the model training process.

  6. Extract the weight values. The order in which the weight values are presented is obtained by inspecting the weights attribute of the model, as follows:
model.weights

The weights are listed in the following order: the three weights (kernel) and three bias terms of the dense_1 layer (the connection between the input layer and the hidden layer), followed by the three weights (kernel) and one bias term of the dense_2 layer (the connection between the hidden layer and the output layer).

Now that we understand the order in which weight values are presented, let's extract the values of these weights:

model.get_weights()

Notice that the weights are presented as a list of arrays, where each array corresponds to the value that is specified in the model.weights output.
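
The actual numbers depend on the random initialization and on training, but the shapes of the arrays are fixed by the architecture; a quick way to confirm the order is to print them:

for w in model.get_weights():
    print(w.shape)
# (1, 3) -> dense_1 kernel (input to hidden weights)
# (3,)   -> dense_1 bias
# (3, 1) -> dense_2 kernel (hidden to output weights)
# (1,)   -> dense_2 bias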

The output of the get_weights call is a list of arrays containing the weight and bias values. You should notice that the output we observe here matches the output we obtained while hand-building the neural network.

  7. Predict the output for a new set of inputs using the predict method:
x1 = [[5],[6]]
model.predict(np.array(x1))

Note that x1 is the variable that holds the values of the new set of examples for which we need to predict the output. Like the fit method, the predict method also expects a NumPy array as its input.
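
To connect this back to the hand-built network from the previous section, we can reproduce predict manually with the extracted weights (a sketch that assumes the architecture defined above: a ReLU hidden layer followed by a linear output layer):

import numpy as np

W1, b1, W2, b2 = model.get_weights()
# Forward pass: ReLU hidden layer followed by a linear output layer
hidden = np.maximum(0, np.dot(np.array(x1), W1) + b1)
output = np.dot(hidden, W2) + b2
print(output)  # matches the result of model.predict(np.array(x1))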

Because the model has been trained for only one epoch, the predicted values in the preceding output are far from the expected outputs. When we run the training for 100 epochs instead, the predictions come much closer to the expected outputs (which are 10 and 12), and they match the expected outputs as we run for an even higher number of epochs.
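
Putting it all together, a minimal end-to-end sketch (under the toy-data assumption stated earlier) that trains for 100 epochs and then predicts:

from keras.models import Sequential
from keras.layers import Dense
from keras.optimizers import SGD
import numpy as np

# Assumed toy dataset: y = 2 * x
x = [[1], [2], [3], [4]]
y = [[2], [4], [6], [8]]

model = Sequential()
model.add(Dense(3, activation='relu', input_shape=(1,)))
model.add(Dense(1, activation='linear'))
model.compile(optimizer=SGD(lr=0.01), loss='mean_squared_error')
model.fit(np.array(x), np.array(y), epochs=100, batch_size=4, verbose=0)

print(model.predict(np.array([[5], [6]])))  # approaches [[10.], [12.]]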