Deep Learning with TensorFlow - Second Edition

By: Giancarlo Zaccone, Md. Rezaul Karim


How does an ANN learn?


The learning process of a neural network is set up as an iterative optimization of its weights, and is therefore supervised. The weights are adjusted based on the network's performance on a set of examples belonging to the training set, that is, examples whose classes are already known.

The aim is to minimize the loss function, which indicates the degree to which the behavior of the network deviates from the desired behavior. The performance of the network is then verified on a testing set consisting of objects (for example, images in an image classification problem) other than those in the training set.
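To make this concrete, here is a minimal sketch in plain NumPy (an illustrative example, not code from this book): a loss function simply measures the gap between the desired and the actual output, and the labelled examples are split so that performance can be verified on data the network has never seen.

    import numpy as np

    def mse_loss(desired, actual):
        # Mean squared error: how far the network's output deviates
        # from the desired output
        return np.mean((desired - actual) ** 2)

    # Hypothetical labelled dataset, split into training and testing sets:
    # the network learns on the first part and is verified on the second
    X, y = np.random.rand(100, 4), np.random.rand(100, 1)
    X_train, y_train = X[:80], y[:80]
    X_test,  y_test  = X[80:], y[80:]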

ANNs and the backpropagation algorithm

A commonly used supervised learning algorithm is the backpropagation algorithm. The basic steps of the training procedure are as follows:

  1. Initialize the net with random weights

  2. For all training cases, follow these steps:

    • Forward pass: Propagates the input through the network to compute the actual output, then calculates the network's error, that is, the difference between the desired output and the actual output

    • Backward pass: For all layers, starting from the output layer and working back to the input layer:

      i: Propagates the error backward through the layer, computing how much the layer's output contributed to the error function.

      ii: Adapts the weights in the current layer so as to minimize the error function. This is backpropagation's optimization step.

The training process ends when the error on the validation set begins to increase, because this could mark the beginning of a phase of overfitting, that is, the phase in which the network starts to interpolate the training data at the expense of generalizability.
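To make the two passes concrete, here is a minimal NumPy sketch of the procedure for a tiny two-layer network with sigmoid activations, trained on made-up data (an illustrative example, not code from this book):

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy training cases: 2 inputs -> 1 output (XOR-like data)
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    y = np.array([[0], [1], [1], [0]], dtype=float)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # Step 1: initialize the net with random weights
    W1, b1 = rng.normal(scale=0.5, size=(2, 4)), np.zeros((1, 4))
    W2, b2 = rng.normal(scale=0.5, size=(4, 1)), np.zeros((1, 1))

    lr = 0.5
    for epoch in range(5000):
        # Forward pass: compute the actual output and the error
        h = sigmoid(X @ W1 + b1)
        out = sigmoid(h @ W2 + b2)
        error = out - y                          # actual minus desired output

        # Backward pass: from the output layer back to the input layer
        d_out = error * out * (1 - out)          # error signal at the output layer
        d_h = (d_out @ W2.T) * h * (1 - h)       # error signal at the hidden layer

        # Adapt the weights of each layer to reduce the error function
        W2 -= lr * (h.T @ d_out)
        b2 -= lr * d_out.sum(axis=0, keepdims=True)
        W1 -= lr * (X.T @ d_h)
        b1 -= lr * d_h.sum(axis=0, keepdims=True)

    # In practice, training would stop early when the error on a separate
    # validation set begins to increase (a sign of overfitting).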

Weight optimization

The availability of efficient algorithms to optimize weights, therefore, constitutes an essential tool for the construction of neural networks. The problem can be solved with an iterative numerical technique called Gradient Descent (GD). This technique works according to the following algorithm:

  1. Randomly choose initial values for the parameters of the model

  2. Compute the gradient G of the error function with respect to each parameter of the model

  3. Change the model's parameters so that they move in the direction of decreasing the error, that is, in the direction of -G

  4. Repeat steps 2 and 3 until the value of G approaches zero

The gradient G of the error function E gives the direction in which the error function, evaluated at the current parameter values, has the steepest slope; so to decrease E, we take small steps in the opposite direction, -G (the size of each step is controlled by a learning rate).

By repeating this operation iteratively, we move downhill towards the minimum of E, eventually reaching a point where G = 0 and no further progress is possible:

Figure 10: Searching for the minimum of the error function E. We move in the direction of -G, that is, opposite to the gradient of E, along which E decreases most steeply.
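The four steps translate almost directly into code. The following is a minimal sketch on a toy linear model with a squared-error function E (an illustration, not code from this book); the learning rate determines the size of each small step taken in the direction of -G:

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy data for a linear model (y is approximately X @ w),
    # with squared-error function E(w)
    X = rng.normal(size=(100, 3))
    y = X @ np.array([1.5, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

    # Step 1: randomly choose initial values for the parameters
    w = rng.normal(size=3)
    learning_rate = 0.1

    for step in range(200):
        # Step 2: compute the gradient G of E(w) = mean((X @ w - y)**2)
        G = 2.0 * X.T @ (X @ w - y) / len(y)
        # Step 3: move the parameters in the direction of -G
        w -= learning_rate * G
        # Step 4: stop when G approaches zero
        if np.linalg.norm(G) < 1e-6:
            break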

Stochastic gradient descent

In GD optimization, we compute the cost gradient based on the complete training set, so we sometimes also call it batch GD. In the case of very large datasets, using GD can be quite costly, since we take only a single step per pass over the training set. The larger the training set, the more slowly our algorithm updates the weights, and the longer it may take to converge to the global cost minimum.

Stochastic Gradient Descent (SGD) is a much faster variant of GD and, for this reason, is widely used for training deep neural networks. In SGD, we use only one training sample from the training set per iteration to update the parameters.

Here, the term stochastic comes from the fact that the gradient based on a single training sample is a stochastic approximation of the true cost gradient. Because of this stochastic nature, the path towards the global cost minimum is not as direct as in GD, but may zigzag if we visualize the cost surface in 2D space:

Figure 11: GD versus SGD. Gradient descent (left) ensures that each weight update is made in the direction that minimizes the cost function, but as the dataset grows and each step becomes more computationally expensive, SGD (right) is preferred. Here, the weights are updated as each sample is processed, so subsequent calculations already use improved weights; for the same reason, however, individual updates are noisy and can occasionally move in the wrong direction.
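Continuing the toy linear-model sketch from the previous section (again an illustration, not code from this book), the only change for SGD is that the parameters are updated after each individual sample, using a noisy single-sample estimate of the gradient:

    import numpy as np

    rng = np.random.default_rng(0)

    # Same toy linear-regression data as in the GD sketch
    X = rng.normal(size=(100, 3))
    y = X @ np.array([1.5, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

    w = rng.normal(size=3)
    learning_rate = 0.01

    for epoch in range(20):
        for i in rng.permutation(len(y)):        # visit the samples in random order
            # Stochastic gradient: estimated from a single training sample,
            # so it is only a noisy approximation of the true cost gradient
            G_i = 2.0 * (X[i] @ w - y[i]) * X[i]
            w -= learning_rate * G_i             # update the weights immediately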