
TensorFlow 1.x Deep Learning Cookbook


Overview of this book

Deep neural networks (DNNs) have achieved a lot of success in the fields of computer vision, speech recognition, and natural language processing. This exciting recipe-based guide will take you from the realm of DNN theory to implementing them practically to solve real-life problems in the artificial intelligence domain. In this book, you will learn how to efficiently use TensorFlow, Google's open source framework for deep learning. You will implement different deep learning networks, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Deep Q-learning Networks (DQNs), and Generative Adversarial Networks (GANs), with easy-to-follow standalone recipes. You will learn how to use TensorFlow with Keras as the backend. You will learn how different DNNs perform on some popularly used datasets, such as MNIST, CIFAR-10, and YouTube-8M. You will not only learn about the different mobile and embedded platforms supported by TensorFlow, but also how to set up cloud platforms for deep learning applications. You will also get a sneak peek at the TPU architecture and how it will affect the future of DNNs. By using crisp, no-nonsense recipes, you will become an expert in implementing deep learning techniques in growing real-world applications and research areas such as reinforcement learning, GANs, and autoencoders.

Understanding the TensorFlow program structure

Programming in TensorFlow is very different from programming in conventional languages. We first need to build a blueprint of whatever neural network we want to create. This is accomplished by dividing the program into two separate parts, namely, the definition of the computational graph and its execution. At first, this might appear cumbersome to the conventional programmer, but it is this separation of graph execution from graph definition that gives TensorFlow its strength, that is, the ability to work on multiple platforms and to execute in parallel.

Computational graph: A computational graph is a network of nodes and edges. In this part, all the data to be used, that is, tensor objects (constants, variables, and placeholders), and all the computations to be performed, that is, Operation objects (referred to as ops for short), are defined. Each node can have zero or more inputs but only one output. Nodes in the network represent objects (tensors and operations), and edges represent the tensors that flow between operations. The computational graph defines the blueprint of the neural network, but the tensors in it have no value associated with them yet.
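
To see this, here is a minimal sketch (the exact tensor name, Add:0, may differ on your system): printing a tensor before any session has run displays only its symbolic description, not its values.

import tensorflow as tf

# Define two constant tensors and an addition op; nothing is computed yet
a = tf.constant([1, 2, 3, 4])
b = tf.constant([2, 1, 5, 3])
c = tf.add(a, b)

# Prints the symbolic tensor, for example: Tensor("Add:0", shape=(4,), dtype=int32)
print(c)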

To build a computation graph, we define all the constants, variables, and operations that we need to perform. Constants, variables, and placeholders will be dealt with in the next recipe. Mathematical operations will be dealt with in detail in the recipe for matrix manipulations. Here, we describe the structure using a simple example of defining and executing a graph to add two vectors.

Execution of the graph: The execution of the graph is performed using a Session object. The Session object encapsulates the environment in which tensor and Operation objects are evaluated. This is the place where the actual calculations and the transfer of information from one layer to another take place. The values of the different tensor objects are initialized, accessed, and saved in the Session object only. Up to now, the tensor objects were just abstract definitions; here, they come to life.

How to do it...

We proceed with the recipe as follows:

  1. We consider a simple example of adding two vectors. We have two input vectors, v_1 and v_2, which are to be fed as input to the Add operation. The graph we want to build is as follows:
  2. The corresponding code to define the computation graph is as follows:
v_1 = tf.constant([1,2,3,4]) 
v_2 = tf.constant([2,1,5,3])
v_add = tf.add(v_1,v_2) # You can also write v_1 + v_2 instead
  3. Next, we execute the graph in the session:
with tf.Session() as sess: 
    print(sess.run(v_add))

The above two commands are equivalent to the following code. The advantage of using the with block is that one does not need to close the session explicitly.

sess = tf.Session() 
print(sess.run(v_add))
sess.close()
  4. This results in printing the sum of the two vectors:
[3 3 8 7] 
Remember that each Session needs to be explicitly closed using the close() method; the with block implicitly closes the session when it ends.

How it works...

Building a computational graph is very simple; you go on adding the variables and operations and letting the tensors flow through them in the sequence in which you build your neural network, layer by layer. TensorFlow also allows you to assign specific devices (CPU/GPU) to different objects of the computational graph using with tf.device(). In our example, the computational graph consists of three nodes: v_1 and v_2 representing the two vectors, and Add, the operation to be performed on them.
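
As a minimal sketch of device placement (the device strings below assume that a GPU is available; allow_soft_placement lets TensorFlow fall back to whatever device actually exists), parts of the graph can be pinned to specific devices like this:

import tensorflow as tf

# Pin the constant definitions to the CPU
with tf.device('/cpu:0'):
    v_1 = tf.constant([1, 2, 3, 4])
    v_2 = tf.constant([2, 1, 5, 3])

# Pin the addition op to the first GPU, if one is present
with tf.device('/gpu:0'):
    v_add = tf.add(v_1, v_2)

# allow_soft_placement lets TensorFlow pick an available device instead
with tf.Session(config=tf.ConfigProto(allow_soft_placement=True)) as sess:
    print(sess.run(v_add))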

Now, to bring this graph to life, we first need to define a session object using tf.Session(); we gave the name sess to our session object. Next, we run it using the run method defined in the Session class as follows:

run(fetches, feed_dict=None, options=None, run_metadata=None) 

This evaluates the tensors in fetches; our example has the tensor v_add in fetches. The run method will execute every tensor and every operation in the graph that leads to v_add. If, instead of v_add, you have v_1 in fetches, the result will be the value of vector v_1:

[1 2 3 4]  

Fetches can be a single tensor/operation object or more than one; for example, if fetches is [v_1, v_2, v_add], the output will be the following:

[array([1, 2, 3, 4]), array([2, 1, 5, 3]), array([3, 3, 8, 7])] 

Within the same program, we can have many session objects.
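
For instance, here is a minimal sketch with two independent session objects evaluating tensors from the same graph:

import tensorflow as tf

v_1 = tf.constant([1, 2, 3, 4])
v_2 = tf.constant([2, 1, 5, 3])
v_add = tf.add(v_1, v_2)

# Two independent sessions can evaluate tensors from the same graph
sess_1 = tf.Session()
sess_2 = tf.Session()
print(sess_1.run(v_add))   # [3 3 8 7]
print(sess_2.run(v_1))     # [1 2 3 4]
sess_1.close()
sess_2.close()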

There's more...

You must be wondering why we have to write so many lines of code for a simple vector addition or to print a small message. Well, you could have very conveniently done this work in a one-liner:

print(tf.Session().run(tf.add(tf.constant([1,2,3,4]),tf.constant([2,1,5,3])))) 

Writing this type of code not only clutters the computational graph, but can also be memory expensive when the same operation (op) is defined repeatedly, for example, inside a for loop. Making a habit of explicitly defining all tensor and operation objects not only makes the code more readable but also helps you visualize the computational graph in a cleaner manner.
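
To illustrate the point, here is a sketch (the loop count of three is arbitrary) contrasting the two styles; the first keeps adding a new Add node to the graph on every iteration, while the second defines the op once and reuses it:

import tensorflow as tf

v_1 = tf.constant([1, 2, 3, 4])
v_2 = tf.constant([2, 1, 5, 3])

# Wasteful: a new Add node is appended to the graph on every iteration
with tf.Session() as sess:
    for _ in range(3):
        print(sess.run(tf.add(v_1, v_2)))

# Better: define the op once, then run it as many times as needed
v_add = tf.add(v_1, v_2)
with tf.Session() as sess:
    for _ in range(3):
        print(sess.run(v_add))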

Visualizing the graph using TensorBoard is one of the most useful capabilities of TensorFlow, especially when building complicated neural networks. The computational graph that we built can be viewed with the help of the Graph object.
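
As a minimal sketch (the log directory name graphs is an arbitrary choice here), the graph can be written to disk with tf.summary.FileWriter and then inspected by pointing TensorBoard at that directory:

import tensorflow as tf

v_1 = tf.constant([1, 2, 3, 4])
v_2 = tf.constant([2, 1, 5, 3])
v_add = tf.add(v_1, v_2)

with tf.Session() as sess:
    # Write the graph definition; view it with: tensorboard --logdir=graphs
    writer = tf.summary.FileWriter('graphs', sess.graph)
    print(sess.run(v_add))
    writer.close()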

If you are working in a Jupyter Notebook or the Python shell, it is more convenient to use tf.InteractiveSession instead of tf.Session. InteractiveSession makes itself the default session, so that you can directly evaluate tensor objects using eval(), without explicitly calling session.run(), as shown in the following example code:

sess = tf.InteractiveSession() 

v_1 = tf.constant([1,2,3,4])
v_2 = tf.constant([2,1,5,3])

v_add = tf.add(v_1,v_2)

print(v_add.eval())

sess.close()