TensorFlow Machine Learning Cookbook

TensorFlow Machine Learning Cookbook

By : Nick McClure

Buy this Book

TensorFlow Machine Learning Cookbook

By: Nick McClure

Buy this Book

Overview of this book

TensorFlow is an open source software library for Machine Intelligence. The independent recipes in this book will teach you how to use TensorFlow for complex data computations and will let you dig deeper and gain more insights into your data than ever before. You’ll work through recipes on training models, model evaluation, sentiment analysis, regression analysis, clustering analysis, artificial neural networks, and deep learning – each using Google’s machine learning library TensorFlow. This guide starts with the fundamentals of the TensorFlow library which includes variables, matrices, and various data sources. Moving ahead, you will get hands-on experience with Linear Regression techniques with TensorFlow. The next chapters cover important high-level concepts such as neural networks, CNN, RNN, and NLP. Once you are familiar and comfortable with the TensorFlow ecosystem, the last chapter will show you how to take it to production.

TensorFlow Machine Learning Cookbook

Credits

About the Author

About the Reviewer

www.PacktPub.com

Customer Feedback

Preface

Free Chapter

Getting Started with TensorFlow

Introduction

How TensorFlow Works

Declaring Tensors

Using Placeholders and Variables

Working with Matrices

Declaring Operations

Implementing Activation Functions

Working with Data Sources

Additional Resources

The TensorFlow Way

Introduction

Operations in a Computational Graph

Layering Nested Operations

Working with Multiple Layers

Implementing Loss Functions

Implementing Back Propagation

Working with Batch and Stochastic Training

Combining Everything Together

Evaluating Models

Linear Regression

Introduction

Using the Matrix Inverse Method

Implementing a Decomposition Method

Learning The TensorFlow Way of Linear Regression

Understanding Loss Functions in Linear Regression

Implementing Deming regression

Implementing Lasso and Ridge Regression

Implementing Elastic Net Regression

Implementing Logistic Regression

Support Vector Machines

Introduction

Working with a Linear SVM

Reduction to Linear Regression

Working with Kernels in TensorFlow

Implementing a Non-Linear SVM

Implementing a Multi-Class SVM

Nearest Neighbor Methods

Introduction

Working with Nearest Neighbors

Working with Text-Based Distances

Computing with Mixed Distance Functions

Using an Address Matching Example

Using Nearest Neighbors for Image Recognition

Neural Networks

Introduction

Implementing Operational Gates

Working with Gates and Activation Functions

Implementing a One-Layer Neural Network

Implementing Different Layers

Using a Multilayer Neural Network

Improving the Predictions of Linear Models

Learning to Play Tic Tac Toe

Natural Language Processing

Introduction

Working with bag of words

Implementing TF-IDF

Working with Skip-gram Embeddings

Working with CBOW Embeddings

Making Predictions with Word2vec

Using Doc2vec for Sentiment Analysis

Convolutional Neural Networks

Introduction

Implementing a Simpler CNN

Implementing an Advanced CNN

Retraining Existing CNNs models

Applying Stylenet/Neural-Style

Implementing DeepDream

Recurrent Neural Networks

Introduction

Implementing RNN for Spam Prediction

Implementing an LSTM Model

Stacking multiple LSTM Layers

Creating Sequence-to-Sequence Models

Training a Siamese Similarity Measure

Taking TensorFlow to Production

Introduction

Implementing unit tests

Using Multiple Executors

Parallelizing TensorFlow

Taking TensorFlow to Production

Productionalizing TensorFlow – An Example

More with TensorFlow

Introduction

Visualizing graphs in Tensorboard

There's more…

Working with a Genetic Algorithm

Clustering Using K-Means

Solving a System of ODEs

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

How TensorFlow Works

At first, computation in TensorFlow may seem needlessly complicated. But there is a reason for it: because of how TensorFlow treats computation, developing more complicated algorithms is relatively easy. This recipe will guide us through the pseudocode of a TensorFlow algorithm.

Getting ready

Currently, TensorFlow is supported on Linux, Mac, and Windows. The code for this book has been created and run on a Linux system, but should run on any other system as well. The code for the book is available on GitHub at https://github.com/nfmcclure/tensorflow_cookbookTensorFlow. Throughout this book, we will only concern ourselves with the Python library wrapper of TensorFlow, although most of the original core code for TensorFlow is written in C++. This book will use Python 3.4+ (https://www.python.org) and TensorFlow 0.12 (https://www.tensorflow.org). TensorFlow has a 1.0.0 alpha version available on the official GitHub site, and the code in this book has been reviewed to be compatible with that version as well. While TensorFlow can run on the CPU, most algorithms run faster if processed on the GPU, and it is supported on graphics cards with Nvidia Compute Capability v4.0+ (v5.1 recommended). Popular GPUs for TensorFlow are Nvidia Tesla architectures and Pascal architectures with at least 4 GB of video RAM. To run on a GPU, you will also need to download and install the Nvidia Cuda Toolkit and also v 5.x + (https://developer.nvidia.com/cuda-downloads). Some of the recipes will rely on a current installation of the Python packages: Scipy, Numpy, and Scikit-Learn. These accompanying packages are also all included in the Anaconda package (https://www.continuum.io/downloads).

How to do it…

Here we will introduce the general flow of TensorFlow algorithms. Most recipes will follow this outline:

Import or generate datasets: All of our machine-learning algorithms will depend on datasets. In this book, we will either generate data or use an outside source of datasets. Sometimes it is better to rely on generated data because we will just want to know the expected outcome. Most of the time, we will access public datasets for the given recipe and the details on accessing these are given in section 8 of this chapter.
Transform and normalize data: Normally, input datasets do not come in the shape TensorFlow would expect so we need to transform TensorFlow them to the accepted shape. The data is usually not in the correct dimension or type that our algorithms expect. We will have to transform our data before we can use it. Most algorithms also expect normalized data and we will do this here as well. TensorFlow has built-in functions that can normalize the data for you as follows:
```
data = tf.nn.batch_norm_with_global_normalization(...)
```
Partition datasets into train, test, and validation sets: We generally want to test our algorithms on different sets that we have trained on. Also, many algorithms require hyperparameter tuning, so we set aside a validation set for determining the best set of hyperparameters.
Set algorithm parameters (hyperparameters): Our algorithms usually have a set of parameters that we hold constant throughout the procedure. For example, this can be the number of iterations, the learning rate, or other fixed parameters of our choosing. It is considered good form to initialize these together so the reader or user can easily find them, as follows:
```
learning_rate = 0.01
batch_size = 100
iterations = 1000
```
Initialize variables and placeholders: TensorFlow depends on knowing what it can and cannot modify. TensorFlow will modify/adjust the variables and weight/bias during optimization to minimize a loss function. To accomplish this, we feed in data through placeholders. We need to initialize both of these variables and placeholders with size and type, so that TensorFlow knows what to expect. TensorFlow also needs to know the type of data to expect: for most of this book, we will use float32. TensorFlow also provides float64 and float16. Note that the more bytes used for precision results in slower algorithms, but the less we use results in less precision. See the following code:
```
a_var = tf.constant(42)
x_input = tf.placeholder(tf.float32, [None, input_size])
y_input = tf.placeholder(tf.float32, [None, num_classes])
```
Define the model structure: After we have the data, and have initialized our variables and placeholders, we have to define the model. This is done by building a computational graph. TensorFlow chooses what operations and values must be the variables and placeholders to arrive at our model outcomes. We talk more in depth about computational graphs in the Operations in a Computational Graph TensorFlow recipe in Chapter 2, The TensorFlow Way. Our model for this example will be a linear model:
```
y_pred = tf.add(tf.mul(x_input, weight_matrix), b_matrix)
```
Declare the loss functions: After defining the model, we must be able to evaluate the output. This is where we declare the loss function. The loss function is very important as it tells us how far off our predictions are from the actual values. The different types of loss functions are explored in greater detail, in the Implementing Back Propagation recipe in Chapter 2, The TensorFlow Way:
```
loss = tf.reduce_mean(tf.square(y_actual – y_pred))
```
Initialize and train the model: Now that we have everything in place, we need to create an instance of our graph, feed in the data through the placeholders, and let TensorFlow change the variables to better predict our training data. Here is one way to initialize the computational graph:
```
with tf.Session(graph=graph) as session:
  ...
  session.run(...)
  ...
```
Note that we can also initiate our graph with:
```
session = tf.Session(graph=graph)
session.run(…)
```
Evaluate the model: Once we have built and trained the model, we should evaluate the model by looking at how well it does with new data through some specified criteria. We evaluate on the train and test set and these evaluations will allow us to see if the model is underfit or overfit. We will address these in later recipes.
Tune hyperparameters: Most of the time, we will want to go back and change some of the hyperparamters, based on the model performance. We then repeat the previous steps with different hyperparameters and evaluate the model on the validation set.
Deploy/predict new outcomes: It is also important to know how to make predictions on new, unseen, data. We can do this with all of our models, once we have them trained.

How it works…

In TensorFlow, we have to set up the data, variables, placeholders, and model before we tell the program to train and change the variables to improve the predictions. TensorFlow accomplishes this through the computational graphs. These computational graphs are a directed graphs with no recursion, which allows for computational parallelism. We create a loss function for TensorFlow to minimize. TensorFlow accomplishes this by modifying the variables in the computational graph. Tensorflow knows how to modify the variables because it keeps track of the computations in the model and automatically computes the gradients for every variable. Because of this, we can see how easy it can be to make changes and try different data sources.

TensorFlow Machine Learning Cookbook

By : Nick McClure

TensorFlow Machine Learning Cookbook

By: Nick McClure

Overview of this book

Related Content you might be interested in

Current Title:

TensorFlow Machine Learning Cookbook

Machine Learning Using TensorFlow Cookbook

Predictive Analytics with TensorFlow

Deep Learning with TensorFlow

How TensorFlow Works

Getting ready

How to do it…

How it works…

See also