Deep Learning with TensorFlow 2 and Keras - Second Edition

By: Antonio Gulli, Amita Kapoor, Sujit Pal

Overview of this book

Deep Learning with TensorFlow 2 and Keras, Second Edition teaches neural networks and deep learning techniques alongside TensorFlow (TF) and Keras. You’ll learn how to write deep learning applications in the most powerful, popular, and scalable machine learning stack available. TensorFlow is the machine learning library of choice for professional applications, while Keras offers a simple and powerful Python API for accessing TensorFlow. TensorFlow 2 provides full Keras integration, making advanced machine learning easier and more convenient than ever before. This book also introduces neural networks with TensorFlow, runs through the main applications (regression, ConvNets (CNNs), GANs, RNNs, NLP), covers two working example apps, and then dives into TF in production, TF mobile, and using TensorFlow with AutoML.

Sentiment analysis

What is the code we used to test Colab? It is an example of sentiment analysis developed on top of the IMDb dataset. The IMDb dataset contains the text of 50,000 movie reviews from the Internet Movie Database. Each review is either positive or negative (for example, thumbs up or thumbs down). The dataset is split into 25,000 reviews for training and 25,000 reviews for testing. Our goal is to build a classifier that can predict the binary judgment given the text. We can easily load IMDb via tf.keras; the sequences of words in the reviews have already been converted to sequences of integers, where each integer represents a specific word in a dictionary. We also have a convenient way of padding sentences to max_len, so that we can use all sentences, whether short or long, as inputs to a neural network with an input vector of fixed size (we will look at this requirement in more detail in Chapter 8, Recurrent Neural Networks):

import tensorflow as tf
from tensorflow.keras import datasets, layers, models, preprocessing

max_len = 200
n_words = 10000
dim_embedding = 256
EPOCHS = 20
BATCH_SIZE = 500

def load_data():
    # Load the IMDb dataset, keeping only the n_words most frequent words.
    (X_train, y_train), (X_test, y_test) = datasets.imdb.load_data(num_words=n_words)
    # Pad (or truncate) each review to exactly max_len integers.
    X_train = preprocessing.sequence.pad_sequences(X_train, maxlen=max_len)
    X_test = preprocessing.sequence.pad_sequences(X_test, maxlen=max_len)
    return (X_train, y_train), (X_test, y_test)
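
To get a feel for the integer encoding, we can decode a review back into words. This is a quick sanity check, not part of the model itself; note that in the tf.keras IMDb encoding the indices 0-2 are reserved for padding, start-of-sequence, and unknown tokens, so the word index is offset by 3:

# Sanity check (a sketch): decode the first training review back into words.
word_index = datasets.imdb.get_word_index()
# Invert the word -> index mapping, shifting by 3 for the reserved tokens.
index_word = {index + 3: word for word, index in word_index.items()}
(X_train, y_train), _ = load_data()
# Skip the reserved padding/start/unknown markers (indices 0-2).
decoded = " ".join(index_word.get(i, "?") for i in X_train[0] if i > 2)
print(decoded)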

Now let's build a model. We are going to use a few layers that will be explained in detail in Chapter 8, Recurrent Neural Networks. For now, let's assume that the Embedding() layer maps the sparse space of words contained in the reviews into a denser space, which makes computation easier. In addition, we will use a GlobalMaxPooling1D() layer, which takes the maximum value over the sequence dimension for each of the dim_embedding feature channels, collapsing each review into a single fixed-size vector. Finally, we have two Dense() layers. The last one consists of a single neuron with a sigmoid activation function for making the final binary estimation:

def build_model():
    model = models.Sequential()
    # Input: Embedding layer.
    # The model takes as input an integer matrix of size (batch, input_length)
    # and, for each sample, outputs a matrix of size (input_length, dim_embedding).
    # The largest integer in the input should be no larger
    # than n_words (the vocabulary size).
    model.add(layers.Embedding(n_words,
        dim_embedding, input_length=max_len))
    model.add(layers.Dropout(0.3))
    # Take the maximum value over the sequence dimension
    # for each of the dim_embedding features.
    model.add(layers.GlobalMaxPooling1D())
    model.add(layers.Dense(128, activation='relu'))
    model.add(layers.Dropout(0.5))
    model.add(layers.Dense(1, activation='sigmoid'))
    return model
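
To make those shapes concrete, a quick check (a sketch under the definitions above) passes a dummy integer batch through the first layers and prints the tensor shapes before and after pooling:

import numpy as np

# Sketch: inspect the shapes produced by the embedding and pooling layers.
model = build_model()
dummy_batch = np.random.randint(0, n_words, size=(2, max_len))
embedded = model.layers[0](dummy_batch)
print(embedded.shape)   # (2, 200, 256): (batch, input_length, dim_embedding)
pooled = layers.GlobalMaxPooling1D()(embedded)
print(pooled.shape)     # (2, 256): one fixed-size vector per review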

Now we need to train our model, and this piece of code is very similar to what we did with MNIST. Let's see:

(X_train, y_train), (X_test, y_test) = load_data()
model = build_model()
model.summary()
model.compile(optimizer="adam", loss="binary_crossentropy",
    metrics=["accuracy"]
)
# fit() returns a History object with the per-epoch loss and accuracy.
history = model.fit(X_train, y_train,
    epochs=EPOCHS,
    batch_size=BATCH_SIZE,
    validation_data=(X_test, y_test)
)
score = model.evaluate(X_test, y_test, batch_size=BATCH_SIZE)
print("\nTest score:", score[0])
print("Test accuracy:", score[1])
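
Once training completes, the model can score a new review. The following is a minimal sketch; the example sentence and its hand-rolled tokenization via the IMDb word index are illustrative assumptions, not part of the book's code:

# Sketch: score a raw review with the trained model.
word_index = datasets.imdb.get_word_index()
review = "this movie was a wonderful surprise"   # hypothetical input
tokens = []
for w in review.lower().split():
    idx = word_index.get(w)
    if idx is None or idx + 3 >= n_words:
        tokens.append(2)          # unknown / out-of-vocabulary token
    else:
        tokens.append(idx + 3)    # +3 for the reserved tokens
padded = preprocessing.sequence.pad_sequences([tokens], maxlen=max_len)
print("Positive probability:", model.predict(padded)[0][0])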

Let's see the network and then run a few iterations:

Figure 36: The results of the network following a few iterations

As shown in the following image, we reach an accuracy of 85%, which is not bad at all for such a simple network:

Figure 37: Testing the accuracy of a simple network