Hands-On Deep Learning Algorithms with Python

By: Sudharsan Ravichandiran

Overview of this book

Deep learning is one of the most popular domains in the AI space, allowing you to develop multi-layered models of varying complexity. This book introduces you to popular deep learning algorithms, from basic to advanced, and shows you how to implement them from scratch using TensorFlow. Throughout the book, you will gain insights into each algorithm, the mathematical principles involved, and how to implement it in the best possible manner. The book starts by explaining how you can build your own neural networks, then introduces you to TensorFlow, the powerful Python-based library for machine learning and deep learning. Moving on, you will get up to speed with gradient descent variants, such as NAG, AMSGrad, AdaDelta, Adam, and Nadam. The book will then provide you with insights into recurrent neural networks (RNNs) and LSTM, and show you how to generate song lyrics with an RNN. Next, you will master the math necessary to work with convolutional and capsule networks, which are widely used for image recognition tasks. You will also learn how machines understand the semantics of words and documents using CBOW, skip-gram, and PV-DM. Finally, you will explore GANs, including InfoGAN and LSGAN, and autoencoders, such as contractive autoencoders and VAE. By the end of this book, you will be equipped with all the skills you need to implement deep learning in your own projects.

Preface

Who this book is for

What this book covers

To get the most out of this book

Section 1: Getting Started with Deep Learning

Introduction to Deep Learning

What is deep learning?

Biological and artificial neurons

ANN and its layers

Exploring activation functions

Forward propagation in ANN

How does ANN learn?

Debugging gradient descent with gradient checking

Putting it all together

Further reading

Getting to Know TensorFlow

What is TensorFlow?

Understanding computational graphs and sessions

Variables, constants, and placeholders

Introducing TensorBoard

Handwritten digit classification using TensorFlow

Introducing eager execution

Math operations in TensorFlow

TensorFlow 2.0 and Keras

Should we use Keras or TensorFlow?

Further reading

Section 2: Fundamental Deep Learning Algorithms

Gradient Descent and Its Variants

Demystifying gradient descent

Gradient descent versus stochastic gradient descent

Momentum-based gradient descent

Adaptive methods of gradient descent

Further reading

Generating Song Lyrics Using RNN

Introducing RNNs

Generating song lyrics using RNNs

Different types of RNN architectures

Further reading

Improvements to the RNN

LSTM to the rescue

Gated recurrent units

Bidirectional RNN

Going deep with deep RNN

Language translation using the seq2seq model

Further reading

Demystifying Convolutional Networks

The architecture of CNNs

The math behind CNNs

Implementing a CNN in TensorFlow

CNN architectures

Capsule networks

Building Capsule networks in TensorFlow

Further reading

Learning Text Representations

Understanding the word2vec model

Building the word2vec model using gensim

Visualizing word embeddings in TensorBoard

Understanding skip-thoughts algorithm

Quick-thoughts for sentence embeddings

Further reading

Section 3: Advanced Deep Learning Algorithms

Generating Images Using GANs

Differences between discriminative and generative models

Say hello to GANs!

DCGAN – Adding convolution to a GAN

Least squares GAN

GANs with Wasserstein distance

Further reading

Learning More about GANs

Conditional GANs

Understanding InfoGAN

Translating images using a CycleGAN

Further reading

Reconstructing Inputs Using Autoencoders

What is an autoencoder?

Autoencoders with convolutions

Exploring denoising autoencoders

Understanding sparse autoencoders

Learning to use contractive autoencoders

Dissecting variational autoencoders

Further reading

Exploring Few-Shot Learning Algorithms

What is few-shot learning?

Siamese networks

Architecture of siamese networks

Prototypical networks

Relation networks

Matching networks

Further reading

Assessments

Chapter 1 - Introduction to Deep Learning

Chapter 2 - Getting to Know TensorFlow

Chapter 3 - Gradient Descent and Its Variants

Chapter 4 - Generating Song Lyrics Using an RNN

Chapter 5 - Improvements to the RNN

Chapter 6 - Demystifying Convolutional Networks

Chapter 7 - Learning Text Representations

Chapter 8 - Generating Images Using GANs

Chapter 9 - Learning More about GANs

Chapter 10 - Reconstructing Inputs Using Autoencoders

Chapter 11 - Exploring Few-Shot Learning Algorithms

Other Books You May Enjoy

What are CNNs?

A convolutional neural network (CNN), also known as a ConvNet, is one of the most widely used deep learning algorithms for computer vision tasks. Let's say we are performing an image-recognition task. Given an image of a horse, we want our CNN to recognize that it contains a horse.

How can we do that? When we feed an image to a computer, the image is represented as a matrix of pixel values. For a standard 8-bit image, each pixel value ranges from 0 to 255, and the matrix has the dimensions [image width x image height x number of channels]. A grayscale image has one channel, and a color image has three channels: red, green, and blue (RGB).

Let's say we have a color input image with a width of 11 and a height of 11, that is, 11 x 11. Our matrix dimensions would then be [11 x 11 x 3], where 11 x 11 represents the image width and height and 3 represents the number of channels...
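To make the matrix layout concrete, here is a minimal sketch that builds an 11 x 11 color image and an 11 x 11 grayscale image as nested Python lists and reads back their dimensions. The helper names `make_image` and `shape` are hypothetical, the pixel values are random placeholders rather than a real photograph, and the [width x height x channels] ordering simply follows the text above. In practice, you would typically hold the same data in a NumPy array or a tensor.

```python
import random


def make_image(width, height, channels, seed=0):
    """Build a width x height x channels nested list of pixel values in 0..255.

    Hypothetical helper: the values are random stand-ins for real pixel data.
    """
    rng = random.Random(seed)
    return [
        [[rng.randint(0, 255) for _ in range(channels)] for _ in range(height)]
        for _ in range(width)
    ]


def shape(image):
    """Return (width, height, channels) for a nested-list image."""
    return (len(image), len(image[0]), len(image[0][0]))


color = make_image(11, 11, 3)  # three channels: red, green, blue
gray = make_image(11, 11, 1)   # one channel: intensity only

print(shape(color))  # (11, 11, 3)
print(shape(gray))   # (11, 11, 1)

# Each entry color[x][y] is one pixel, stored as a list [red, green, blue].
print(len(color[0][0]))  # 3
```

The color image holds 11 x 11 x 3 = 363 values, while the grayscale image holds 11 x 11 x 1 = 121.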