Book Image

Hands-On Mathematics for Deep Learning

By : Jay Dawani

Book Image

Hands-On Mathematics for Deep Learning

By: Jay Dawani

Overview of this book

Most programmers and data scientists struggle with mathematics, having either overlooked or forgotten core mathematical concepts. This book uses Python libraries to help you understand the math required to build deep learning (DL) models. You'll begin by learning about core mathematical and modern computational techniques used to design and implement DL algorithms. This book will cover essential topics, such as linear algebra, eigenvalues and eigenvectors, the singular value decomposition concept, and gradient algorithms, to help you understand how to train deep neural networks. Later chapters focus on important neural networks, such as the linear neural network and multilayer perceptrons, with a primary focus on helping you learn how each model works. As you advance, you will delve into the math used for regularization, multi-layered DL, forward propagation, optimization, and backpropagation techniques to understand what it takes to build full-fledged DL models. Finally, you’ll explore CNN, recurrent neural network (RNN), and GAN models and their application. By the end of this book, you'll have built a strong foundation in neural networks and DL mathematical concepts, which will help you to confidently research and build custom models in DL.

Preface

Who this book is for

What this book covers

To get the most out of this book

Section 1: Essential Mathematics for Deep Learning

Section 1: Essential Mathematics for Deep Learning

Free Chapter

Linear Algebra

Comparing scalars and vectors

Linear equations

Matrix operations

Vector spaces and subspaces

Matrix decompositions

Vector Calculus

Vector Calculus

Single variable calculus

Multivariable calculus

Vector calculus

Probability and Statistics

Probability and Statistics

Understanding the concepts in probability

Essential concepts in statistics

Optimization

Understanding optimization and it's different types

Exploring the various optimization methods

Exploring population methods

Graph Theory

Understanding the basic concepts and terminology

Adjacency matrix

Types of graphs

Graph Laplacian

Section 2: Essential Neural Networks

Section 2: Essential Neural Networks

Linear Neural Networks

Linear Neural Networks

Linear regression

Polynomial regression

Logistic regression

Feedforward Neural Networks

Feedforward Neural Networks

Understanding biological neural networks

Comparing the perceptron and the McCulloch-Pitts neuron

Training neural networks

Deep neural networks

Regularization

The need for regularization

Parameter tying and sharing

Dataset augmentation

Adversarial training

Convolutional Neural Networks

Convolutional Neural Networks

The inspiration behind ConvNets

Types of data used in ConvNets

Convolutions and pooling

Working with the ConvNet architecture

Training and optimization

Exploring popular ConvNet architectures

Recurrent Neural Networks

Recurrent Neural Networks

The need for RNNs

The types of data used in RNNs

Understanding RNNs

Long short-term memory

Gated recurrent units

Training and optimization

Popular architecture

Section 3: Advanced Deep Learning Concepts Simplified

Section 3: Advanced Deep Learning Concepts Simplified

Attention Mechanisms

Attention Mechanisms

Overview of attention

Understanding neural Turing machines

Exploring the types of attention

Generative Models

Generative Models

Why we need generative models

Generative adversarial networks

Flow-based networks

Transfer and Meta Learning

Transfer and Meta Learning

Transfer learning

Geometric Deep Learning

Geometric Deep Learning

Comparing Euclidean and non-Euclidean data

Graph neural networks

Spectral graph CNNs

Mixture model networks

Facial recognition in 3D

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Comparing scalars and vectors

Scalars are regular numbers, such as 7, 82, and 93,454. They only have a magnitude and are used to represent time, speed, distance, length, mass, work, power, area, volume, and so on.

Vectors, on the other hand, have magnitude and direction in many dimensions. We use vectors to represent velocity, acceleration, displacement, force, and momentum. We write vectors in bold—such as a instead of a—and they are usually an array of multiple numbers, with each number in this array being an element of the vector.

We denote this as follows:

Here, shows the vector is in n-dimensional real space, which results from taking the Cartesian product of n times; shows each element is a real number; i is the position of each element; and, finally, is a natural number, telling us how many elements are in the vector.

As with regular numbers, you can add and subtract vectors. However, there are some limitations.

Let's take the vector we saw earlier (x) and add it with another vector (y), both of which are in , so that the following applies:

However, we cannot add vectors with vectors that do not have the same dimension or scalars.

Note that when in , we reduce to 2-dimensions (for example, the surface of a sheet of paper), and when n = 3, we reduce to 3-dimensions (the real world).

We can, however, multiply scalars with vectors. Let λ be an arbitrary scalar, which we will multiply with the vector , so that the following applies:

As we can see, λ gets multiplied by each x_i in the vector. The result of this operation is that the vector gets scaled by the value of the scalar.

For example, let , and . Then, we have the following:

While this works fine for multiplying by a whole number, it doesn't help when working with fractions, but you should be able to guess how it works. Let's see an example.

Let , and . Then, we have the following:

There is a very special vector that we can get by multiplying any vector by the scalar, 0. We denote this as 0 and call it the zero vector (a vector containing only zeros).