Intelligent Projects Using Python

By : Santanu Pattanayak

Intelligent Projects Using Python

By: Santanu Pattanayak

Overview of this book

This book will be a perfect companion if you want to build insightful projects from leading AI domains using Python. The book covers detailed implementation of projects from all the core disciplines of AI. We start by covering the basics of how to create smart systems using machine learning and deep learning techniques. You will assimilate various neural network architectures such as CNN, RNN, LSTM, to solve critical new world challenges. You will learn to train a model to detect diabetic retinopathy conditions in the human eye and create an intelligent system for performing a video-to-text translation. You will use the transfer learning technique in the healthcare domain and implement style transfer using GANs. Later you will learn to build AI-based recommendation systems, a mobile app for sentiment analysis and a powerful chatbot for carrying customer services. You will implement AI techniques in the cybersecurity domain to generate Captchas. Later you will train and build autonomous vehicles to self-drive using reinforcement learning. You will be using libraries from the Python ecosystem such as TensorFlow, Keras and more to bring the core aspects of machine learning, deep learning, and AI. By the end of this book, you will be skilled to build your own smart models for tackling any kind of AI problems without any hassle.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Foundations of Artificial Intelligence Based Systems

Neural networks

Neural activation units

The backpropagation method of training neural networks

Convolutional neural networks

Recurrent neural networks (RNNs)

Generative adversarial networks

Reinforcement learning

Transfer learning

Restricted Boltzmann machines

Autoencoders

Summary

Transfer Learning

Technical requirements

Introduction to transfer learning

Transfer learning and detecting diabetic retinopathy

The diabetic retinopathy dataset

Formulating the loss function

Taking class imbalances into account

Preprocessing the images

Additional data generation using affine transformation

Network architecture

The optimizer and initial learning rate

Cross-validation

Model checkpoints based on validation log loss

Python implementation of the training process

Results from the categorical classification

Inference at testing time

Performing regression instead of categorical classification

Using the keras sequential utils as generator

Summary

Neural Machine Translation

Technical requirements

Rule-based machine translation

Statistical machine-learning systems

Neural machine translation

Implementing a sequence-to-sequence neural translation machine

Summary

Style Transfer in Fashion Industry using GANs

Technical requirements

DiscoGAN

CycleGAN

Learning to generate natural handbags from sketched outlines

Preprocess the Images

The generators of the DiscoGAN

The discriminators of the DiscoGAN

Building the network and defining the cost functions

Building the training process

Important parameter values for GAN training

Invoking the training

Monitoring the generator and the discriminator loss

Sample images generated by DiscoGAN

Summary

Video Captioning Application

Technical requirements

CNNs and LSTMs in video captioning

A sequence-to-sequence video-captioning system

Data for the video-captioning system

Processing video images to create CNN features

Processing the labelled captions of the video

Building the train and test dataset

Building the model

Creating a word vocabulary for the captions

Training the model

Training results

Inference with unseen test videos

Summary

The Intelligent Recommender System

Technical requirements

What is a recommender system?

Latent factorization-based recommendation system

Deep learning for latent factor collaborative filtering

SVD++

Restricted Boltzmann machines for recommendation

Contrastive divergence

Collaborative filtering using RBMs

Collaborative filtering implementation using RBM

Inference using the trained RBM

Summary

Mobile App for Movie Review Sentiment Analysis

Technical requirements

Building an Android mobile app using TensorFlow mobile

Movie review rating in an Android app

Preprocessing the movie review text

Building the model

Training the model

Freezing the model to a protobuf format

Creating a word-to-token dictionary for inference

App interface page design

The core logic of the Android app

Testing the mobile app

Summary

Conversational AI Chatbots for Customer Service

Technical requirements

Chatbot architecture

A sequence-to-sequence model using an LSTM

Building a sequence-to-sequence model

Customer support on Twitter

Summary

Autonomous Self-Driving Car Through Reinforcement Learning

Technical requirements

Markov decision process

Learning the Q value function

Deep Q learning

Formulating the cost function

Double deep Q learning

Implementing an autonomous self-driving car

Discretizing actions for deep Q learning

Implementing the Double Deep Q network

Designing the agent

The environment for the self-driving car

Putting it all together

Results from the training

Summary

CAPTCHA from a Deep-Learning Perspective

Technical requirements

Breaking CAPTCHAs with deep learning

CAPTCHA generation through adversarial learning

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Generative adversarial networks

Generative adversarial networks, popularly known as GANs, are generative models that learn a specific probability distribution through a generator, G. The generator G plays a zero sum minimax game with a discriminator D and both evolve over time, before the Nash equilibrium is reached. The generator tries to produce samples similar to the ones generated by a given probability distribution, P(x), while the discriminator D tries to distinguish those fake data samples generated by the generator G from the data sample from the original distribution. The generator G tries to generate samples similar to the ones from P(x), by converting samples, z, drawn from a noise distribution, P(z). The discriminator, D, learns to tag samples generated by the generator G as G(z) when fake; x belongs to P(x) when they are original. At the equilibrium of the minimax game, the generator will learn to produce samples similar to the ones generated by the original distribution, P(x), so that the following is true:

The following diagram illustrates a GAN network learning the probability distribution of the MNIST digits:

Figure 1.14: GAN architecture

The cost function minimized by the discriminator is the binary cross-entropy for distinguishing the real data points belonging to the probability distribution P(x) from the fake ones generated by the generator (that is, G(z)):

The generator will try to maximize the same cost function given by (1). This means that, the optimization problem can be formulated as a minimax player with the utility function U(G,D), as illustrated here:

Generally, to measure how far a given probability distribution matches that of a given distribution, f-divergence measures are used, such as the Kullback–Leibler (KL) divergence, the Jensen Shannon divergence, and the Bhattacharyya distance. For example, the KL divergence between two probability distributions, P and Q, is given by the following, where the expectation is with respect to the distribution, P:

Similarly, the Jensen Shannon divergence between P and Q is given as follows:

Now, coming back to (2), the expression can be written as follows:

Here, G(x) is the probability distribution for the generator. Expanding the expectation into its integral form, we get the following:

For a fixed generator distribution, G(x), the utility function will be at a minimum with respect to the discriminator if the following is true:

Substituting D(x) from (5) in (3), we get the following:

Now, the task of the generator is to maximize the utility, , or minimize the utility, . The expression for can be rearranged as follows:

Hence, we can see that the generator minimizing is equivalent to minimizing the Jensen Shannon divergence between the real distribution, P(x), and the distribution of the samples generated by the generator, G (that is, G(x)).

Training a GAN is not a straightforward process, and there are several technical considerations that we need to take into account while training such a network. We will be using an advanced GAN network to build a cross-domain style transfer application in Chapter 4, Style Transfer in Fashion Industry using GANs.

Intelligent Projects Using Python

By : Santanu Pattanayak

Intelligent Projects Using Python

By: Santanu Pattanayak

Overview of this book

Related Content you might be interested in

Current Title:

Intelligent Projects Using Python

Deep Learning for Natural Language Processing

Deep Learning Quick Reference

Hands-On Deep Learning Architectures with Python