Intelligent Projects Using Python

By : Santanu Pattanayak

Overview of this book

This book is a companion for readers who want to build projects from leading AI domains using Python. It covers detailed implementations of projects from the core disciplines of AI. We start with the basics of creating smart systems using machine learning and deep learning techniques. You will work with neural network architectures such as CNNs, RNNs, and LSTMs to solve real-world problems. You will learn to train a model to detect diabetic retinopathy in the human eye and to create a system that performs video-to-text translation. You will apply transfer learning in the healthcare domain and implement style transfer using GANs. Later, you will learn to build AI-based recommendation systems, a mobile app for sentiment analysis, and a chatbot for customer service. You will apply AI techniques in the cybersecurity domain to generate CAPTCHAs. You will then train an autonomous vehicle to drive itself using reinforcement learning. Throughout, you will use libraries from the Python ecosystem, such as TensorFlow and Keras, to implement the core aspects of machine learning, deep learning, and AI. By the end of this book, you will be able to build your own smart models for a wide range of AI problems.

Preface

Who this book is for

What this book covers

To get the most out of this book

Foundations of Artificial Intelligence Based Systems

Neural networks

Neural activation units

The backpropagation method of training neural networks

Convolutional neural networks

Recurrent neural networks (RNNs)

Generative adversarial networks

Reinforcement learning

Transfer learning

Restricted Boltzmann machines

Transfer Learning

Technical requirements

Introduction to transfer learning

Transfer learning and detecting diabetic retinopathy

The diabetic retinopathy dataset

Formulating the loss function

Taking class imbalances into account

Preprocessing the images

Additional data generation using affine transformation

Network architecture

The optimizer and initial learning rate

Cross-validation

Model checkpoints based on validation log loss

Python implementation of the training process

Results from the categorical classification

Inference at testing time

Performing regression instead of categorical classification

Using the keras sequential utils as generator

Neural Machine Translation

Technical requirements

Rule-based machine translation

Statistical machine-learning systems

Neural machine translation

Implementing a sequence-to-sequence neural translation machine

Style Transfer in Fashion Industry using GANs

Technical requirements

Learning to generate natural handbags from sketched outlines

Preprocess the Images

The generators of the DiscoGAN

The discriminators of the DiscoGAN

Building the network and defining the cost functions

Building the training process

Important parameter values for GAN training

Invoking the training

Monitoring the generator and the discriminator loss

Sample images generated by DiscoGAN

Video Captioning Application

Technical requirements

CNNs and LSTMs in video captioning

A sequence-to-sequence video-captioning system

Data for the video-captioning system

Processing video images to create CNN features

Processing the labelled captions of the video

Building the train and test dataset

Building the model

Creating a word vocabulary for the captions

Training the model

Training results

Inference with unseen test videos

The Intelligent Recommender System

Technical requirements

What is a recommender system?

Latent factorization-based recommendation system

Deep learning for latent factor collaborative filtering

Restricted Boltzmann machines for recommendation

Contrastive divergence

Collaborative filtering using RBMs

Collaborative filtering implementation using RBM

Inference using the trained RBM

Mobile App for Movie Review Sentiment Analysis

Technical requirements

Building an Android mobile app using TensorFlow mobile

Movie review rating in an Android app

Preprocessing the movie review text

Building the model

Training the model

Freezing the model to a protobuf format

Creating a word-to-token dictionary for inference

App interface page design

The core logic of the Android app

Testing the mobile app

Conversational AI Chatbots for Customer Service

Technical requirements

Chatbot architecture

A sequence-to-sequence model using an LSTM

Building a sequence-to-sequence model

Customer support on Twitter

Autonomous Self-Driving Car Through Reinforcement Learning

Technical requirements

Markov decision process

Learning the Q value function

Deep Q learning

Formulating the cost function

Double deep Q learning

Implementing an autonomous self-driving car

Discretizing actions for deep Q learning

Implementing the Double Deep Q network

Designing the agent

The environment for the self-driving car

Putting it all together

Results from the training

CAPTCHA from a Deep-Learning Perspective

Technical requirements

Breaking CAPTCHAs with deep learning

CAPTCHA generation through adversarial learning

Other Books You May Enjoy

Leave a review - let other readers know what you think

Processing the labelled captions of the video

The corpus.csv file contains the descriptions of the videos in the form of text captions; a snippet of the data is shown in Figure 5.5. We can hold out a few [VideoID, Start, End] combinations from this file and treat them as test files for later evaluation:

Figure 5.5: A snapshot of the format of the captions file
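
Holding out records by [VideoID, Start, End] combination can be sketched as follows. This is a minimal illustration using only the standard library, not the book's implementation: the function name split_holdout, the Description column name, and the toy records are assumptions made for the example. The point it shows is that every caption of a held-out clip goes to the test set, so no test clip is seen during training.

```python
import random


def split_holdout(records, n_test, seed=0):
    """Split caption records into train and test sets, clip by clip.

    A clip is identified by its (VideoID, Start, End) combination. All
    captions of a held-out clip are placed in the test set.
    """
    # Collect the unique clips; sort so the shuffle is reproducible per seed.
    clips = sorted({(r["VideoID"], r["Start"], r["End"]) for r in records})
    rng = random.Random(seed)
    rng.shuffle(clips)
    test_clips = set(clips[:n_test])

    train, test = [], []
    for r in records:
        key = (r["VideoID"], r["Start"], r["End"])
        (test if key in test_clips else train).append(r)
    return train, test


# Hypothetical toy records (not taken from corpus.csv).
records = [
    {"VideoID": "vidA", "Start": 0, "End": 10, "Description": "a man is cooking"},
    {"VideoID": "vidA", "Start": 0, "End": 10, "Description": "someone cooks food"},
    {"VideoID": "vidB", "Start": 5, "End": 12, "Description": "a dog is running"},
    {"VideoID": "vidC", "Start": 3, "End": 9, "Description": "a girl is singing"},
]

train, test = split_holdout(records, n_test=1)
```

Splitting on the clip key rather than on individual rows matters because one clip usually has several captions; a row-level split would leak captions of the same clip into both sets.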

The VideoID, Start, and End columns combine to form the video name in the following format: VideoID_Start_End.avi. Based on the video name, the features from the VGG16 convolutional neural network have been stored as VideoID_Start_End.npy. The following function processes the text captions for the videos and creates the path cross-reference to the VGG16 video image features:

def get_clean_caption_data(self,text_path,feat_path):
        text_data = pd.read_csv(text_path...
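
Independently of the function above, the naming convention itself can be sketched in a few lines. This is a minimal illustration and not the book's code: the helper names clip_stem, video_name, and feature_path, and the sample values, are hypothetical. Only the VideoID_Start_End.avi and VideoID_Start_End.npy patterns come from the text.

```python
import os


def clip_stem(video_id, start, end):
    """Return the shared file stem, VideoID_Start_End."""
    return f"{video_id}_{start}_{end}"


def video_name(video_id, start, end):
    """Return the video file name, VideoID_Start_End.avi."""
    return clip_stem(video_id, start, end) + ".avi"


def feature_path(feat_dir, video_id, start, end):
    """Return the path of the stored VGG16 features, VideoID_Start_End.npy.

    feat_dir is assumed to be the directory holding the .npy files.
    """
    return os.path.join(feat_dir, clip_stem(video_id, start, end) + ".npy")


# Hypothetical sample values.
name = video_name("vidA", 5, 12)                 # "vidA_5_12.avi"
path = feature_path("features", "vidA", 5, 12)   # features dir + "vidA_5_12.npy"
```

Because the video file and its feature file share the same stem, each caption row can be linked to its features from the three key columns alone.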