Deep Learning for Computer Vision

Deep Learning for Computer Vision

By : Rajalingappaa Shanmugamani

Buy this Book

Deep Learning for Computer Vision

By: Rajalingappaa Shanmugamani

Buy this Book

Overview of this book

Deep learning has shown its power in several application areas of Artificial Intelligence, especially in Computer Vision. Computer Vision is the science of understanding and manipulating images, and finds enormous applications in the areas of robotics, automation, and so on. This book will also show you, with practical examples, how to develop Computer Vision applications by leveraging the power of deep learning. In this book, you will learn different techniques related to object classification, object detection, image segmentation, captioning, image generation, face analysis, and more. You will also explore their applications using popular Python libraries such as TensorFlow and Keras. This book will help you master state-of-the-art, deep learning algorithms and their implementation.

Title Page

Packt Upsell

Foreword

Contributors

Preface

Free Chapter

Getting Started

Understanding deep learning

Deep learning for computer vision

Development environment setup

Summary

Image Classification

Training the MNIST model in TensorFlow

Training the MNIST model in Keras

Other popular image testing datasets

The bigger deep learning models

Training a model for cats versus dogs

Developing real-world applications

Summary

Image Retrieval

Understanding visual features

Model inference

Content-based image retrieval

Summary

Object Detection

Detecting objects in an image

Exploring the datasets

Localizing algorithms

Detecting objects

Object detection API

The YOLO object detection algorithm

Summary

Semantic Segmentation

Predicting pixels

Datasets

Algorithms for semantic segmentation

Ultra-nerve segmentation

Segmenting satellite images

Segmenting instances

Summary

Similarity Learning

Algorithms for similarity learning

Human face analysis

Summary

Image Captioning

Understanding the problem and datasets

Understanding natural language processing for image captioning

Approaches for image captioning and related problems

Implementing attention-based image captioning

Summary

Generative Models

Applications of generative models

Neural artistic style transfer

Generative Adversarial Networks

Visual dialogue model

Summary

Video Classification

Understanding and classifying videos

Extending image-based approaches to videos

Summary

Deployment

Performance of models

Deployment in the cloud

Deployment of models in devices

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

The YOLO object detection algorithm

A recent algorithm for object detection is You look only once (YOLO). The image is divided into multiple grids. Each grid cell of the image runs the same algorithm. Let's start the implementation by defining layers with initializers:

def pooling_layer(input_layer, pool_size=[2, 2], strides=2, padding='valid'):
    layer = tf.layers.max_pooling2d(
        inputs=input_layer,
        pool_size=pool_size,
        strides=strides,
        padding=padding
    )
    add_variable_summary(layer, 'pooling')
    return layer

def convolution_layer(input_layer, filters, kernel_size=[3, 3], padding='valid',
                      activation=tf.nn.leaky_relu):
    layer = tf.layers.conv2d(
        inputs=input_layer,
        filters=filters,
        kernel_size=kernel_size,
        activation=activation,
        padding=padding,
        weights_initializer=tf.truncated_normal_initializer(0.0, 0.01),
        weights_regularizer=tf.l2_regularizer(0.0005)
    )
    add_variable_summary...

Deep Learning for Computer Vision

By : Rajalingappaa Shanmugamani

Deep Learning for Computer Vision

By: Rajalingappaa Shanmugamani

Overview of this book

Related Content you might be interested in

Current Title:

Deep Learning for Computer Vision

TensorFlow Deep Learning Projects

Hands-On Computer Vision with TensorFlow 2

Practical Convolutional Neural Networks

The YOLO object detection algorithm