Practical Computer Vision

By : Abhinav Dadhich

Practical Computer Vision

By: Abhinav Dadhich

Overview of this book

In this book, you will find several recently proposed methods in various domains of computer vision. You will start by setting up the proper Python environment to work on practical applications. This includes setting up libraries such as OpenCV, TensorFlow, and Keras using Anaconda. Using these libraries, you'll start to understand the concepts of image transformation and filtering. You will find a detailed explanation of feature detectors such as FAST and ORB; you'll use them to find similar-looking objects. With an introduction to convolutional neural nets, you will learn how to build a deep neural net using Keras and how to use it to classify the Fashion-MNIST dataset. With regard to object detection, you will learn the implementation of a simple face detector as well as the workings of complex deep-learning-based object detectors such as Faster R-CNN and SSD using TensorFlow. You'll get started with semantic segmentation using FCN models and track objects with Deep SORT. Not only this, you will also use Visual SLAM techniques such as ORB-SLAM on a standard dataset. By the end of this book, you will have a firm understanding of the different computer vision techniques and how to apply them in your applications.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

A Fast Introduction to Computer Vision

What constitutes computer vision?

Computer vision is everywhere

Getting started

Computer vision research conferences

Summary

Libraries, Development Platform, and Datasets

Libraries and installation

Datasets

Summary

References

Image Filtering and Transformations in OpenCV

Datasets and libraries required

Image manipulation

Introduction to filters

Transformation of an image

Image pyramids

Summary

What is a Feature?

Features use cases

Harris Corner Detection

Summary

References

Convolutional Neural Networks

Datasets and libraries used

Introduction to neural networks

Revisiting the convolution operation

Convolutional Neural Networks

CNN in practice

Summary

Feature-Based Object Detection

Introduction to object detection

Challenges in object detection

Dataset and libraries used

Methods for object detection

Summary

References

Segmentation and Tracking

Datasets and libraries

Segmentation

Tracking

Summary

References

3D Computer Vision

Dataset and libraries

Summary

Mathematics for Computer Vision

Datasets and libraries

Linear algebra

Introduction to probability theory

Summary

Machine Learning for Computer Vision

What is machine learning?

Kinds of machine learning techniques

Dimensionality's curse

A rolling-ball view of learning

Useful tools

Evaluation

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Image formation

The basic camera model is a pinhole camera, though the real-world cameras that we use are far more complex models. A pinhole camera is made up of a very small slit on a plane that allows the formation of an image as depicted in the following figure:

This camera converts a point in the physical world, often termed the real world, to a pixel on an image plane. The conversion follows the transformation of the three-dimensional coordinate to two-dimensional coordinates. Here in the image plane, the coordinates are denoted as where , P_i is any point on an image. In the physical world, the same point is denoted by , where P_w is any point in the physical world with a global reference frame.

P_i(x', y') and P_w(x, y, z) can be related as, for an ideal pin hole camera:

Here, f is focal length of the camera.

For further discussion on geometry of image formation...

Practical Computer Vision

By : Abhinav Dadhich

Practical Computer Vision

By: Abhinav Dadhich

Overview of this book

Related Content you might be interested in

Current Title:

Practical Computer Vision

Computer Vision with Python 3

Hands-On Image Processing with Python

Practical Convolutional Neural Networks