OpenCV 3.x with Python By Example

OpenCV 3.x with Python By Example - Second Edition

By : Gabriel Garrido Calvo, Prateek Joshi

Overview of this book

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease. We start off by manipulating images using simple filtering and geometric transformations. We then discuss affine and projective transformations and see how we can use them to apply advanced manipulations to photos, such as resizing them while keeping the content intact or smoothly removing undesired elements. We then cover object tracking, body part recognition, and object recognition using machine learning techniques such as artificial neural networks. 3D reconstruction and augmented reality techniques are also included. The book covers popular OpenCV libraries with the help of examples. This book is a practical tutorial that covers various examples at different levels, teaching you about the different functions of OpenCV and their actual implementation. By the end of this book, you will have acquired the skills to use OpenCV and Python to develop real-world computer vision applications.

Image color spaces

In computer vision and image processing, a color space is a specific way of organizing colors. A color space is a combination of two things: a color model and a mapping function. Color models are useful because they let us represent pixel values as tuples. The mapping function maps the color model to the set of all possible colors that can be represented.

There are many useful color spaces; some of the more popular ones are RGB, YUV, HSV, and Lab. Each offers different advantages, so we just need to pick the one that's right for the given problem. Let's take a few color spaces and see what information they provide:

RGB: Probably the most popular color space. It stands for Red, Green, and Blue. In this color space, each color is represented as a weighted combination of red, green, and blue. So every pixel value is represented as a tuple of three numbers corresponding to red, green, and blue. For 8-bit images, each value ranges between 0 and 255. Note that OpenCV stores the channels of a loaded image in BGR order rather than RGB.
YUV: Even though RGB is good for many purposes, it tends to be limiting for many real-life applications. People started thinking about different methods to separate the intensity information from the color information. Hence, they came up with the YUV color space. Y refers to the luminance or intensity, and the U and V channels represent color information. This works well in many applications because the human visual system perceives intensity information very differently from color information.
HSV: As it turned out, even YUV was still not good enough for some applications. So people started thinking about how humans perceive color, and they came up with the HSV color space. HSV stands for Hue, Saturation, and Value. This is a cylindrical system where three fundamental properties of a color are separated and represented using different channels. This is closely related to how the human visual system understands color, and it gives us a lot of flexibility in how we handle images.
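To make the idea of "the same color, different tuples" concrete, here is a minimal sketch that uses Python's standard colorsys module rather than OpenCV. The pixel value is an illustrative choice, and the luminance line assumes the common BT.601 weights for Y; OpenCV's own 8-bit scaling differs (for example, it stores hue in the range 0 to 179), so treat this as an illustration of the concepts only.

```python
import colorsys

# 8-bit RGB tuple for a pure red pixel (illustrative value)
r, g, b = 255, 0, 0

# colorsys works on floats in [0, 1], so scale the 8-bit values first
rf, gf, bf = r / 255.0, g / 255.0, b / 255.0

# HSV: hue, saturation, and value, each returned in [0, 1]
h, s, v = colorsys.rgb_to_hsv(rf, gf, bf)

# Luminance (the Y of YUV), assuming the BT.601 weights
y = 0.299 * rf + 0.587 * gf + 0.114 * bf

print("RGB:", (r, g, b))
print("HSV:", (h, s, v))
print("Y  :", round(y, 3))
```

Pure red has full saturation and full value, yet a fairly low luminance, which is the kind of intensity-versus-color separation that YUV and HSV expose and RGB hides.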

Converting color spaces

Considering all the color spaces, there are around 190 conversion options available in OpenCV. If you want to see a list of all available flags, go to the Python shell and type the following:

import cv2
print([x for x in dir(cv2) if x.startswith('COLOR_')])

You will see a list of options available in OpenCV for converting from one color space to another. We can pretty much convert any color space to any other color space. Let's see how we can convert a color image to a grayscale image:

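import cv2
# cv2.imread returns the channels in BGR order
img = cv2.imread('./images/input.jpg', cv2.IMREAD_COLOR)
# Use the BGR flag so that each channel gets the correct weight
gray_img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
cv2.imshow('Grayscale image', gray_img)
cv2.waitKey()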

What just happened?

We use the cvtColor function to convert color spaces. The first argument is the input image and the second argument specifies the color space conversion.

Splitting image channels

You can convert to YUV by using the following flag:

yuv_img = cv2.cvtColor(img, cv2.COLOR_BGR2YUV)

The image will look something like the following one:

This may look like a deteriorated version of the original image, but it's not. Let's separate out the three channels:

# Alternative 1
y,u,v = cv2.split(yuv_img)
cv2.imshow('Y channel', y)
cv2.imshow('U channel', u)
cv2.imshow('V channel', v)
cv2.waitKey()

# Alternative 2 (Faster)
cv2.imshow('Y channel', yuv_img[:, :, 0])
cv2.imshow('U channel', yuv_img[:, :, 1])
cv2.imshow('V channel', yuv_img[:, :, 2])
cv2.waitKey()

Since yuv_img is a NumPy array (which supports slicing along each dimension), we can also separate the three channels by slicing it. If you look at yuv_img.shape, you will see that it is a 3D array. Once you run either of the preceding alternatives, you will see three different images. The following is the Y channel:

The Y channel is basically the grayscale image. Next is the U channel:

And lastly, the V channel:

As we can see here, the Y channel is the same as the grayscale image. It represents the intensity values, while the U and V channels represent the color information.

Merging image channels

Now we are going to read an image, split it into separate channels, and merge them to see how different effects can be obtained out of different combinations:

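import cv2
img = cv2.imread('./images/input.jpg', cv2.IMREAD_COLOR)
# cv2.split returns the channels in BGR order
b,g,r = cv2.split(img)
# Swap the blue and green channels
gbr_img = cv2.merge((g,b,r))
# Use the red channel twice
rbr_img = cv2.merge((r,b,r))
cv2.imshow('Original', img)
cv2.imshow('GBR', gbr_img)
cv2.imshow('RBR', rbr_img)
cv2.waitKey()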

Here we can see how channels can be recombined to obtain different color intensities:

In this one, the red channel is used twice so the reds are more intense:

This should give you a basic idea of how to convert between color spaces. You can play around with more color spaces to see what the images look like. We will discuss the relevant color spaces as and when we encounter them during subsequent chapters.
