Python Image Processing Cookbook

By: Sandipan Dey

Overview of this book

With the advancements in wireless devices and mobile technology, there's increasing demand for people with digital image processing skills in order to extract useful information from the ever-growing volume of images. This book provides comprehensive coverage of the relevant tools and algorithms, and guides you through analysis and visualization for image processing. With the help of over 60 cutting-edge recipes, you'll address common challenges in image processing and learn how to perform complex tasks such as object detection, image segmentation, and image reconstruction using large hybrid datasets. Dedicated sections will also take you through implementing various image enhancement and image restoration techniques, such as cartooning, gradient blending, and sparse dictionary learning. As you advance, you'll get to grips with face morphing and image segmentation techniques. With an emphasis on practical solutions, this book will help you apply deep learning techniques such as transfer learning and fine-tuning to solve real-world problems. By the end of this book, you'll be proficient in utilizing the capabilities of the Python ecosystem to implement various image processing techniques effectively.

Simulating light art/long exposure

Long exposure (or light art) refers to the process of creating a photo that captures the effect of passing time. Popular examples of long-exposure photographs include silky-smooth water and the continuous light trails left by car headlights on highways. In this recipe, we will simulate long exposure by averaging the image frames from a video.

Getting ready

We will extract image frames from a video and then average the frames to simulate light art. Let's start by importing the required libraries:

from glob import glob
import cv2
import numpy as np
import matplotlib.pylab as plt

How to do it...

The following steps need to be performed:

  1. Implement an extract_frames() function to extract the first 200 frames (at most) from a video passed as input to the function:
def extract_frames(vid_file):
    vidcap = cv2.VideoCapture(vid_file)
    success, image = vidcap.read()
    i = 1
    while success and i <= 200:
        # save each captured frame as a .jpg file
        cv2.imwrite('images/exposure/vid_{}.jpg'.format(i), image)
        success, image = vidcap.read()
        i += 1
    vidcap.release()
  2. Call the function to save all of the frames (as .jpg) extracted from the video of the Godafoss waterfall (Iceland) to the exposure folder:
extract_frames('images/godafost.mp4') #cloud.mp4
  3. Read all the .jpg files from the exposure folder one by one (as float); split each image into its B, G, and R channels; compute a running sum of the color channels; and finally, compute the average values of the color channels:
imfiles = glob('images/exposure/*.jpg')
nfiles = len(imfiles)
R1, G1, B1 = 0, 0, 0
for i in range(nfiles):
    image = cv2.imread(imfiles[i]).astype(float)
    (B, G, R) = cv2.split(image)  # OpenCV loads images in BGR channel order
    R1 += R
    B1 += B
    G1 += G
R1, G1, B1 = R1 / nfiles, G1 / nfiles, B1 / nfiles
  4. Merge the averaged color channels and save the final output image (an alternative in-memory approach is sketched after these steps):
final = cv2.merge([B1, G1, R1])
cv2.imwrite('images/godafost.png', final.astype(np.uint8))  # cast back to 8-bit before saving
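The same averaging can also be done entirely in memory, without writing intermediate JPEG files to disk. The following is a minimal sketch (not the book's code; simulate_long_exposure() is a hypothetical helper, and the video path is reused from the recipe above):

import cv2
import numpy as np

def simulate_long_exposure(vid_file, max_frames=200):
    vidcap = cv2.VideoCapture(vid_file)
    acc, count = None, 0
    success, frame = vidcap.read()
    while success and count < max_frames:
        frame = frame.astype(np.float64)
        acc = frame if acc is None else acc + frame  # running sum of the frames
        count += 1
        success, frame = vidcap.read()
    vidcap.release()
    return (acc / count).astype(np.uint8)  # average of the frames, back to 8-bit

final = simulate_long_exposure('images/godafost.mp4')
cv2.imwrite('images/godafost.png', final)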

The following photo shows one of the extracted input frames:

If you run the preceding code block, you will obtain a long-exposure image similar to the one shown here:

Notice the continuous effects in the clouds and the waterfall.

How it works...

The VideoCapture() function from OpenCV-Python was used to create a VideoCapture object with the video file as input. Then, the read() method of that object was used to capture frames from the video.

The imread() and imwrite() functions from OpenCV-Python were used to read/write images from/to disk.

The cv2.split() function was used to split a color (BGR) image into its individual channels, while the cv2.merge() function was used to combine them back into a multi-channel image.
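The following minimal snippet (a sketch; the frame path assumes the files written by the recipe above) illustrates that the split/merge round trip described here is lossless:

import cv2
import numpy as np

img = cv2.imread('images/exposure/vid_1.jpg')  # a BGR image, as loaded by OpenCV
b, g, r = cv2.split(img)                       # three single-channel arrays
restored = cv2.merge([b, g, r])                # recombine in the same channel order
assert np.array_equal(img, restored)           # split followed by merge is lossless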

There's more...

Focus stacking (also known as extended depth of field) is a technique in image processing/computational photography that takes multiple images of the same subject, captured at different focus distances, as input and combines them into an output image with a greater depth of field (DOF) than any of the individual source images. We can simulate focus stacking in Python. The following is an example of focus stacking grayscale image frames extracted from a video using the mahotas library.

Extended depth of field with mahotas

Perform the following steps to implement focus stacking with the mahotas library functions:

  1. Create the image stack first by extracting grayscale image frames from a highway traffic video shot at night:
import mahotas as mh

def create_image_stack(vid_file, n=200):
    vidcap = cv2.VideoCapture(vid_file)
    success, image = vidcap.read()
    i = 0
    h, w = image.shape[:2]
    imstack = np.zeros((n, h, w))
    while success and i < n:
        # convert each BGR frame to grayscale and store it in the stack
        imstack[i,...] = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
        success, image = vidcap.read()
        i += 1
    vidcap.release()
    return imstack[:i]  # drop unused slots if the video has fewer than n frames

image = create_image_stack('images/highway.mp4') #cloud.mp4
stack, h, w = image.shape
  2. Use the sobel() function from mahotas as the pixel-level measure of infocusness:
focus = np.array([mh.sobel(t, just_filter=True) for t in image])
  3. At each pixel location, select the best slice (the one with maximum infocusness) and create the final image (the indexing trick is illustrated with a toy example after these steps):
best = np.argmax(focus, 0)
image = image.reshape((stack,-1)) # image is now (stack, nr_pixels)
image = image.transpose() # image is now (nr_pixels, stack)
final = image[np.arange(len(image)), best.ravel()] # select the best slice for each pixel location
final = final.reshape((h,w)) # reshape to get the final result
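The pixel-selection step relies on NumPy fancy indexing: after flattening the stack to shape (nr_pixels, stack), indexing with (np.arange(nr_pixels), best) picks one slice per pixel in a single vectorized operation. The following toy example (a sketch with made-up values, independent of the recipe's data) shows the same trick on a tiny 3-slice, 2x2 stack:

import numpy as np

stack = np.arange(12).reshape(3, 2, 2)  # 3 slices of a 2x2 "image"
focus = np.random.rand(3, 2, 2)         # stand-in per-pixel focus measure
best = np.argmax(focus, 0)              # index of the sharpest slice per pixel, shape (2, 2)
flat = stack.reshape(3, -1).T           # shape (nr_pixels, stack)
picked = flat[np.arange(flat.shape[0]), best.ravel()].reshape(2, 2)
print(picked)                           # one value per pixel, taken from its best slice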

The following photo is an input image used in the image stack:

The following screenshot is the final output image produced by the algorithm implementation:

See also