Python Image Processing Cookbook

By : Sandipan Dey

Python Image Processing Cookbook

By: Sandipan Dey

Overview of this book

With the advancements in wireless devices and mobile technology, there's increasing demand for people with digital image processing skills in order to extract useful information from the ever-growing volume of images. This book provides comprehensive coverage of the relevant tools and algorithms, and guides you through analysis and visualization for image processing. With the help of over 60 cutting-edge recipes, you'll address common challenges in image processing and learn how to perform complex tasks such as object detection, image segmentation, and image reconstruction using large hybrid datasets. Dedicated sections will also take you through implementing various image enhancement and image restoration techniques, such as cartooning, gradient blending, and sparse dictionary learning. As you advance, you'll get to grips with face morphing and image segmentation techniques. With an emphasis on practical solutions, this book will help you apply deep learning techniques such as transfer learning and fine-tuning to solve real-world problems. By the end of this book, you'll be proficient in utilizing the capabilities of the Python ecosystem to implement various image processing techniques effectively.

Preface

Who this book is for

What this book covers

To get the most out of this book

Sections

Get in touch

Image Manipulation and Transformation

Technical requirements

Transforming color space (RGB → Lab)

Applying affine transformation

Applying perspective transformation and homography

Creating pencil sketches from images

Creating cartoonish images

Simulating light art/long exposure

Object detection using color in HSV

Free Chapter

Image Enhancement

Applying filters to denoise different types of noise in an image

Image denoising with a denoising autoencoder

Image denoising with PCA/DFT/DWT

Image denoising with anisotropic diffusion

Improving image contrast with histogram equalization

Implementing histogram matching

Performing gradient blending

Edge detection with Canny, LoG/zero-crossing, and wavelets

Image Restoration

Restoring an image with the Wiener filter

Restoring an image with the constrained least squares filter

Image restoration with a Markov random field

Image inpainting

Image completion with inpainting using deep learning

Image restoration with dictionary learning

Compressing an image using wavelets

Using steganography and steganalysis

Binary Image Processing

Applying morphological operators to a binary image

Applying Morphological filters

Morphological pattern matching

Segmenting images with morphology

Counting objects

Image Registration

Medical image registration with SimpleITK

Image alignment with ECC algorithm and warping

Face alignment with dlib

Robust matching and homography with the RANSAC algorithm

Image mosaicing (panorama)

Face morphing

Implementing an image search engine

Image Segmentation

Thresholding with Otsu and Riddler–Calvard

Image segmentation with self-organizing maps

RandomWalk segmentation with scikit-image

Human skin segmentation with the GMM-EM algorithm

Medical image segmentation

Deep semantic segmentation

Deep instance segmentation

Image Classification

Classifying images with scikit-learn (HOG and logistic regression)

Classifying textures with Gabor filter banks

Classifying images with VGG19/Inception V3/MobileNet/ResNet101 (with PyTorch)

Fine-tuning (with transfer learning) for image classification

Classifying traffic signs using a deep learning model (with PyTorch)

Estimating a human pose using a deep learning model

Object Detection in Images

Object detection with HOG/SVM

Object detection with Yolo V3

Object detection with Faster R-CNN

Object detection with Mask R-CNN

Multiple object tracking with Python-OpenCV

Text detection/recognition in images with EAST/Tesseract

Face detection with Viola-Jones/Haar-like features

Face Recognition, Image Captioning, and More

Face recognition using FaceNet

Age, gender, and emotion recognition using deep learning models

Image colorization with deep learning

Automatic image captioning with a CNN and an LSTM

Image generation with a GAN

Using a variational autoencoder to reconstruct and generate images

Using a restricted Boltzmann machine to reconstruct Bangla MNIST images

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Creating cartoonish images

In this recipe, you will learn how to create cartoonish flat-textured images from an image. Again, there is more than one way to do the same; here, we will learn how to do it with edge-preserving bilateral filters.

Getting ready

The following libraries need to be imported first:

import cv2
import numpy as np
import matplotlib.pylab as plt

How to do it...

For this recipe, we will be using the bilateralFilter() function from OpenCV-Python. We need to start by downsampling the image to create an image pyramid (you will see more of this in the next chapter), followed by repeated application of small bilateral filters (to remove unimportant details) and upsampling the image to its original size. Next, you need to apply the median blur (to flatten the texture) followed by masking the original image with the binary image obtained by adaptive thresholding. The following code demonstrates the steps:

Read the input image and initialize the parameters to be used later:

img = plt.imread("images/bean.png")

num_down = 2 # number of downsampling steps
num_bilateral = 7 # number of bilateral filtering steps

w, h, _ = img.shape

Use the Gaussian pyramid's downsampling to reduce the image size (and make the subsequent operations faster):

img_color = np.copy(img)
for _ in range(num_down):
 img_color = cv2.pyrDown(img_color)

Apply bilateral filters (with a small diameter value) iteratively. The d parameter represents the diameter of the neighborhood for each pixel, where the sigmaColor and sigmaSpace parameters represent the filter sigma in the color and the coordinate spaces, respectively:

for _ in range(num_bilateral):
 img_color = cv2.bilateralFilter(img_color, d=9, sigmaColor=0.1, sigmaSpace=0.01)

Use upsampling to enlarge the image to the original size:

for _ in range(num_down):
 img_color = cv2.pyrUp(img_color)

Convert to the output image obtained from the last step and blur the image with the median filter:

img_gray = cv2.cvtColor(img, cv2.COLOR_RGB2GRAY)
img_blur = cv2.medianBlur(img_gray, 7)

Detect and enhance the edges:

img_edge = cv2.adaptiveThreshold((255*img_blur).astype(np.uint8), \
            255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, \
            blockSize=9, C=2)

Convert the grayscale edge image back into an RGB color image and compute bitwise AND with the RGB color image to get the final output cartoonish image:

img_edge = cv2.cvtColor(img_edge, cv2.COLOR_GRAY2RGB)
img_cartoon = cv2.bitwise_and(img_color, img_edge)

How it works...

As explained earlier, the bilateralFilter(), medianBlur(), adaptiveThreshold(), and bitwise_and() functions from OpenCV-Python were the key functions used to first remove weak edges, then convert into flat texture, and finally enhance the prominent edges in the image.

The bilateralFilter() function from OpenCV-Python was used to smooth the textures while keeping the edges fairly sharp:

The higher the value of the sigmaColor parameter, the more the pixel colors in the neighborhood will be mixed together. This will produce larger areas of semi-equal color in the output image.
The higher the value of the sigmaSpace parameter, the more the pixels with similar colors will influence each other.

The image was downsampled to create an image pyramid (you will see more of this in the next chapter).

Next, repeated application of small bilateral filters was used to remove unimportant details. A subsequent upsampling was used to resize the image to its original size.

Finally, medianBlur was applied (to flatten the texture) followed by masking the original image with the binary image obtained by adaptive thresholding.

If you run the preceding code, you will get an output cartoonish image, as shown here:

There's more...

Play with the parameter values of the OpenCV functions to see the impact on the output image produced. Also, as mentioned earlier, there is more than one way to achieve the same effect. Try using anisotropic diffusion to obtain flat texture images. You should get an image like the following one (use the anisotropic_diffusion() function from the medpy library):

Python Image Processing Cookbook

By : Sandipan Dey

Python Image Processing Cookbook

By: Sandipan Dey

Overview of this book

Related Content you might be interested in

Current Title:

Python Image Processing Cookbook

Hands-On Image Processing with Python

Computer Vision with Python 3

Computer Vision Projects with OpenCV and Python 3