Depth Estimation and Segmentation | Learning OpenCV 4 Computer Vision with Python 3


Learning OpenCV 4 Computer Vision with Python 3 - Third Edition

By: Joseph Howse, Joe Minichino


Overview of this book

Computer vision is a rapidly evolving science, encompassing diverse applications and techniques. This book will not only help those who are getting started with computer vision but also experts in the domain. You’ll be able to put theory into practice by building apps with OpenCV 4 and Python 3. You’ll start by understanding OpenCV 4 and how to set it up with Python 3 on various platforms. Next, you’ll learn how to perform basic operations such as reading, writing, manipulating, and displaying still images, videos, and camera feeds. From taking you through image processing, video analysis, and depth estimation and segmentation, to helping you gain practice by building a GUI app, this book ensures you’ll have opportunities for hands-on activities. Next, you’ll tackle two popular challenges: face detection and face recognition. You’ll also learn about object classification and machine learning concepts, which will enable you to create and use object detectors and classifiers, and even track objects in movies or video camera feeds. Later, you’ll develop your skills in 3D tracking and augmented reality. Finally, you’ll cover ANNs and DNNs, learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age. By the end of this book, you’ll have the skills you need to execute real-world computer vision projects.

Summary

In this chapter, we learned how to analyze simple spatial relationships within images so that we can differentiate between multiple objects, or between a foreground and a background. Our techniques included extraction of three-dimensional information from a two-dimensional input (a video frame or an image). First, we examined depth cameras, and then epipolar geometry and stereo images, so we are now able to calculate disparity maps. Finally, we looked at image segmentation with two of the most popular methods: GrabCut and Watershed.
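
As a brief reminder of why a disparity map carries three-dimensional information: for a rectified stereo pair, the depth of a point is inversely proportional to its disparity, Z = f * B / d, where f is the focal length in pixels, B is the baseline between the two cameras, and d is the disparity in pixels. The following is a minimal sketch in plain Python, not code from the chapter. The function name `depth_from_disparity` and the calibration values are hypothetical and chosen only for illustration; a real application would take them from camera calibration and would read the disparities from a map computed with OpenCV.

```python
def depth_from_disparity(disparity_px, focal_length_px, baseline_m):
    """Return the depth in meters for one pixel of a rectified stereo pair.

    Uses Z = f * B / d. A disparity of zero or less means that there is
    no valid match (or the point is at infinity), so None is returned.
    """
    if disparity_px <= 0:
        return None
    return focal_length_px * baseline_m / disparity_px


# Hypothetical calibration values, assumed only for this illustration.
FOCAL_LENGTH_PX = 700.0
BASELINE_M = 0.1

# A tiny 2x3 "disparity map" in pixels; 0 marks an unmatched pixel.
disparity_map = [
    [70.0, 35.0, 0.0],
    [14.0, 7.0, 3.5],
]

# Convert every disparity to a depth, keeping the same 2D layout.
depth_map = [
    [depth_from_disparity(d, FOCAL_LENGTH_PX, BASELINE_M) for d in row]
    for row in disparity_map
]

for row in depth_map:
    print(row)
```

Halving the disparity doubles the estimated depth, which is why nearby objects appear bright and distant objects appear dark when a disparity map is displayed as a grayscale image.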

As we progress through this book, we will continue to extract increasingly complex information from images. Next, we are ready to explore OpenCV's functionality for detection and recognition of faces and other objects.
