Book Image

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

By : Joseph Howse, Joe Minichino

5 (2)

Book Image

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

5 (2)

By: Joseph Howse, Joe Minichino

Overview of this book

Computer vision is a rapidly evolving science in the field of artificial intelligence, encompassing diverse use cases and techniques. This book will not only help those who are getting started with computer vision but also experts in the domain. You'll be able to put theory into practice by building apps with OpenCV 5 and Python 3. You'll start by setting up OpenCV 5 with Python 3 on various platforms. Next, you'll learn how to perform basic operations such as reading, writing, manipulating, and displaying images, videos, and camera feeds. From taking you through image processing, video analysis, depth estimation, and segmentation, to helping you gain practice by building a GUI app, this book ensures you'll have opportunities for hands-on activities. You'll tackle two popular challenges: face detection and face recognition. You'll also learn about object classification and machine learning, which will enable you to create and use object detectors and even track moving objects in real time. Later, you'll develop your skills in augmented reality and real-world 3D navigation. Finally, you'll cover ANNs and DNNs, learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age, and you'll deploy your solutions to the Cloud. By the end of this book, you'll have the skills you need to execute real-world computer vision projects.

Free Chapter

Learning OpenCV 5 Computer Vision with Python, Fourth Edition: Tackle tools, techniques, and algorithms for computer vision and machine learning

Learning OpenCV 5 Computer Vision with Python, Fourth Edition: Tackle tools, techniques, and algorithms for computer vision and machine learning

1 Setting Up OpenCV

Join our book community on Discord

Technical requirements

What's new in OpenCV 5

Optimizing OpenCV for specific hardware

Choosing and using the right setup tools

Running the official sample code

Finding documentation, help, and updates

Finding this book’s sample code

2 Handling Files, Cameras, and GUIs

Join our book community on Discord

Technical requirements

Basic I/O scripts

Project Cameo (face tracking and image manipulation)

Cameo – an object-oriented design

3 Processing Images with OpenCV

Join our book community on Discord

Technical requirements

Converting images between different color models

Exploring the Fourier transform

Creating modules

Custom kernels – getting convoluted

Modifying the application

Edge detection with Canny

Contour detection

Detecting lines, circles, and other shapes

5 Detecting and Recognizing Faces

Join our book community on Discord

Technical requirements

Conceptualizing Haar cascades

Getting Haar cascade data

Using OpenCV to perform face detection

Performing face recognition

Swapping faces in infrared

6 Retrieving Images and Searching Using Image Descriptors

Join our book community on Discord

Technical requirements

Understanding types of feature detection and matching

Detecting Harris corners

Detecting DoG features and extracting SIFT descriptors

Detecting Fast Hessian features and extracting SURF descriptors

Using ORB with FAST features and BRIEF descriptors

Filtering matches using K-Nearest Neighbors and the ratio test

Matching with FLANN

Finding homography with FLANN-based matches

A sample application – tattoo forensics

7 Building Custom Object Detectors

Join our book community on Discord

Technical requirements

Understanding HOG descriptors

Understanding NMS

Understanding SVMs

Detecting people with HOG descriptors

Creating and training an object detector

8 Tracking Objects

Join our book community on Discord

Technical requirements

Detecting moving objects with background subtraction

Tracking colorful objects using MeanShift and CamShift

Finding trends in motion using the Kalman filter

Tracking pedestrians

9 Camera Models and Augmented Reality

Join our book community on Discord

Technical requirements

Understanding 3D image tracking and augmented reality

Implementing the demo application

Improving the 3D tracking algorithm

11 Introduction to Neural Networks with OpenCV

Join our book community on Discord

Technical requirements

Understanding ANNs

Training a basic ANN in OpenCV

Training an ANN classifier in multiple epochs

Recognizing handwritten digits with an ANN

Using DNNs from other frameworks in OpenCV

Detecting and classifying objects with third-party DNNs

Detecting and classifying faces with third-party DNNs

Using OpenCV with MediaPipe and Tensorflow to classify gestures and actions

12 OpenCV Applications at Scale

Join our book community on Discord

What are Containers?

AWS Serverless (Fargate, Lambda) and AWS SAM

Appendix A: Bending Color Space with the Curves Filter

Join our book community on Discord

Customer Reviews

5 (2)

5 star

100%

4 star

0

3 star

0

2 star

0

1 star

0

Implementing the demo application

We are going to implement our demo in a single script, ImageTrackingDemo.py, which will contain the following components:

Import statements
A helper function for a custom grayscale conversion
Helper functions to convert keypoints from 2D to 3D space
An application class, ImageTrackingDemo, which will encapsulate a model of the camera and lens, a model of the reference image, a Kalman filter, 6DOF tracking results (including the translation and both the Rodrigues and Euler representations of the rotation), and an application loop that will track the image and draw a simple AR visualization
A main function to launch the application

The script will depend on one other file, reference_image.png, which will represent the image that we want to track.

By preparing a reference image in advance, and by loading it from file at runtime, we can ensure that its technical qualities are good: it has a high resolution (important for close-up tracking), it is properly...