Learning OpenCV 5 Computer Vision with Python, Fourth Edition

By: Joseph Howse, Joe Minichino

Overview of this book

Computer vision is a rapidly evolving science in the field of artificial intelligence, encompassing diverse use cases and techniques. This book is intended both for those who are getting started with computer vision and for experts in the domain. You'll put theory into practice by building apps with OpenCV 5 and Python 3. You'll start by setting up OpenCV 5 with Python 3 on various platforms. Next, you'll learn how to perform basic operations such as reading, writing, manipulating, and displaying images, videos, and camera feeds. The book then takes you through image processing, video analysis, depth estimation, and segmentation, and gives you hands-on practice by building a GUI app. You'll tackle two popular challenges: face detection and face recognition. You'll also learn about object classification and machine learning, which will enable you to create and use object detectors and even track moving objects in real time. Later, you'll develop your skills in augmented reality and real-world 3D navigation. Finally, you'll cover artificial neural networks (ANNs) and deep neural networks (DNNs), learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age, and you'll deploy your solutions to the cloud. By the end of this book, you'll have the skills you need to execute real-world computer vision projects.

Learning OpenCV 5 Computer Vision with Python, Fourth Edition: Tackle tools, techniques, and algorithms for computer vision and machine learning

1 Setting Up OpenCV

Join our book community on Discord

Technical requirements

What's new in OpenCV 5

Optimizing OpenCV for specific hardware

Choosing and using the right setup tools

Running the official sample code

Finding documentation, help, and updates

Finding this book’s sample code

Summary

2 Handling Files, Cameras, and GUIs

Technical requirements

Basic I/O scripts

Project Cameo (face tracking and image manipulation)

Cameo – an object-oriented design

Summary

3 Processing Images with OpenCV

Technical requirements

Converting images between different color models

Exploring the Fourier transform

Creating modules

Edge detection

Custom kernels – getting convoluted

Modifying the application

Edge detection with Canny

Contour detection

Detecting lines, circles, and other shapes

Summary

5 Detecting and Recognizing Faces

Technical requirements

Conceptualizing Haar cascades

Getting Haar cascade data

Using OpenCV to perform face detection

Performing face recognition

Swapping faces in infrared

Summary

6 Retrieving Images and Searching Using Image Descriptors

Technical requirements

Understanding types of feature detection and matching

Detecting Harris corners

Detecting DoG features and extracting SIFT descriptors

Detecting Fast Hessian features and extracting SURF descriptors

Using ORB with FAST features and BRIEF descriptors

Filtering matches using K-Nearest Neighbors and the ratio test

Matching with FLANN

Finding homography with FLANN-based matches

A sample application – tattoo forensics

Summary

7 Building Custom Object Detectors

Technical requirements

Understanding HOG descriptors

Understanding NMS

Understanding SVMs

Detecting people with HOG descriptors

Creating and training an object detector

Detecting cars

Summary

8 Tracking Objects

Technical requirements

Detecting moving objects with background subtraction

Tracking colorful objects using MeanShift and CamShift

Finding trends in motion using the Kalman filter

Tracking pedestrians

Summary

9 Camera Models and Augmented Reality

Technical requirements

Understanding 3D image tracking and augmented reality

Implementing the demo application

Improving the 3D tracking algorithm

Summary

11 Introduction to Neural Networks with OpenCV

Technical requirements

Understanding ANNs

Training a basic ANN in OpenCV

Training an ANN classifier in multiple epochs

Recognizing handwritten digits with an ANN

Using DNNs from other frameworks in OpenCV

Detecting and classifying objects with third-party DNNs

Detecting and classifying faces with third-party DNNs

Using OpenCV with MediaPipe and Tensorflow to classify gestures and actions

Summary

12 OpenCV Applications at Scale

What are Containers?

Docker Basics

AWS Serverless (Fargate, Lambda) and AWS SAM

Conclusion

Appendix A: Bending Color Space with the Curves Filter

Summary

Understanding NMS

The concept of non-maximum suppression (NMS) might sound simple: from a set of overlapping solutions, just pick the best one! However, the implementation is more complex than you might initially think. Remember the image pyramid? Overlapping detections can occur at different scales, so we must gather all our positive detections and convert their bounds back to a common scale before we check for overlap. A typical implementation of NMS takes the following approach:

1. Construct an image pyramid.
2. Scan each level of the pyramid for objects, using the sliding window approach. For each window that yields a positive detection (beyond a certain arbitrary confidence threshold), convert the window back to the original image's scale. Add the window and its confidence score to a list of positive detections.
3. Sort the list of positive detections in order of descending confidence score so that the best detections come first in the list.
4. For each window, W, in the list of positive detections, remove...
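The steps above can be sketched in plain Python. This is only an illustrative sketch, not the book's implementation. The final step is truncated in this excerpt, so the removal rule used here is an assumption: a common greedy variant that discards a lower-scoring window when its intersection over union (IoU) with an already kept window exceeds a threshold. The function names, the `(x, y, w, h)` box layout, the `scale` convention, and the default threshold of 0.5 are all hypothetical choices made for this example.

```python
def to_original_scale(window, scale):
    """Map an (x, y, w, h) window found on a pyramid level back to the
    original image. Assumed convention: scale = level size / original size."""
    x, y, w, h = window
    return (x / scale, y / scale, w / scale, h / scale)


def iou(a, b):
    """Intersection over union of two (x, y, w, h) boxes."""
    ax2, ay2 = a[0] + a[2], a[1] + a[3]
    bx2, by2 = b[0] + b[2], b[1] + b[3]
    # Width and height of the overlapping region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(a[0], b[0]))
    inter_h = max(0.0, min(ay2, by2) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def nms(detections, overlap_threshold=0.5):
    """Greedy NMS over a list of (box, score) pairs, all in a common scale.

    Assumed rule: keep a window only if its IoU with every window kept
    so far is at most overlap_threshold.
    """
    # Best detections first, as in the sorting step.
    ordered = sorted(detections, key=lambda d: d[1], reverse=True)
    kept = []
    for box, score in ordered:
        if all(iou(box, kept_box) <= overlap_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


# Usage: two heavily overlapping windows and one separate window.
detections = [
    ((10, 10, 100, 100), 0.9),
    ((12, 12, 100, 100), 0.8),
    ((300, 300, 50, 50), 0.7),
]
print(nms(detections))
```

In this usage example, the second window overlaps the first almost entirely and has a lower score, so it is dropped, while the third window does not overlap anything and is kept. OpenCV also provides its own NMS routine, `cv2.dnn.NMSBoxes`, which may be preferable in practice.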
