Learning OpenCV 5 Computer Vision with Python, Fourth Edition

By: Joseph Howse, Joe Minichino

Overview of this book

Computer vision is a rapidly evolving science in the field of artificial intelligence, encompassing diverse use cases and techniques. This book will help both newcomers to computer vision and experts in the domain. You'll put theory into practice by building apps with OpenCV 5 and Python 3. You'll start by setting up OpenCV 5 with Python 3 on various platforms. Next, you'll learn how to perform basic operations such as reading, writing, manipulating, and displaying images, videos, and camera feeds. The book takes you through image processing, video analysis, depth estimation, and segmentation, and gives you hands-on practice by building a GUI app. You'll tackle two popular challenges: face detection and face recognition. You'll also learn about object classification and machine learning, which will enable you to create and use object detectors and even track moving objects in real time. Later, you'll develop your skills in augmented reality and real-world 3D navigation. Finally, you'll cover artificial neural networks (ANNs) and deep neural networks (DNNs), learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age, and you'll deploy your solutions to the cloud. By the end of this book, you'll have the skills you need to execute real-world computer vision projects.

Learning OpenCV 5 Computer Vision with Python, Fourth Edition: Tackle tools, techniques, and algorithms for computer vision and machine learning

1 Setting Up OpenCV

Join our book community on Discord

Technical requirements

What's new in OpenCV 5

Optimizing OpenCV for specific hardware

Choosing and using the right setup tools

Running the official sample code

Finding documentation, help, and updates

Finding this book’s sample code

Summary

2 Handling Files, Cameras, and GUIs

Technical requirements

Basic I/O scripts

Project Cameo (face tracking and image manipulation)

Cameo – an object-oriented design

Summary

3 Processing Images with OpenCV

Technical requirements

Converting images between different color models

Exploring the Fourier transform

Creating modules

Edge detection

Custom kernels – getting convoluted

Modifying the application

Edge detection with Canny

Contour detection

Detecting lines, circles, and other shapes

Summary

5 Detecting and Recognizing Faces

Technical requirements

Conceptualizing Haar cascades

Getting Haar cascade data

Using OpenCV to perform face detection

Performing face recognition

Swapping faces in infrared

Summary

6 Retrieving Images and Searching Using Image Descriptors

Technical requirements

Understanding types of feature detection and matching

Detecting Harris corners

Detecting DoG features and extracting SIFT descriptors

Detecting Fast Hessian features and extracting SURF descriptors

Using ORB with FAST features and BRIEF descriptors

Filtering matches using K-Nearest Neighbors and the ratio test

Matching with FLANN

Finding homography with FLANN-based matches

A sample application – tattoo forensics

Summary

7 Building Custom Object Detectors

Technical requirements

Understanding HOG descriptors

Understanding NMS

Understanding SVMs

Detecting people with HOG descriptors

Creating and training an object detector

Detecting cars

Summary

8 Tracking Objects

Technical requirements

Detecting moving objects with background subtraction

Tracking colorful objects using MeanShift and CamShift

Finding trends in motion using the Kalman filter

Tracking pedestrians

Summary

9 Camera Models and Augmented Reality

Technical requirements

Understanding 3D image tracking and augmented reality

Implementing the demo application

Improving the 3D tracking algorithm

Summary

11 Introduction to Neural Networks with OpenCV

Technical requirements

Understanding ANNs

Training a basic ANN in OpenCV

Training an ANN classifier in multiple epochs

Recognizing handwritten digits with an ANN

Using DNNs from other frameworks in OpenCV

Detecting and classifying objects with third-party DNNs

Detecting and classifying faces with third-party DNNs

Using OpenCV with MediaPipe and Tensorflow to classify gestures and actions

Summary

12 OpenCV Applications at Scale

What are Containers?

Docker Basics

AWS Serverless (Fargate, Lambda) and AWS SAM

Conclusion

Appendix A: Bending Color Space with the Curves Filter

Summary

Understanding 3D image tracking and augmented reality

We have already solved problems involving image matching in Chapter 6, Retrieving Images and Searching Using Image Descriptors. Moreover, we have solved problems involving continuous tracking in Chapter 8, Tracking Objects. Therefore, we are familiar with many of the components of an image tracking system, though we have not yet tackled any 3D tracking problems.

So, what exactly is 3D tracking? Well, it is the process of continually updating an estimate of an object's pose (its position and orientation) in a 3D space. Typically, the pose is expressed in terms of six variables: three variables to represent the object's 3D translation (that is, position) and the other three variables to represent its 3D rotation (that is, orientation).
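To make the six variables concrete, here is a minimal sketch in plain Python. It assumes the three rotation variables form an axis-angle rotation vector (the direction is the rotation axis and the length is the angle in radians), which is only one of several possible encodings; the excerpt does not say which representation the book uses. The function names `rotation_matrix_from_rvec` and `transform_point` and the sample pose are hypothetical, chosen for illustration. The conversion uses Rodrigues' rotation formula, the same conversion that OpenCV exposes as `cv2.Rodrigues`.

```python
import math


def rotation_matrix_from_rvec(rvec):
    """Convert an axis-angle rotation vector to a 3x3 rotation matrix
    using Rodrigues' rotation formula."""
    rx, ry, rz = rvec
    theta = math.sqrt(rx * rx + ry * ry + rz * rz)  # rotation angle in radians
    if theta < 1e-12:
        # No meaningful rotation: return the identity matrix.
        return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    kx, ky, kz = rx / theta, ry / theta, rz / theta  # unit rotation axis
    c, s = math.cos(theta), math.sin(theta)
    v = 1.0 - c
    return [
        [c + kx * kx * v, kx * ky * v - kz * s, kx * kz * v + ky * s],
        [ky * kx * v + kz * s, c + ky * ky * v, ky * kz * v - kx * s],
        [kz * kx * v - ky * s, kz * ky * v + kx * s, c + kz * kz * v],
    ]


def transform_point(pose, point):
    """Apply a 6DOF pose (tx, ty, tz, rx, ry, rz) to a 3D point.

    The point is rotated first and then translated: R * p + t.
    """
    tx, ty, tz, rx, ry, rz = pose
    r = rotation_matrix_from_rvec((rx, ry, rz))
    x, y, z = point
    return tuple(
        row[0] * x + row[1] * y + row[2] * z + t
        for row, t in zip(r, (tx, ty, tz))
    )


# Hypothetical pose: translated 1 unit along x, rotated 90 degrees about z.
pose = (1.0, 0.0, 0.0, 0.0, 0.0, math.pi / 2)
moved = transform_point(pose, (1.0, 0.0, 0.0))
print(moved)
```

With this sample pose, the point (1, 0, 0) is first rotated onto the y axis and then shifted along x, so it ends up at approximately (1, 1, 0). A tracker would re-estimate all six numbers for every frame.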

A more technical term for 3D tracking is 6DOF tracking – that is, tracking with 6 degrees of freedom, meaning the 6 variables we just mentioned. With any fewer than 6 variables, it would...
