Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

By : Joseph Howse, Joe Minichino

5 (2)

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

5 (2)

By: Joseph Howse, Joe Minichino

Overview of this book

Computer vision is a rapidly evolving science in the field of artificial intelligence, encompassing diverse use cases and techniques. This book will not only help those who are getting started with computer vision but also experts in the domain. You'll be able to put theory into practice by building apps with OpenCV 5 and Python 3. You'll start by setting up OpenCV 5 with Python 3 on various platforms. Next, you'll learn how to perform basic operations such as reading, writing, manipulating, and displaying images, videos, and camera feeds. From taking you through image processing, video analysis, depth estimation, and segmentation, to helping you gain practice by building a GUI app, this book ensures you'll have opportunities for hands-on activities. You'll tackle two popular challenges: face detection and face recognition. You'll also learn about object classification and machine learning, which will enable you to create and use object detectors and even track moving objects in real time. Later, you'll develop your skills in augmented reality and real-world 3D navigation. Finally, you'll cover ANNs and DNNs, learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age, and you'll deploy your solutions to the Cloud. By the end of this book, you'll have the skills you need to execute real-world computer vision projects.

Free Chapter

Learning OpenCV 5 Computer Vision with Python, Fourth Edition: Tackle tools, techniques, and algorithms for computer vision and machine learning

1 Setting Up OpenCV

Join our book community on Discord

Technical requirements

What's new in OpenCV 5

Optimizing OpenCV for specific hardware

Choosing and using the right setup tools

Running the official sample code

Finding documentation, help, and updates

Finding this book’s sample code

Summary

2 Handling Files, Cameras, and GUIs

Join our book community on Discord

Technical requirements

Basic I/O scripts

Project Cameo (face tracking and image manipulation)

Cameo – an object-oriented design

Summary

3 Processing Images with OpenCV

Join our book community on Discord

Technical requirements

Converting images between different color models

Exploring the Fourier transform

Creating modules

Edge detection

Custom kernels – getting convoluted

Modifying the application

Edge detection with Canny

Contour detection

Detecting lines, circles, and other shapes

Summary

5 Detecting and Recognizing Faces

Join our book community on Discord

Technical requirements

Conceptualizing Haar cascades

Getting Haar cascade data

Using OpenCV to perform face detection

Performing face recognition

Swapping faces in infrared

Summary

6 Retrieving Images and Searching Using Image Descriptors

Join our book community on Discord

Technical requirements

Understanding types of feature detection and matching

Detecting Harris corners

Detecting DoG features and extracting SIFT descriptors

Detecting Fast Hessian features and extracting SURF descriptors

Using ORB with FAST features and BRIEF descriptors

Filtering matches using K-Nearest Neighbors and the ratio test

Matching with FLANN

Finding homography with FLANN-based matches

A sample application – tattoo forensics

Summary

7 Building Custom Object Detectors

Join our book community on Discord

Technical requirements

Understanding HOG descriptors

Understanding NMS

Understanding SVMs

Detecting people with HOG descriptors

Creating and training an object detector

Detecting cars

Summary

8 Tracking Objects

Join our book community on Discord

Technical requirements

Detecting moving objects with background subtraction

Tracking colorful objects using MeanShift and CamShift

Finding trends in motion using the Kalman filter

Tracking pedestrians

Summary

9 Camera Models and Augmented Reality

Join our book community on Discord

Technical requirements

Understanding 3D image tracking and augmented reality

Implementing the demo application

Improving the 3D tracking algorithm

Summary

11 Introduction to Neural Networks with OpenCV

Join our book community on Discord

Technical requirements

Understanding ANNs

Training a basic ANN in OpenCV

Training an ANN classifier in multiple epochs

Recognizing handwritten digits with an ANN

Using DNNs from other frameworks in OpenCV

Detecting and classifying objects with third-party DNNs

Detecting and classifying faces with third-party DNNs

Using OpenCV with MediaPipe and Tensorflow to classify gestures and actions

Summary

12 OpenCV Applications at Scale

Join our book community on Discord

What are Containers?

Docker Basics

AWS Serverless (Fargate, Lambda) and AWS SAM

Conclusion

Appendix A: Bending Color Space with the Curves Filter

Join our book community on Discord

Summary

Customer Reviews

5 (2)

5 star

100%

4 star

3 star

2 star

1 star

Detecting and classifying objects with third-party DNNs

For this demo, we are going to capture frames from a webcam in real-time and use a DNN to detect and classify 20 kinds of objects that may be in any given frame. Yes, a single DNN can do all this in real-time on a typical laptop that a programmer might use!

Before delving into the code, let's introduce the DNN that we will use. It is a Caffe version of a model called MobileNet-SSD, which uses a hybrid of a framework from Google called MobileNet and another framework called Single Shot Detector (SSD) MultiBox. The latter framework has a GitHub repository at https://github.com/weiliu89/caffe/tree/ssd/. The training technique for the Caffe version of MobileNet-SSD is provided by a project on GitHub at https://github.com/chuanqi305/MobileNet-SSD/. Copies of the following MobileNet-SSD files can be found in this book's repository, in the chapter10/objects_data folder:

...

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

By : Joseph Howse, Joe Minichino

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

By: Joseph Howse, Joe Minichino

Overview of this book

Related Content you might be interested in

Current Title:

Learning OpenCV 5 Computer Vision with Python, Fourth Edition - Fourth Edition

Detecting and classifying objects with third-party DNNs