Introduction to Neural Networks with OpenCV | Learning OpenCV 4 Computer Vision with Python 3

Book Overview & Buying
Table Of Contents

Learning OpenCV 4 Computer Vision with Python 3 - Third Edition

By : Joseph Howse, Joe Minichino

4.1 (14)

Buy this Book

Learning OpenCV 4 Computer Vision with Python 3

4.1 (14)

By: Joseph Howse, Joe Minichino

Buy this Book

Overview of this book

Computer vision is a rapidly evolving science, encompassing diverse applications and techniques. This book will not only help those who are getting started with computer vision but also experts in the domain. You’ll be able to put theory into practice by building apps with OpenCV 4 and Python 3. You’ll start by understanding OpenCV 4 and how to set it up with Python 3 on various platforms. Next, you’ll learn how to perform basic operations such as reading, writing, manipulating, and displaying still images, videos, and camera feeds. From taking you through image processing, video analysis, and depth estimation and segmentation, to helping you gain practice by building a GUI app, this book ensures you’ll have opportunities for hands-on activities. Next, you’ll tackle two popular challenges: face detection and face recognition. You’ll also learn about object classification and machine learning concepts, which will enable you to create and use object detectors and classifiers, and even track objects in movies or video camera feed. Later, you’ll develop your skills in 3D tracking and augmented reality. Finally, you’ll cover ANNs and DNNs, learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age. By the end of this book, you’ll have the skills you need to execute real-world computer vision projects.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Setting Up OpenCV

Technical requirements

What's new in OpenCV 4

Choosing and using the right setup tools

Running samples

Finding documentation, help, and updates

Summary

Free Chapter

Handling Files, Cameras, and GUIs

Technical requirements

Basic I/O scripts

Project Cameo (face tracking and image manipulation)

Cameo – an object-oriented design

Summary

Processing Images with OpenCV

Technical requirements

Converting images between different color models

Exploring the Fourier transform

Creating modules

Edge detection

Custom kernels – getting convoluted

Modifying the application

Edge detection with Canny

Contour detection

Detecting lines, circles, and other shapes

Summary

Depth Estimation and Segmentation

Technical requirements

Creating modules

Capturing frames from a depth camera

Converting 10-bit images to 8-bit

Creating a mask from a disparity map

Modifying the application

Depth estimation with a normal camera

Foreground detection with the GrabCut algorithm

Image segmentation with the Watershed algorithm

Summary

Detecting and Recognizing Faces

Technical requirements

Conceptualizing Haar cascades

Getting Haar cascade data

Using OpenCV to perform face detection

Swapping faces in the infrared

Summary

Retrieving Images and Searching Using Image Descriptors

Technical requirements

Understanding types of feature detection and matching

Detecting Harris corners

Detecting DoG features and extracting SIFT descriptors

Detecting Fast Hessian features and extracting SURF descriptors

Using ORB with FAST features and BRIEF descriptors

Filtering matches using K-Nearest Neighbors and the ratio test

Matching with FLANN

Performing homography with FLANN-based matches

A sample application – tattoo forensics

Summary

Building Custom Object Detectors

Technical requirements

Understanding HOG descriptors

Understanding NMS

Understanding SVMs

Detecting people with HOG descriptors

Creating and training an object detector

Detecting cars

Summary

Tracking Objects

Technical requirements

Detecting moving objects with background subtraction

Tracking colorful objects using MeanShift and CamShift

Finding trends in motion using the Kalman filter

Tracking pedestrians

Summary

Camera Models and Augmented Reality

Technical requirements

Understanding 3D image tracking and augmented reality

Implementing the demo application

Improving the 3D tracking algorithm

Summary

Introduction to Neural Networks with OpenCV

Technical requirements

Understanding ANNs

Training a basic ANN in OpenCV

Training an ANN classifier in multiple epochs

Recognizing handwritten digits with an ANN

Using DNNs from other frameworks in OpenCV

Detecting and classifying objects with third-party DNNs

Detecting and classifying faces with third-party DNNs

Summary

Other Book You May Enjoy

Leave a review - let other readers know what you think

Appendix A: Bending Color Space with the Curves Filter

Formulating a curve

Caching and applying a curve

Designing object-oriented curve filters

Emulating photo films

Summary

This chapter scratched the surface of the vast and fascinating world of ANNs. We learned about the structure of ANNs, and how to design a network topology based on application requirements. Then, we focused on OpenCV's implementation of MLP ANNs, as well as on OpenCV's support for diverse DNNs that have been trained in other frameworks.

We applied neural networks to real-world problems: notably, handwritten digit recognition; object detection and classification; and a combination of face detection, age classification, and gender classification in real time. We saw that even in these introductory demos, neural networks show a lot of promise in terms of versatility, accuracy, and speed. Hopefully, this encourages you to try out pre-trained models from various authors, and to learn to train advanced models of your own in various frameworks.

With this thought,...

Tech Concepts

Programming languages

Tech Tools