Book Image

OpenCV 4 with Python Blueprints - Second Edition

By : Dr. Menua Gevorgyan, Arsen Mamikonyan, Michael Beyeler
Book Image

OpenCV 4 with Python Blueprints - Second Edition

By: Dr. Menua Gevorgyan, Arsen Mamikonyan, Michael Beyeler

Overview of this book

OpenCV is a native cross-platform C++ library for computer vision, machine learning, and image processing. It is increasingly being adopted in Python for development. This book will get you hands-on with a wide range of intermediate to advanced projects using the latest version of the framework and language, OpenCV 4 and Python 3.8, instead of only covering the core concepts of OpenCV in theoretical lessons. This updated second edition will guide you through working on independent hands-on projects that focus on essential OpenCV concepts such as image processing, object detection, image manipulation, object tracking, and 3D scene reconstruction, in addition to statistical learning and neural networks. You’ll begin with concepts such as image filters, Kinect depth sensor, and feature matching. As you advance, you’ll not only get hands-on with reconstructing and visualizing a scene in 3D but also learn to track visually salient objects. The book will help you further build on your skills by demonstrating how to recognize traffic signs and emotions on faces. Later, you’ll understand how to align images, and detect and track objects using neural networks. By the end of this OpenCV Python book, you’ll have gained hands-on experience and become proficient at developing advanced computer vision apps according to specific business needs.
Table of Contents (14 chapters)
11
Profiling and Accelerating Your Apps
12
Setting Up a Docker Container

Learning feature extraction

Generally speaking, in machine learning, feature extraction is a process of dimensionality reduction of the data that results in an informative description of a data element.

In computer vision, a feature is usually an interesting area of an image. It is a measurable property of an image that is very informative about what the image represents. Usually, the grayscale value of an individual pixel (that is, the raw data) does not tell us a lot about the image as a whole. Instead, we need to derive a property that is more informative.

For example, knowing that there are patches in the image that look like eyes, a nose, and a mouth will allow us to reason about how likely it is that the image represents a face. In this case, the number of resources required to describe the data is drastically reduced. The data refers to, for example, whether we are seeing...