Raspberry Pi Computer Vision Programming - Second Edition

By : Ashwin Pajankar

Overview of this book

Raspberry Pi is one of the most popular single-board computers of our generation. All the major image processing and computer vision algorithms and operations can be implemented easily with OpenCV on Raspberry Pi. This updated second edition is packed with cutting-edge examples and new topics, and covers the latest versions of key technologies such as Python 3, Raspberry Pi, and OpenCV. This book will equip you with the skills required to successfully design and implement your own OpenCV, Raspberry Pi, and Python-based computer vision projects. At the start, you'll learn the basics of Python 3, and the fundamentals of single-board computers and NumPy. Next, you'll discover how to install OpenCV 4 for Python 3 on Raspberry Pi, before covering major techniques and algorithms in image processing, manipulation, and computer vision. By working through the steps in each chapter, you'll understand essential OpenCV features. Later sections will take you through creating graphical user interface (GUI) apps with GPIO and OpenCV. You'll also learn to use the computer vision library Mahotas to perform various image processing operations. Finally, you'll explore the Jupyter Notebook and learn how to set up Windows and Ubuntu computers for computer vision. By the end of this book, you'll be able to confidently build and deploy computer vision apps.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Code in Action

Download the color images

Conventions used

Get in touch

Reviews

Chapter 1: Introduction to Computer Vision and the Raspberry Pi

Understanding computer vision

Single-board computers

Raspberry Pi

OSes for Raspberry Pi

Setting up Raspbian on a Raspberry Pi

Updating the RPi

Summary

Chapter 2: Preparing the Raspberry Pi for Computer Vision

Remotely logging into the RPi with SSH

Remote desktop access

Installing OpenCV on an RPi board

Heatsinks and overclocking RPi 4B

Summary

Chapter 3: Introduction to Python Programming

Technical requirements

Understanding Python 3

The SciPy ecosystem

RPi GPIO programming with Python 3

Summary

Chapter 4: Getting Started with Computer Vision

Technical requirements

Exploring image datasets

Working with images using OpenCV

Using matplotlib to visualize images

Drawing geometric shapes with OpenCV and NumPy

Working with a GUI

Event handling and a primitive paint application

Working with a USB webcam

The Pi camera module

Summary

Chapter 5: Basics of Image Processing

Technical requirements

Retrieving image properties

Basic operations on images

Arithmetic operations on images

Blending and transitioning images

Multiplying images by a constant and one another

Creating a negative of an image

Bitwise logical operations on images

Summary

Chapter 6: Colorspaces, Transformations, and Thresholding

Technical requirements

Colorspaces and converting them

Performing transformation operations on images

Perspective transformation of images

Thresholding images

Summary

Chapter 7: Let's Make Some Noise

Technical requirements

Noise

Working with kernels

2D convolution with the signal processing module in SciPy

Filtering and blurring with OpenCV

Summary

Chapter 8: High-Pass Filters and Feature Detection

Technical requirements

Exploring high-pass filters

Working with the Canny edge detector

Harris corner detection

Exercise

Summary

Chapter 9: Image Restoration, Segmentation, and Depth Maps

Technical requirements

Restoring damaged images using inpainting

Segmenting images

Disparity maps and depth estimation

Summary

Chapter 10: Histograms, Contours, and Morphological Transformations

Technical requirements

Computing and visualizing histograms

Visualizing image contours

Applying morphological transformations to images

Summary

Chapter 11: Real-Life Applications of Computer Vision

Technical requirements

Implementing the Max RGB filter

Implementing background subtraction

Computing the optical flow

Detecting and tracking motion

Detecting barcodes in images

Implementing the chroma key effect

Summary

Chapter 12: Working with Mahotas and Jupyter

Technical requirements

Processing images with Mahotas

Combining Mahotas and OpenCV

Exploring the Jupyter Notebook for Python 3 programming

Summary

Chapter 13: Appendix

Technical requirements

Performance measurement and the management of OpenCV

Reusing a Raspbian OS microSD card

Tour of the raspi-config command-line utility

Installation and the environment setup on Windows, Debian, and Ubuntu

Python implementations and Python distributions

Other Books You May Enjoy

Leave a review - let other readers know what you think

Understanding computer vision

The field of computer vision is a combination of different fields, including (but not limited to) computer science, mathematics, and electrical engineering. It includes ways to capture, process, and analyze images and videos from the real world in order to assist in decision making. Computer vision means mimicking biological (that is, human and non-human) vision. The end goal of most computer vision systems is to extract useful information from still images and videos (including prerecorded videos and live feeds) for the purpose of decision making. Biological vision systems work in a similar fashion. Unlike biological vision, however, computer vision can also acquire and work with kinds of images that biological entities cannot perceive directly, for example, infrared and depth images.

Computer vision also relates to the area of extracting information from captured images and videos. A computer vision system may accept various types of data, such as images, videos, and live video streams, as inputs to further process, analyze, and extract meaningful information for the purpose of making important decisions.
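To make the capture, process, analyze, and decide stages concrete, here is a minimal sketch in pure Python. It is only an illustration under stated assumptions: the 4x4 "frame" is a hypothetical synthetic grayscale image, and the function names, the threshold of 128, and the 20% cutoff are all invented for this example rather than taken from any library.

```python
# A toy capture -> process -> analyze -> decide pipeline in pure Python.
# The "image" is a hypothetical 4x4 grayscale frame (0 = black, 255 = white).

def capture():
    """Stand-in for a camera: return a small synthetic grayscale frame."""
    return [
        [10, 12, 11, 9],
        [10, 200, 210, 9],
        [11, 205, 220, 10],
        [9, 10, 12, 11],
    ]

def process(frame, threshold=128):
    """Binarize the frame: 1 where a pixel is brighter than the threshold."""
    return [[1 if pixel > threshold else 0 for pixel in row] for row in frame]

def analyze(mask):
    """Measure the fraction of pixels that are bright."""
    total = sum(len(row) for row in mask)
    bright = sum(sum(row) for row in mask)
    return bright / total

def decide(bright_fraction, min_fraction=0.2):
    """Turn the measurement into a decision (the cutoff is an assumption)."""
    return "object present" if bright_fraction >= min_fraction else "scene empty"

frame = capture()
mask = process(frame)
fraction = analyze(mask)
print(fraction, decide(fraction))
```

A real system would replace capture() with a camera or video source and use far richer processing, but the overall flow from raw pixels to a decision stays the same.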

The fields of artificial intelligence, machine vision, and computer vision overlap and share many topics, such as image processing, pattern recognition, and machine learning, as depicted in the following diagram:

Figure 1.1 – The relationships between different scientific domains

In order to work as a researcher in the area of computer vision, you need to have a solid background and understanding of mathematics. However, to write programs for computer vision using OpenCV and Python 3, you don't need to know a lot of mathematics. Note that, in this book, you will be learning all of the mathematical and theoretical concepts required to get started with image processing and computer vision.

The typical objectives of a computer vision system could be one or more of the following:

The recognition of objects, the classification of detected objects, and the analysis of motion
The reconstruction of scenes using images
Image denoising and restoration
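As a small taste of the last objective in the list above, the following sketch removes a single "salt" noise pixel with a 3x3 median filter. It is written in pure Python for illustration only; the median_filter function and the 5x5 test image are hypothetical, and leaving the border pixels unchanged is a simplifying assumption.

```python
from statistics import median

def median_filter(image):
    """Return a copy where each interior pixel is the median of its 3x3 neighborhood."""
    height, width = len(image), len(image[0])
    result = [row[:] for row in image]  # border pixels are left unchanged
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            window = [
                image[y + dy][x + dx]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
            ]
            result[y][x] = median(window)
    return result

# A flat gray image with one bright noise pixel in the center.
noisy = [
    [50, 50, 50, 50, 50],
    [50, 50, 50, 50, 50],
    [50, 50, 255, 50, 50],
    [50, 50, 50, 50, 50],
    [50, 50, 50, 50, 50],
]
restored = median_filter(noisy)
print(restored[2][2])
```

The median ignores a single outlier in each neighborhood, which is why this kind of filter suits isolated noise pixels better than a plain average would.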

Do not get stressed if you are unfamiliar with these key terms. We will explore and implement many of these concepts throughout our journey.

OpenCV

OpenCV (also known as Open Source Computer Vision) is an open source library for computer vision and machine learning. It offers a large collection of functions for image processing, computer vision, and machine learning. It is a cross-platform library, and it works with many programming languages and OSes. It also has several Graphical User Interface (GUI) and event handling features.

OpenCV is free for academic and commercial usage as it is under the Berkeley Software Distribution (BSD) license. It is written in the C++ programming language. It has interfaces for most of the popular programming languages, including (but not limited to) C/C++, Python, and Java. It runs on a variety of OSes, including Windows, Android, Linux, macOS, and other Unix-like OSes. In this book, we will write computer vision-related programs with OpenCV and Python 3.

The library has more than 2,500 optimized algorithms for machine learning and computer vision tasks. It has a community of more than 47,000 computer vision professionals, and it has been downloaded more than 18 million times. OpenCV is used extensively in academia for teaching, and in research organizations, government organizations, and various industry segments. Reputable organizations such as Google, Yahoo, Microsoft, Intel, IBM, Sony, Honda, and Toyota all use OpenCV.

Let's take a look at the history of OpenCV. OpenCV was originally an in-house initiative of Intel Research to develop a framework for working with images and videos. It was later supported by Willow Garage and then Itseez.

Note

You can visit the website of Willow Garage at http://www.willowgarage.com/.

In August 2012, the responsibility for further development and support for OpenCV was assumed by an independent not-for-profit organization, OpenCV.org. It maintains the website for OpenCV. In May 2016, Intel acquired Itseez. The following URLs have the press announcement from Intel and OpenCV.org:

Here's a brief timeline of the developments related to OpenCV:

Figure 1.2 – Timeline of OpenCV

You can find all the details, including different versions and press releases of the OpenCV library, at https://opencv.org/.

As we will be writing computer vision programs with Raspberry Pi as the platform, we will study single-board computers and Raspberry Pi in detail. We will learn how to set up the Raspbian OS on various models of a Raspberry Pi single-board computer.
