Book Image

Computer Vision Projects with OpenCV and Python 3

By : Matthew Rever
Book Image

Computer Vision Projects with OpenCV and Python 3

By: Matthew Rever

Overview of this book

Python is the ideal programming language for rapidly prototyping and developing production-grade codes for image processing and Computer Vision with its robust syntax and wealth of powerful libraries. This book will help you design and develop production-grade Computer Vision projects tackling real-world problems. With the help of this book, you will learn how to set up Anaconda and Python for the major OSes with cutting-edge third-party libraries for Computer Vision. You'll learn state-of-the-art techniques for classifying images, finding and identifying human postures, and detecting faces within videos. You will use powerful machine learning tools such as OpenCV, Dlib, and TensorFlow to build exciting projects such as classifying handwritten digits, detecting facial features,and much more. The book also covers some advanced projects, such as reading text from license plates from real-world images using Google’s Tesseract software, and tracking human body poses using DeeperCut within TensorFlow. By the end of this book, you will have the expertise required to build your own Computer Vision projects using Python and its associated libraries.
Table of Contents (9 chapters)

What this book covers

Chapter 1, Setting Up an Anaconda Environment, helps you download and install Python 3 and Anaconda along with their additional libraries, and also discusses the basic concepts of Jupyter Notebook.
Chapter 2, Image Captioning with TensorFlow, introduces you to image captioning using the Google Brain im2txt captioning model, which is a pre-defined model. We will also learn the process of retraining the model for our own customized images.
Chapter 3, Reading License Plates with OpenCV, introduces you to reading license plates using the plate utility functions. We learn the process of finding the possible candidates for our license plate characters, which is key to reading license plates.
Chapter 4, Human Pose Estimation with TensorFlow, introduces you to pose estimation using the DeeperCut algorithm and the pre-defined ArtTrack model. You will learn about single-person and multi-person pose detection, and you'll learn how to retrain the model for images and videos.

Chapter 5, Handwritten Digit Recognition with scikit-learn and TensorFlow, helps you acquire and process MNIST digit data. You will learn how to create and train a support vector machine, and also learn about digit classification using TensorFlow.
Chapter 6, Facial Feature Tracking and Classification with dlib, helps you detect facial features from images and videos, which helps us carry out facial recognition.
Chapter 7, Deep Learning Image Classification with TensorFlow, helps you learn image classification using a pre-trained Inception model. The chapter also teaches you how to retrain the model for customized images.