Computer Vision Projects with OpenCV and Python 3

By: Matthew Rever

Overview of this book

With its robust syntax and wealth of powerful libraries, Python is the ideal programming language for rapidly prototyping and developing production-grade code for image processing and Computer Vision. This book will help you design and develop production-grade Computer Vision projects that tackle real-world problems. With the help of this book, you will learn how to set up Anaconda and Python for the major OSes with cutting-edge third-party libraries for Computer Vision. You'll learn state-of-the-art techniques for classifying images, finding and identifying human postures, and detecting faces within videos. You will use powerful machine learning tools such as OpenCV, Dlib, and TensorFlow to build exciting projects such as classifying handwritten digits, detecting facial features, and much more. The book also covers some advanced projects, such as reading text from license plates in real-world images using Google's Tesseract software, and tracking human body poses using DeeperCut within TensorFlow. By the end of this book, you will have the expertise required to build your own Computer Vision projects using Python and its associated libraries.

Google Brain im2txt captioning model

Google Brain's im2txt model was used by Google in its paper for the 2015 MSCOCO Image Captioning Challenge, and it will form the foundation of the image captioning code that we will implement in our project.

The code is hosted in Google's TensorFlow models repository on GitHub, at https://github.com/tensorflow/models/tree/master/research/im2txt.

In the research directory, we will find the im2txt directory, which contains the code Google used in its paper on the 2015 MSCOCO Image Captioning Challenge, available for free at https://arxiv.org/abs/1609.06647. The paper covers RNNs, LSTMs, and the underlying algorithms in detail.

We can see how CNNs are used for image classification, and also learn how LSTM RNNs are used to actually generate the sequential caption output.
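
To make this encoder-decoder idea concrete, here is a minimal sketch, not Google's im2txt code itself, of how a CNN image encoder can be paired with an LSTM decoder using TensorFlow's Keras API; the choice of InceptionV3, the vocabulary size, embedding size, and caption length are illustrative assumptions:

import tensorflow as tf

VOCAB_SIZE = 10000    # assumed vocabulary size
EMBED_DIM = 256       # assumed embedding size for image and word features
MAX_CAPTION_LEN = 20  # assumed maximum caption length

# CNN encoder: a pre-trained classification network used as a feature extractor.
cnn = tf.keras.applications.InceptionV3(include_top=False, pooling='avg',
                                        weights='imagenet')
image_input = tf.keras.Input(shape=(299, 299, 3))
image_features = tf.keras.layers.Dense(EMBED_DIM)(cnn(image_input))

# LSTM decoder: the image embedding is fed in as the first time step,
# followed by the embedded words of the caption generated so far.
caption_input = tf.keras.Input(shape=(MAX_CAPTION_LEN,))
word_embeddings = tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM)(caption_input)
image_step = tf.keras.layers.Reshape((1, EMBED_DIM))(image_features)
decoder_input = tf.keras.layers.Concatenate(axis=1)([image_step, word_embeddings])
lstm_out = tf.keras.layers.LSTM(512, return_sequences=True)(decoder_input)

# At each time step, the model predicts a distribution over the next word.
next_word_logits = tf.keras.layers.Dense(VOCAB_SIZE)(lstm_out)
model = tf.keras.Model([image_input, caption_input], next_word_logits)

In Google's actual model, the image encoder is Inception v3 and captions are generated at inference time with a beam search over the LSTM's word probabilities.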

We can download the code from the GitHub link; however, it is not set up to run out of the box, as it does not include a pre-trained model...
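
As a rough illustration of what having a pre-trained model would mean in practice, the sketch below, which is not part of the im2txt scripts, shows how a downloaded TensorFlow checkpoint could be restored before generating captions; the checkpoint path is hypothetical:

import tensorflow.compat.v1 as tf
tf.disable_eager_execution()

CHECKPOINT_PATH = '/path/to/model.ckpt-2000000'  # hypothetical checkpoint location

with tf.Session() as sess:
    # Rebuild the saved graph from its .meta file, then restore the trained weights.
    saver = tf.train.import_meta_graph(CHECKPOINT_PATH + '.meta')
    saver.restore(sess, CHECKPOINT_PATH)
    # With the weights restored, the graph's output tensors can be run to
    # produce captions for new images.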