Book Image

Computer Vision Projects with OpenCV and Python 3

By : Matthew Rever
Book Image

Computer Vision Projects with OpenCV and Python 3

By: Matthew Rever

Overview of this book

Python is the ideal programming language for rapidly prototyping and developing production-grade codes for image processing and Computer Vision with its robust syntax and wealth of powerful libraries. This book will help you design and develop production-grade Computer Vision projects tackling real-world problems. With the help of this book, you will learn how to set up Anaconda and Python for the major OSes with cutting-edge third-party libraries for Computer Vision. You'll learn state-of-the-art techniques for classifying images, finding and identifying human postures, and detecting faces within videos. You will use powerful machine learning tools such as OpenCV, Dlib, and TensorFlow to build exciting projects such as classifying handwritten digits, detecting facial features,and much more. The book also covers some advanced projects, such as reading text from license plates from real-world images using Google’s Tesseract software, and tracking human body poses using DeeperCut within TensorFlow. By the end of this book, you will have the expertise required to build your own Computer Vision projects using Python and its associated libraries.
Table of Contents (9 chapters)

An introduction to TensorFlow

In this chapter, we'll go deeper into TensorFlow and see how we can build a general-purpose image classifier using its deep learning method.

This will be an extension of what we learned in Chapter 2, Handwritten Digit Recognition with scikit-learn and TensorFlow, where we learned how to classify handwritten digits. However, this method is quite a bit more powerful, as it will work on general images of people, animals, food, everyday objects, and so on.

To start, let's talk a little bit about what TensorFlow does, and the general workflow of TensorFlow.

To begin, what is a tensor? Wikipedia states this:

"In mathematics, tensors are geometric objects that describe linear relations between geometric vectors, scalars, and other tensors... Given a reference basis of vectors, a tensor can be represented as an organized multi-dimensional array...