Machine Learning for OpenCV

By: Michael Beyeler

Overview of this book

Machine learning is no longer just a buzzword; it is all around us, from protecting your email, to automatically tagging friends in pictures, to predicting what movies you like. Computer vision is one of today's most exciting application fields of machine learning, with deep learning driving innovative systems such as self-driving cars and Google's DeepMind. OpenCV lies at the intersection of these topics, providing a comprehensive open-source library for classic as well as state-of-the-art computer vision and machine learning algorithms. In combination with the Python Anaconda distribution, you will have access to all the open-source computing libraries you could possibly ask for. Machine Learning for OpenCV begins by introducing you to the essential concepts of statistical learning, such as classification and regression. Once the basics are covered, you will explore algorithms such as decision trees, support vector machines, and Bayesian networks, and learn how to combine them with other OpenCV functionality. As the book progresses, so will your machine learning skills, until you are ready to take on today's hottest topic in the field: deep learning. By the end of this book, you will be ready to tackle your own machine learning problems, either by building on the existing source code or by developing your own algorithm from scratch.

Preface

What this book covers

What you need for this book

A Taste of Machine Learning

Getting started with machine learning

Problems that machine learning can solve

Getting started with Python

Getting started with OpenCV

Installation

Summary

Working with Data in OpenCV and Python

Understanding the machine learning workflow

Dealing with data using OpenCV and Python

Summary

First Steps in Supervised Learning

Understanding supervised learning

Using classification models to predict class labels

Using regression models to predict continuous outcomes

Classifying iris species using logistic regression

Summary

Representing Data and Engineering Features

Understanding feature engineering

Preprocessing data

Understanding dimensionality reduction

Representing categorical variables

Representing text features

Representing images

Summary

Using Decision Trees to Make a Medical Diagnosis

Understanding decision trees

Using decision trees to diagnose breast cancer

Using decision trees for regression

Summary

Detecting Pedestrians with Support Vector Machines

Understanding linear support vector machines

Dealing with nonlinear decision boundaries

Detecting pedestrians in the wild

Summary

Implementing a Spam Filter with Bayesian Learning

Understanding Bayesian inference

Implementing your first Bayesian classifier

Classifying emails using the naive Bayes classifier

Summary

Discovering Hidden Structures with Unsupervised Learning

Understanding unsupervised learning

Understanding k-means clustering

Understanding expectation-maximization

Compressing color spaces using k-means

Classifying handwritten digits using k-means

Organizing clusters as a hierarchical tree

Summary

Using Deep Learning to Classify Handwritten Digits

Understanding the McCulloch-Pitts neuron

Understanding the perceptron

Implementing your first perceptron

Understanding multilayer perceptrons

Getting acquainted with deep learning

Classifying handwritten digits

Summary

Combining Different Algorithms into an Ensemble

Understanding ensemble methods

Combining decision trees into a random forest

Using random forests for face recognition

Implementing AdaBoost

Combining different models into a voting classifier

Summary

Selecting the Right Model with Hyperparameter Tuning

Evaluating a model

Understanding cross-validation

Estimating robustness using bootstrapping

Assessing the significance of our results

Tuning hyperparameters with grid search

Scoring models using different evaluation metrics

Chaining algorithms together to form a pipeline

Summary

Wrapping Up

Approaching a machine learning problem

Building your own estimator

Where to go from here?

Summary

Representing text features

As with categorical features, scikit-learn offers an easy way to encode another common feature type: text features. When working with text features, it is often convenient to encode individual words or phrases as numerical values.

Let's consider a dataset that contains a small corpus of text phrases:

In [1]: sample = [
...        'feature engineering',
...        'feature selection',
...        'feature extraction'
...     ]

One of the simplest ways to encode such data is by word count: for each phrase, we simply count the occurrences of each word within it. In scikit-learn, this is easily done using CountVectorizer, which works much like DictVectorizer:

In [2]: from sklearn.feature_extraction.text import CountVectorizer
...     vec = CountVectorizer()
...     X = vec.fit_transform(sample)
...     X
Out[2]: <3x4...
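
To make the word-count idea concrete, here is a minimal standard-library sketch of the encoding described above. It is not how CountVectorizer is implemented. The helper name count_words is hypothetical, and the tokenization (lowercasing and splitting on whitespace) and the alphabetical column order are simplifying assumptions made for illustration.

```python
from collections import Counter

sample = [
    'feature engineering',
    'feature selection',
    'feature extraction',
]


def count_words(phrases):
    """Hypothetical helper: return (vocabulary, rows), where rows[i][j]
    is the number of times vocabulary[j] occurs in phrases[i]."""
    # Simplifying assumption: lowercase each phrase and split on whitespace.
    tokenized = [phrase.lower().split() for phrase in phrases]
    # One column per distinct word, in alphabetical order (an assumption).
    vocabulary = sorted({word for words in tokenized for word in words})
    rows = []
    for words in tokenized:
        counts = Counter(words)  # a Counter returns 0 for absent words
        rows.append([counts[word] for word in vocabulary])
    return vocabulary, rows


vocabulary, rows = count_words(sample)
print(vocabulary)
for phrase, row in zip(sample, rows):
    print(row, phrase)
```

With three phrases and four distinct words, the result has three rows and four columns, which matches the 3x4 shape reported in Out[2]. Each row contains a 1 in the column for "feature" plus a 1 in the column for the second word of that phrase.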
