Hands-On Vision and Behavior for Self-Driving Cars

By : Luca Venturi, Krishtof Korda

Hands-On Vision and Behavior for Self-Driving Cars

By: Luca Venturi, Krishtof Korda

Overview of this book

The visual perception capabilities of a self-driving car are powered by computer vision. The work relating to self-driving cars can be broadly classified into three components - robotics, computer vision, and machine learning. This book provides existing computer vision engineers and developers with the unique opportunity to be associated with this booming field. You will learn about computer vision, deep learning, and depth perception applied to driverless cars. The book provides a structured and thorough introduction, as making a real self-driving car is a huge cross-functional effort. As you progress, you will cover relevant cases with working code, before going on to understand how to use OpenCV, TensorFlow and Keras to analyze video streaming from car cameras. Later, you will learn how to interpret and make the most of lidars (light detection and ranging) to identify obstacles and localize your position. You’ll even be able to tackle core challenges in self-driving cars such as finding lanes, detecting pedestrian and crossing lights, performing semantic segmentation, and writing a PID controller. By the end of this book, you’ll be equipped with the skills you need to write code for a self-driving car running in a driverless car simulator, and be able to tackle various challenges faced by autonomous car engineers.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Code in Action

Download the color images

Conventions used

Get in touch

Reviews

Section 1: OpenCV and Sensors and Signals

Free Chapter

Chapter 1: OpenCV Basics and Camera Calibration

Technical requirements

Introduction to OpenCV and NumPy

Working with image files

Working with video files

Manipulating images

Pedestrian detection using HOG

Camera calibration with OpenCV

Summary

Questions

Chapter 2: Understanding and Working with Signals

Technical requirements

Understanding signal types

Analog versus digital

Serial versus parallel

Framed-based serial protocols

Summary

Questions

Further reading

Chapter 3: Lane Detection

Technical requirements

How to perform thresholding

Perspective correction

Edge detection

Finding the lanes using histograms

The sliding window algorithm

Enhancing a video

Rolling average

Summary

Questions

Section 2: Improving How the Self-Driving Car Works with Deep Learning and Neural Networks

Chapter 4: Deep Learning with Neural Networks

Technical requirements

Understanding machine learning and neural networks

Learning about convolutional neural networks

Getting started with Keras and TensorFlow

Detecting MNIST handwritten digits

Defining the model of the neural network

Summary

Questions

Further reading

Chapter 5: Deep Learning Workflow

Technical requirements

Obtaining the dataset

Understanding the three datasets

Understanding classifiers

The model

Visualizing the activations

Inference

Retraining

Summary

Questions

Chapter 6: Improving Your Neural Network

Technical requirements

A bigger model

A more efficient network

Building a smarter network with batch normalization

Early stopping

Improving the dataset with data augmentation

Improving the validation accuracy with dropout

Summary

Questions

Chapter 7: Detecting Pedestrians and Traffic Lights

Technical requirements

Detecting pedestrians, vehicles, and traffic lights with SSD

Detecting the color of a traffic light

Recognizing traffic lights and their colors

Summary

Questions

Further reading

Chapter 8: Behavioral Cloning

Technical requirements

Teaching a neural network how to drive with behavioral cloning

Introducing DAVE-2

Integrating the neural network with Carla

Self-driving!

Summary

Questions

Further reading

Chapter 9: Semantic Segmentation

Technical requirements

Introducing semantic segmentation

Understanding DenseNet for classification

Segmenting images with CNN

Adapting DenseNet for semantic segmentation

Coding the blocks of FC-DenseNet

Summary

Questions

Further reading

Section 3: Mapping and Controls

Chapter 10: Steering, Throttle, and Brake Control

Technical requirements

Why do you need controls?

Types of controllers

Implementing PID in CARLA

An example MPC in C++

Summary

Questions

Further reading

Chapter 11: Mapping Our Environments

Technical requirements

Why you need maps and localization

Types of mapping and localization

Open source mapping tools

SLAM with an Ouster lidar and Google Cartographer

Summary

Assessments

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

What this book covers

Chapter 1, OpenCV Basics and Camera Calibration, is an introduction to OpenCV and NumPy; you will learn how to manipulate images and videos, and how to detect pedestrians using OpenCV; in addition, it explains how a camera works and how OpenCV can be used to calibrate it.

Chapter 2, Understanding and Working with Signals, describes the different types of signals: serial, parallel, digital, analog, single-ended, and differential, and explains some very important protocols: CAN, Ethernet, TCP, and UDP.

Chapter 3, Lane Detection, teaches you everything you need to know to detect the lanes in a road using OpenCV. It covers color spaces, perspective correction, edge detection, histograms, the sliding window technique, and the filtering required to get the best detection.

Chapter 4, Deep Learning with Neural Networks, is a practical introduction to neural networks, designed to quickly teach how to write a neural network. It describes neural networks in general and convolutional neural networks in particular. It introduces Keras, a deep learning module, and it shows how to use it to detect handwritten digits and to classify some images.

Chapter 5, Deep Learning Workflow, ideally complements Chapter 4, Deep Learning with Neural Networks, as it describes the theory of neural networks and the steps required in a typical workflow: obtaining or creating a dataset, splitting it into training, validation, and test sets, data augmentation, the main layers used in a classifier, and how to train, do inference, and retrain. The chapter also covers underfitting and overfitting and explains how to visualize the activations of the convolutional layers.

Chapter 6, Improving Your Neural Network, explains how to optimize a neural network, reducing its parameters, and how to improve its accuracy using batch normalization, early stopping, data augmentation, and dropout.

Chapter 7, Detecting Pedestrians and Traffic Lights, introduces you to CARLA, a self-driving car simulator, which we will use to create a dataset of traffic lights. Using a pre-trained neural network called SSD, we will detect pedestrians, cars, and traffic lights, and we will use a powerful technique called transfer learning to train a neural network to classify the traffic lights according to their colors.

Chapter 8, Behavioral Cloning, explains how to train a neural network to drive CARLA. It explains what behavioral cloning is, how to build a driving dataset using CARLA, how to create a network that's suitable for this task, and how to train it. We will use saliency maps to get an understanding of what the network is learning, and we will integrate it with CARLA to help it self-drive!

Chapter 9, Semantic Segmentation, is the final and most advanced chapter about deep learning, and it explains what semantic segmentation is. It details an extremely interesting architecture called DenseNet, and it shows how to adapt it to semantic segmentation.

Chapter 10, Steering, Throttle, and Brake Control, is about controlling a self-driving car. It explains what a controller is, focusing on PID controllers and covering the basics of MPC controllers. Finally, we will implement a PID controller in CARLA.

Chapter 11, Mapping Our Environments, is the final chapter. It discusses maps, localization, and lidar, and it describes some open source mapping tools. You will learn what Simultaneous Localization and Mapping (SLAM) is and how to implement it using the Ouster lidar and Google Cartographer.

Hands-On Vision and Behavior for Self-Driving Cars

By : Luca Venturi, Krishtof Korda

Hands-On Vision and Behavior for Self-Driving Cars

By: Luca Venturi, Krishtof Korda

Overview of this book

Related Content you might be interested in

Current Title:

Hands-On Vision and Behavior for Self-Driving Cars

Applied Deep Learning and Computer Vision for Self-Driving Cars

Practical Convolutional Neural Networks

Hands-On Computer Vision with TensorFlow 2

What this book covers