Hands-On Java Deep Learning for Computer Vision

By : Klevis Ramo

Hands-On Java Deep Learning for Computer Vision

By: Klevis Ramo

Overview of this book

Although machine learning is an exciting world to explore, you may feel confused by all of its theoretical aspects. As a Java developer, you will be used to telling the computer exactly what to do, instead of being shown how data is generated; this causes many developers to struggle to adapt to machine learning. The goal of this book is to walk you through the process of efficiently training machine learning and deep learning models for Computer Vision using the most up-to-date techniques. The book is designed to familiarize you with neural networks, enabling you to train them efficiently, customize existing state-of-the-art architectures, build real-world Java applications, and get great results in a short space of time. You will build real-world Computer Vision applications, ranging from a simple Java handwritten digit recognition model to real-time Java autonomous car driving systems and face recognition models. By the end of this book, you will have mastered the best practices and modern techniques needed to build advanced Computer Vision Java applications and achieve production-grade accuracy.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Introduction to Computer Vision and Training Neural Networks

The computer vision state

Exploring neural networks

How does a neural network learn?

Organizing data and applications

Effective training techniques

Optimizing algorithms

Configuring the training parameters of the neural network

Representing images and outputs

Building a handwritten digit recognizer

Summary

Convolutional Neural Network Architectures

Understanding edge detection

Building a Java edge detection application

Convolution on RGB images

Working with convolutional layers' parameters

Pooling layers

Building and training a Convolution Neural Network

Improving the handwritten digit recognition application

Summary

Transfer Learning and Deep CNN Architectures

Working with classical networks

Using residual networks for image recognition

The power of 1 x 1 convolutions and the inception network

Applying transfer learning

Neural networks

Building an animal image classification – using transfer learning and VGG-16 architecture

Summary

Real-Time Object Detection

Resolving object localization

Object detection with the sliding window solution

Convolutional sliding window

Detecting objects with the YOLO algorithm

Max suppression and anchor boxes

Building a real-time video, car, and pedestrian detection application

Summary

Creating Art with Neural Style Transfer

What are convolution network layers learning?

Neural style transfer

Applying content cost function

Applying style cost function

Building a neural network that produces art

Summary

Face Recognition

Problems in face detection

Differentiating inputs with Siamese networks

Exploring triplet loss

Binary classification

Building a face recognition Java application

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

The computer vision state

In this section, we will look at how computer vision has grown over the past couple of years into the current field of computer vision we have today. As mentioned before, the progress in the field of deep learning is what propelled computer vision to advance.

Deep learning has enabled a lot of applications that seemed impossible before. These include the following:

Autonomous driving: An algorithm is able to detect the location of pedestrians and other cars, helping to make decisions about the direction of the vehicle and avoid accidents.
Face recognition and smarter mobile applications: You may already have seen phones that can be unlocked using facial recognition. In the near future, we could have security systems based on this; for example, the door of your house may be unlocked by your face or your car may start after recognizing your face. Smart mobile applications with fancy features such as applying filters and grouping faces together have also improved drastically.
Art generation: Even generating art will be possible, as we will see during this book, using computer vision techniques.

What is really exciting is that we can use some of these ideas and architectures to build applications.

The importance of data in deep learning algorithms

The main source of knowledge for deep learning algorithms is data. Therefore, the quality and the amount of data greatly affects the performance of every algorithm.

For speech recognition, we have a decent amount of data, considering the complexity of the problem. Although the dataset for the images has dramatically improved, having a few more samples will help achieve better results for image recognition. On the other hand, when it comes to object detection, we have less data due to the complexity in the effort of marking each of the objects with a bounding box as shown in the diagram.

Computer vision is, in itself, a really complex problem to solve. Imagine having a bunch of pixels with decimal values, and from there, you have to figure out what they represent.

For this reason, computer vision has developed more complex techniques, larger and more complex architectures, and also a lot of parameters to tune. The rule is such that the less data you have, the more hacks are needed, the more engineering or manual creation of features is required, and the architectures tend to grow complex. On the other hand, if you have more data, the deep learning algorithm tends to do well, and hand-engineering the data becomes a whole lot easier, which means we don't have to tune the parameters and the network architectures stay simple.

Throughout this book, we'll look at several methods to tackle computer vision challenges, such as transfer learning using well-known architectures in literature and opera. We will also make good use of open source implementations. In the next section, we'll start to understand the basics of neural networks and their representations.

Hands-On Java Deep Learning for Computer Vision

By : Klevis Ramo

Hands-On Java Deep Learning for Computer Vision

By: Klevis Ramo

Overview of this book

Related Content you might be interested in

Current Title:

Hands-On Java Deep Learning for Computer Vision

Hands-On Mathematics for Deep Learning

Java Deep Learning Projects

Practical Convolutional Neural Networks