Java Deep Learning Projects

Overview of this book

Java is one of the most widely used programming languages. With the rise of deep learning, it has become a popular choice of tool among data scientists and machine learning experts. Java Deep Learning Projects starts with an overview of deep learning concepts and then delves into advanced projects. You will see how to build several projects using different deep neural network architectures such as multilayer perceptrons, Deep Belief Networks, CNN, LSTM, and Factorization Machines. You will get acquainted with popular deep learning and machine learning libraries for Java such as Deeplearning4j, Spark ML, and RankSys, and you'll be able to use their features to build and deploy projects on distributed computing environments. You will then explore advanced domains such as transfer learning and deep reinforcement learning using the Java ecosystem, covering various real-world domains such as healthcare, NLP, image classification, and multimedia analytics with an easy-to-follow approach. Expert reviews and tips follow every project to give you insights and hacks. By the end of this book, you will have stepped up your expertise in deep learning with Java, taking it beyond theory, and you will be able to build your own advanced deep learning systems.

Artificial Neural Networks

ANNs are the building blocks of deep learning. They are loosely modeled on the human nervous system, which consists of a large number of neurons that communicate with each other through axons.

Biological neurons

The working principles of ANNs are inspired by how the human brain works, as depicted in Figure 7. Receptors receive stimuli either internally or from the external world and then pass the information to the biological neurons for further processing. Each neuron has a number of dendrites, in addition to one long extension called the axon.

Towards the axon's extremity, there are minuscule structures called synaptic terminals, which connect one neuron to the dendrites of other neurons. Biological neurons receive short electrical impulses, called signals, from other neurons and, in response, trigger their own signals:

Working principle of biological neurons

In summary, a neuron comprises a cell body (also known as the soma), one or more dendrites for receiving signals from other neurons, and an axon for carrying the signals generated by the neuron to other neurons.

A neuron is in an active state when it is sending signals to other neurons, and in an inactive state when it is receiving signals from them. While inactive, a neuron accumulates all the signals it receives until it reaches a certain activation threshold. This behavior motivated researchers to introduce the ANN.

A brief history of ANNs

Inspired by the working principles of biological neurons, Warren McCulloch and Walter Pitts proposed the first artificial neuron model in 1943 as a computational model of nervous activity. This simple model of a biological neuron, also known as an artificial neuron (AN), has one or more binary (on/off) inputs and only one output.

An AN simply activates its output when at least a certain number of its inputs are active. For example, the following figure shows a few ANNs that perform various logical operations. In this example, we assume that a neuron is activated only when at least two of its inputs are active:

ANNs performing simple logical computations
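The threshold rule described above can be sketched in a few lines of Java. This is only an illustrative sketch, not the book's code: the class name `ThresholdNeuronDemo` and its methods are hypothetical, and the wiring (how many connections each input contributes) is an assumption chosen so that a threshold of two yields AND, OR, and identity.

```java
/** Hypothetical sketch of a McCulloch-Pitts style neuron with binary inputs. */
final class ThresholdNeuronDemo {

    /** Fires when the number of active input connections reaches the threshold. */
    static boolean fires(int threshold, boolean... connections) {
        int active = 0;
        for (boolean connection : connections) {
            if (connection) {
                active++;
            }
        }
        return active >= threshold;
    }

    // AND: one connection per input, so both inputs must be active.
    static boolean and(boolean a, boolean b) {
        return fires(2, a, b);
    }

    // OR: two connections per input, so a single active input reaches the threshold.
    static boolean or(boolean a, boolean b) {
        return fires(2, a, a, b, b);
    }

    // Identity: two connections from the same input.
    static boolean identity(boolean a) {
        return fires(2, a, a);
    }

    public static void main(String[] args) {
        boolean[] values = {false, true};
        for (boolean a : values) {
            for (boolean b : values) {
                System.out.printf("a=%b b=%b AND=%b OR=%b%n", a, b, and(a, b), or(a, b));
            }
        }
    }
}
```

Doubling a connection is what lets a single input reach the threshold on its own, which is why OR and identity work with the same neuron as AND.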

The example may sound trivial, but even with such a simplified model, it is possible to build a network of ANs, and these networks can be combined to compute complex logical expressions too. This simplified model inspired John von Neumann, Marvin Minsky, Frank Rosenblatt, and many others, and in 1957 Rosenblatt introduced another model called the perceptron.

The perceptron is one of the simplest ANN architectures developed in the last 60 years. It is based on a slightly different AN called a Linear Threshold Unit (LTU). The only difference is that the inputs and outputs are now numbers instead of binary on/off values. Each input connection is associated with a weight. The LTU computes a weighted sum of its inputs, applies a step function (which plays the role of an activation function) to that sum, and outputs the result:

The left-side figure represents an LTU and the right-side figure shows a perceptron
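As a minimal sketch of the LTU computation, the following Java class takes a weighted sum of numeric inputs and passes it through a step function. The class name `LinearThresholdUnit`, the bias term, the choice of a Heaviside step that fires at zero, and the example weights are all assumptions made for illustration; none of them come from the original text.

```java
/** Hypothetical sketch of a Linear Threshold Unit: weighted sum followed by a step function. */
final class LinearThresholdUnit {
    private final double[] weights;
    private final double bias; // assumed extra term that shifts the firing threshold

    LinearThresholdUnit(double[] weights, double bias) {
        this.weights = weights.clone();
        this.bias = bias;
    }

    /** Computes bias + sum of weight[i] * input[i]. */
    double weightedSum(double[] inputs) {
        if (inputs.length != weights.length) {
            throw new IllegalArgumentException("expected " + weights.length + " inputs");
        }
        double sum = bias;
        for (int i = 0; i < weights.length; i++) {
            sum += weights[i] * inputs[i];
        }
        return sum;
    }

    /** Heaviside step: outputs 1 when the weighted sum is non-negative, otherwise 0. */
    int output(double[] inputs) {
        return weightedSum(inputs) >= 0 ? 1 : 0;
    }

    public static void main(String[] args) {
        LinearThresholdUnit ltu = new LinearThresholdUnit(new double[] {0.5, -1.0}, 0.25);
        System.out.println(ltu.output(new double[] {2.0, 1.0})); // weighted sum is 0.25
        System.out.println(ltu.output(new double[] {1.0, 1.0})); // weighted sum is -0.25
    }
}
```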

One downside of a perceptron is that its decision boundary is linear. Perceptrons are therefore incapable of learning complex patterns, and they cannot even solve some simple problems such as Exclusive OR (XOR). Later on, however, these limitations were partly overcome by stacking multiple perceptrons into what is called a multilayer perceptron (MLP).
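To illustrate why stacking helps, here is a small Java sketch of a two-layer network of threshold units that computes XOR. The weights and biases are hand-picked assumptions (one hidden unit behaves like OR, the other like AND), not learned values, and the class name `XorNetwork` is hypothetical.

```java
/** Hypothetical sketch: XOR computed by two layers of threshold units with hand-picked weights. */
final class XorNetwork {

    /** Step function: 1 when the input is non-negative, otherwise 0. */
    static int step(double z) {
        return z >= 0 ? 1 : 0;
    }

    /** A single threshold unit: step(bias + sum of weight[i] * input[i]). */
    static int unit(double[] weights, double bias, double[] inputs) {
        double sum = bias;
        for (int i = 0; i < weights.length; i++) {
            sum += weights[i] * inputs[i];
        }
        return step(sum);
    }

    static int xor(int a, int b) {
        double[] inputs = {a, b};
        // Hidden layer: the first unit fires if at least one input is 1 (OR),
        // the second only if both inputs are 1 (AND).
        int orUnit = unit(new double[] {1.0, 1.0}, -0.5, inputs);
        int andUnit = unit(new double[] {1.0, 1.0}, -1.5, inputs);
        // Output layer: fires when OR is active but AND is not.
        return unit(new double[] {1.0, -1.0}, -0.5, new double[] {orUnit, andUnit});
    }

    public static void main(String[] args) {
        for (int a = 0; a <= 1; a++) {
            for (int b = 0; b <= 1; b++) {
                System.out.printf("%d XOR %d = %d%n", a, b, xor(a, b));
            }
        }
    }
}
```

No single threshold unit can produce this truth table, because no straight line separates the inputs (0,1) and (1,0) from (0,0) and (1,1). The hidden layer remaps the inputs so that the output unit's linear boundary is sufficient.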

How does an ANN learn?

The term and the idea of ANs arose from the concept of biological neurons. Similar to a biological neuron, an artificial neuron consists of the following:

One or more incoming connections that aggregate signals from other neurons
One or more output connections for carrying the signal to other neurons
An activation function, which determines the numerical value of the output signal

The learning process of a neural network is an iterative optimization of the weights (see more in the next section). The weights are updated in each epoch. Once training starts, the aim is to generate predictions that minimize the loss function. The performance of the network is then evaluated on the test set.

We now know the basic concept of an artificial neuron. However, generating artificial signals alone is not enough to learn a complex task. For that, a supervised learning algorithm called backpropagation is commonly used to train a complex ANN.
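The loop described above (update the weights every epoch, minimize a loss, then evaluate on a test set) can be sketched for a single neuron in plain Java. Everything specific here is an assumption made for illustration: the sigmoid activation, the cross-entropy loss, full-batch gradient descent as the optimizer, the learning rate and epoch count, and the tiny hand-made training and test sets. The class name `SingleNeuronTrainer` is hypothetical.

```java
/** Hypothetical sketch: one sigmoid neuron trained by gradient descent on toy data. */
final class SingleNeuronTrainer {
    // Toy data (assumed): points near (0,0) are class 0, points near (1,1) are class 1.
    static final double[][] TRAIN_X = {
        {0.0, 0.0}, {0.2, 0.1}, {0.1, 0.3}, {0.3, 0.2},
        {1.0, 1.0}, {0.9, 0.8}, {0.8, 1.0}, {1.0, 0.7}
    };
    static final int[] TRAIN_Y = {0, 0, 0, 0, 1, 1, 1, 1};
    static final double[][] TEST_X = {{0.1, 0.1}, {0.2, 0.2}, {0.9, 0.9}, {0.95, 0.85}};
    static final int[] TEST_Y = {0, 0, 1, 1};

    final double[] weights = new double[2]; // start from zero weights
    double bias = 0.0;

    static double sigmoid(double z) {
        return 1.0 / (1.0 + Math.exp(-z));
    }

    /** Activation of the neuron: sigmoid of the weighted sum of the inputs. */
    double predict(double[] x) {
        return sigmoid(weights[0] * x[0] + weights[1] * x[1] + bias);
    }

    /** Mean cross-entropy loss over a dataset. */
    double loss(double[][] xs, int[] ys) {
        double total = 0.0;
        for (int i = 0; i < xs.length; i++) {
            double p = predict(xs[i]);
            total += ys[i] == 1 ? -Math.log(p) : -Math.log(1.0 - p);
        }
        return total / xs.length;
    }

    /** One weight update per epoch, using the gradient averaged over all samples. */
    void train(double[][] xs, int[] ys, int epochs, double learningRate) {
        for (int epoch = 0; epoch < epochs; epoch++) {
            double gradW0 = 0.0, gradW1 = 0.0, gradB = 0.0;
            for (int i = 0; i < xs.length; i++) {
                double error = predict(xs[i]) - ys[i];
                gradW0 += error * xs[i][0];
                gradW1 += error * xs[i][1];
                gradB += error;
            }
            weights[0] -= learningRate * gradW0 / xs.length;
            weights[1] -= learningRate * gradW1 / xs.length;
            bias -= learningRate * gradB / xs.length;
        }
    }

    /** Fraction of samples whose thresholded prediction matches the label. */
    double accuracy(double[][] xs, int[] ys) {
        int correct = 0;
        for (int i = 0; i < xs.length; i++) {
            int label = predict(xs[i]) >= 0.5 ? 1 : 0;
            if (label == ys[i]) {
                correct++;
            }
        }
        return (double) correct / xs.length;
    }

    public static void main(String[] args) {
        SingleNeuronTrainer neuron = new SingleNeuronTrainer();
        System.out.println("loss before training: " + neuron.loss(TRAIN_X, TRAIN_Y));
        neuron.train(TRAIN_X, TRAIN_Y, 5000, 0.5);
        System.out.println("loss after training:  " + neuron.loss(TRAIN_X, TRAIN_Y));
        System.out.println("test accuracy:        " + neuron.accuracy(TEST_X, TEST_Y));
    }
}
```

Keeping the test points out of the training loop mirrors the evaluation step in the text: the weights are fitted only on the training set, and the held-out points are used afterwards to check how well the neuron generalizes.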