Book Image

Java Deep Learning Cookbook

By : Rahul Raj
Book Image

Java Deep Learning Cookbook

By: Rahul Raj

Overview of this book

Java is one of the most widely used programming languages in the world. With this book, you will see how to perform deep learning using Deeplearning4j (DL4J) – the most popular Java library for training neural networks efficiently. This book starts by showing you how to install and configure Java and DL4J on your system. You will then gain insights into deep learning basics and use your knowledge to create a deep neural network for binary classification from scratch. As you progress, you will discover how to build a convolutional neural network (CNN) in DL4J, and understand how to construct numeric vectors from text. This deep learning book will also guide you through performing anomaly detection on unsupervised data and help you set up neural networks in distributed systems effectively. In addition to this, you will learn how to import models from Keras and change the configuration in a pre-trained DL4J model. Finally, you will explore benchmarking in DL4J and optimize neural networks for optimal results. By the end of this book, you will have a clear understanding of how you can use DL4J to build robust deep learning applications in Java.
Table of Contents (14 chapters)

Applying Transfer Learning to Network Models

In this chapter, we will talk about transfer learning methods, which are essential to reuse a model that was previously developed. We will see how we can apply transfer learning to the model created in Chapter 3, Building Deep Neural Networks for Binary Classification, as well as a pre-trained model from the DL4J Model Zoo API. We can use the DL4J transfer learning API to modify the network architecture, hold specific layer parameters while training, and fine-tune model configurations. Transfer learning enables improved performance and can develop skillful models. We pass learned parameters learned from another model to the current training session. If you have already set up the DL4J workspace for previous chapters, then you don't have to add any new dependencies in pom.xml; otherwise, you need to add the basic Deeplearning4j...