Book Image

Hands-On Deep Learning with Go

By : Gareth Seneque, Darrell Chua
Book Image

Hands-On Deep Learning with Go

By: Gareth Seneque, Darrell Chua

Overview of this book

Go is an open source programming language designed by Google for handling large-scale projects efficiently. The Go ecosystem comprises some really powerful deep learning tools such as DQN and CUDA. With this book, you'll be able to use these tools to train and deploy scalable deep learning models from scratch. This deep learning book begins by introducing you to a variety of tools and libraries available in Go. It then takes you through building neural networks, including activation functions and the learning algorithms that make neural networks tick. In addition to this, you'll learn how to build advanced architectures such as autoencoders, restricted Boltzmann machines (RBMs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), and more. You'll also understand how you can scale model deployments on the AWS cloud infrastructure for training and inference. By the end of this book, you'll have mastered the art of building, training, and deploying deep learning models in Go to solve real-world problems.
Table of Contents (15 chapters)
Free Chapter
1
Section 1: Deep Learning in Go, Neural Networks, and How to Train Them
6
Section 2: Implementing Deep Neural Network Architectures
11
Section 3: Pipeline, Deployment, and Beyond!

Training RNNs

The way we train these networks is by using backpropagation through time (BPTT). This is an exotic name for a slight variation of something you already know of from Chapter 2, What is a Neural Network and How Do I Train One?. In this section, we will explore this variation in detail.

Backpropagation through time

With RNNs, we have multiple copies of the same network, one for each timestep. Therefore, we need a way to backpropagate the error derivatives and calculate weight updates for each of the parameters in every timestep. The way we do this is simple. We're following the contours of a function so that we can try and optimize its shape. We have multiple copies of the trainable parameters, one at each...