Book Image

Neural Networks with Keras Cookbook

By : V Kishore Ayyadevara
Book Image

Neural Networks with Keras Cookbook

By: V Kishore Ayyadevara

Overview of this book

This book will take you from the basics of neural networks to advanced implementations of architectures using a recipe-based approach. We will learn about how neural networks work and the impact of various hyper parameters on a network's accuracy along with leveraging neural networks for structured and unstructured data. Later, we will learn how to classify and detect objects in images. We will also learn to use transfer learning for multiple applications, including a self-driving car using Convolutional Neural Networks. We will generate images while leveraging GANs and also by performing image encoding. Additionally, we will perform text analysis using word vector based techniques. Later, we will use Recurrent Neural Networks and LSTM to implement chatbot and Machine Translation systems. Finally, you will learn about transcribing images, audio, and generating captions and also use Deep Q-learning to build an agent that plays Space Invaders game. By the end of this book, you will have developed the skills to choose and customize multiple neural network architectures for various deep learning problems you might encounter.
Table of Contents (18 chapters)

Connectionist temporal classification (CTC)

One of the limitations to perform supervised learning on top of handwritten text recognition or in speech transcription is that, using a traditional approach, we would have to provide the label of which part of the image contain a certain character (in the case of hand-writing recognition) or which subsegment of the audio contains a certain phoneme (multiple phonemes combine to form a word utterance).

However, providing the ground truth for each character in image, or each phoneme in speech transcription, is prohibitively costly when building the dataset, where there are thousands of words or hundreds of hours of speech to transcribe.

CTC comes in handy to address the issue of not knowing the mapping of different parts of images to different characters. In this section, we will learn about how CTC loss functions.

...