Deep Learning with R for Beginners

Deep Learning with R for Beginners

By : Mark Hodnett, Joshua F. Wiley, Yuxi (Hayden) Liu, Pablo Maldonado

Buy this Book

Deep Learning with R for Beginners

By: Mark Hodnett, Joshua F. Wiley, Yuxi (Hayden) Liu, Pablo Maldonado

Buy this Book

Overview of this book

Deep learning has a range of practical applications in several domains, while R is the preferred language for designing and deploying deep learning models. This Learning Path introduces you to the basics of deep learning and even teaches you to build a neural network model from scratch. As you make your way through the chapters, you’ll explore deep learning libraries and understand how to create deep learning models for a variety of challenges, right from anomaly detection to recommendation systems. The Learning Path will then help you cover advanced topics, such as generative adversarial networks (GANs), transfer learning, and large-scale deep learning in the cloud, in addition to model optimization, overfitting, and data augmentation. Through real-world projects, you’ll also get up to speed with training convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long short-term memory networks (LSTMs) in R. By the end of this Learning Path, you’ll be well-versed with deep learning and have the skills you need to implement a number of deep learning concepts in your research work or projects.

Title Page

About Packt

Contributors

Preface

Free Chapter

Getting Started with Deep Learning

What is deep learning?

A conceptual overview of neural networks

Deep neural networks

Some common myths about deep learning

Setting up your R environment

Summary

Training a Prediction Model

Neural networks in R

The problem of overfitting data – the consequences explained

Use case – building and applying a neural network

Summary

Deep Learning Fundamentals

Building neural networks from scratch in R

Using regularization to overcome overfitting

Use case – improving out-of-sample model performance using dropout

Summary

Training Deep Prediction Models

Getting started with deep feedforward neural networks

Activation functions

Introduction to the MXNet deep learning library

Use case – using MXNet for classification and regression

Summary

Image Classification Using Convolutional Neural Networks

CNNs

Convolutional layers

Image classification using the MXNet library

References/further reading

Summary

Tuning and Optimizing Models

Evaluation metrics and evaluating performance

Data preparation

Data augmentation

Tuning hyperparameters

Use case—using LIME for interpretability

Summary

Natural Language Processing Using Deep Learning

Document classification

Advanced deep learning text classification

Summary

Deep Learning Models Using TensorFlow in R

Introduction to the TensorFlow library

TensorFlow models

TensorFlow estimators and TensorFlow runs packages

Summary

Anomaly Detection and Recommendation Systems

What is unsupervised learning?

How do auto-encoders work?

Training an auto-encoder in R

Using auto-encoders for anomaly detection

Use case – collaborative filtering

Summary

Running Deep Learning Models in the Cloud

Setting up a local computer for deep learning

Using AWS for deep learning

Using Azure for deep learning

Using Google Cloud for deep learning

Using Paperspace for deep learning

Summary

The Next Level in Deep Learning

Image classification models

Deploying TensorFlow models

Some common myths about deep learning

There are many misconceptions, half-truths, and downright misleading opinions on deep learning. Here are some common mis-conceptions regarding deep learning:

Artificial intelligence means deep learning and replaces all other techniques
Deep learning requires a PhD-level understanding of mathematics
Deep learning is hard to train, almost an art form
Deep learning requires lots of data
Deep learning has poor interpretability
Deep learning needs GPUs

The following paragraphs discuss these statements, one by one.

Deep learning is not artificial intelligence and does not replace all other machine learning algorithms. It is only one family of algorithms in machine learning. Despite the hype, deep learning probably accounts for less than 1% of the machine learning projects in production right now. Most of the recommendation engines and online adverts that you encounter when you browse the net are not powered by deep learning. Most models used internally by companies to manage their subscribers, for example churn analysis, are not deep learning models. The models used by credit institutions to decide who gets credit do not use deep learning.

Deep learning does not require a deep understanding of mathematics unless your interest is in researching new deep learning algorithms and specialized architectures. Most practitioners use existing deep learning techniques on their data by taking an existing architecture and modifying it for their work. This does not require a deep mathematical foundation, the mathematics used in deep learning are taught at high school level throughout the world. In fact, we demonstrate this in Chapter 3, Deep Learning Fundamentals, where we build an entire neural network from basic code in less than 70 lines of code!

Training deep learning models is difficult but it is not an art form. It does require practice, but the same problems occur over and over again. Even better, there is often a prescribed fix for that problem, for example, if your model is overfitting, add regularization, if your model is not training well, build a more complex model and/or use data augmentation. We will look at this in more depth in Chapter 6, Tuning and Optimizing Models.

There is a lot of truth to the statement that deep learning requires lots of data. However, you may still be able to apply deep learning to the problem by using a pre-trained network, or creating more training data from existing data (data augmentation). We will look at these in later Chapter 6, Tuning and Optimizing Models and Chapter 11, The Next Level in Deep Learning.

Deep learning models are difficult to interpret. By this, we mean being able to explain how the models came to their decision. This is a problem in many machine learning algorithms, not just deep learning. In machine learning, generally there is an inverse relationship between accuracy and interpretation – the more accurate the model needs to be, the less interpretable it is. For some tasks, for example, online advertising, interpretability is not important and there is little cost from being wrong, so the most powerful algorithm is preferred. In some cases, for example, credit scoring, interpretability may be required by law; people could demand an explanation of why they were denied credit. In other cases, such as medical diagnoses, interpretability may be important for a doctor to see why the model decided someone had a disease.

If interpretability is important, some methods can be applied to machine learning models to get an understanding of why they predicted the output for an instance. Some of them work by perturbing the data (that is, making slight changes to it) and trying to find what variables are most influential in the model coming to its decision. One such algorithm is called LIME (Local Interpretable Model-Agnostic Explanations). (Ribeiro, Marco Tulio, Sameer Singh, and Carlos Guestrin. Why should I trust you?: Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, 2016.) This has been implemented in many languages including R; there is a package called lime. We will use this package in Chapter 6, Tuning and Optimizing Models.

Finally, while deep learning models can run on CPUs, the truth is that any real work requires a workstation with a GPU. This does not mean that you need to go out and purchase one, as you can use cloud-computing to train your models. In Chapter 10, Running Deep Learning Models in the Cloud, will look at using AWS, Azure, and Google Cloud to train deep learning models.

Deep Learning with R for Beginners

By : Mark Hodnett, Joshua F. Wiley, Yuxi (Hayden) Liu, Pablo Maldonado

Deep Learning with R for Beginners

By: Mark Hodnett, Joshua F. Wiley, Yuxi (Hayden) Liu, Pablo Maldonado

Overview of this book

Related Content you might be interested in

Current Title:

Deep Learning with R for Beginners

Some common myths about deep learning