Chapter 1. What is Reinforcement Learning?

Reinforcement Learning is a subfield of machine learning which addresses the problem of automatic learning of optimal decisions over time. This is a general and common problem studied in many scientific and engineering fields.

In our changing world, even problems which look like static input-output problems become dynamic in a larger perspective. For example, consider that you're solving the simple supervised learning problem of pet image classification with two target classes—dog and cat. You've gathered the training dataset and implemented the classifier using your favorite deep learning toolkit, and after a while, the model that has converged demonstrates excellent performance. Good? Definitely! You've deployed it and left it running for a while. Then, after a vacation on some seaside resort, you discover that dog haircut fashions have changed, and a significant portion of your queries are now misclassified, so you need to update your training images and repeat the process again. Good? Definitely not!

The preceding example is intended to show that even simple Machine Learning (ML) problems have a hidden time dimension, which is frequently overlooked, but it might become an issue in a production system.

Reinforcement Learning (RL) is an approach that natively incorporates this extra dimension (which is usually time, but not necessarily) into learning equations, which puts it much close to the human perception of artificial intelligence. In this chapter, we will become familiar with the following:

How RL is related to and differs from other ML disciplines: supervised and unsupervised learning
What the main RL formalisms are and how they are related to each other
Theoretical foundations of RL: the Markov decision processes

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

Deep Reinforcement Learning Hands-On

By : Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla

Deep Reinforcement Learning Hands-On

By: Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla

Overview of this book

Chapter 1. What is Reinforcement Learning?

Deep Reinforcement Learning Hands-On

By : Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla

Deep Reinforcement Learning Hands-On

By: Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla

Overview of this book

Chapter 1. What is Reinforcement Learning?

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access