Book Image

Reinforcement Learning with TensorFlow & TRFL [Video]

By : Jim DiLorenzo
Book Image

Reinforcement Learning with TensorFlow & TRFL [Video]

By: Jim DiLorenzo

Overview of this book

<p>The TRFL library is a collection of key algorithmic components that are used for a large number of DeepMind agents such as DQN, DDPG, and the Importance of Weighted Actor Learner Architecture. With this course, you will learn to implement classical RL algorithms as well as other cutting-edge techniques.</p> <p>This course will help you get up-to-speed with the TRFL library quickly, so you can start building your own RL agents. Without wasting much time on theory, the course dives straightaway into designing and implementing RL algorithms.</p> <p>By the end, you will be quite familiar with the tool and will be ready to put your knowledge into practice in your own projects.</p> <p>The code bundle for this course is available at -&nbsp;<a href="https://github.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL" target="_blank">https://github.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL</a></p> <h1>Style and Approach</h1> <p>In each section of the course, we walk through part of the TRFL library. We explain how TRFL is used with clear code examples that highlight integrating TRFL into TensorFlow code, making it easy to deploy TRFL in new or existing projects. While this course emphasizes practical TRFL usage, we provide explanations that relate the TRFL library to the underlying theory and provide further resources for those wanting to know more.</p>
Table of Contents (5 chapters)
Chapter 5
Cutting Edge RL with Impala and Unreal
Content Locked
Section 1
Using Importance Sampling with Retrace (λ)
Retrace(λ) corrects for the off-policy-ness of data. Retrace(λ) reduces variance and better uses off-policy and on-policy returns than prior λ methods. Retrace(λ) builds upon TD(λ) and importance sampling. - Define importance sampling - Describe Retrace(λ) and its benefits - Use Retrace(λ) in TRFL with Taxi environment