Book Image

Reinforcement Learning with TensorFlow & TRFL [Video]

By : Jim DiLorenzo
Book Image

Reinforcement Learning with TensorFlow & TRFL [Video]

By: Jim DiLorenzo

Overview of this book

<p>The TRFL library is a collection of key algorithmic components that are used for a large number of DeepMind agents such as DQN, DDPG, and the Importance of Weighted Actor Learner Architecture. With this course, you will learn to implement classical RL algorithms as well as other cutting-edge techniques.</p> <p>This course will help you get up-to-speed with the TRFL library quickly, so you can start building your own RL agents. Without wasting much time on theory, the course dives straightaway into designing and implementing RL algorithms.</p> <p>By the end, you will be quite familiar with the tool and will be ready to put your knowledge into practice in your own projects.</p> <p>The code bundle for this course is available at -&nbsp;<a href="https://github.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL" target="_blank">https://github.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL</a></p> <h1>Style and Approach</h1> <p>In each section of the course, we walk through part of the TRFL library. We explain how TRFL is used with clear code examples that highlight integrating TRFL into TensorFlow code, making it easy to deploy TRFL in new or existing projects. While this course emphasizes practical TRFL usage, we provide explanations that relate the TRFL library to the underlying theory and provide further resources for those wanting to know more.</p>
Table of Contents (5 chapters)
Chapter 3
Fundamentals of Deep RL: Policy Gradient Methods
Content Locked
Section 3
Applying Actor-Critic with A3C and A2C
Teach the Actor Critic model and how it combines policy gradients and value function methods. Discuss A3C and A2C as two methods that parallelize gradient collection from multiple Actor Critic agents. - Introduce Actor Critic definition and architecture - Discuss the A3C and A2C variants of Actor Critic - Show TensorFlow Actor Critic model and TRFL A2C loss