Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Deep Reinforcement Learning Hands-On
  • Table Of Contents Toc
Deep Reinforcement Learning Hands-On

Deep Reinforcement Learning Hands-On

By : Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla
4.3 (34)
close
close
Deep Reinforcement Learning Hands-On

Deep Reinforcement Learning Hands-On

4.3 (34)
By: Maxim Lapan, Oleg Vasilev, Martijn van Otterlo, Mikhail Yurushkin, Basem O. F. Alijla

Overview of this book

Deep Reinforcement Learning Hands-On is a comprehensive guide to the very latest DL tools and their limitations. You will evaluate methods including Cross-entropy and policy gradients, before applying them to real-world environments. Take on both the Atari set of virtual games and family favorites such as Connect4. The book provides an introduction to the basics of RL, giving you the know-how to code intelligent learning agents to take on a formidable array of practical tasks. Discover how to implement Q-learning on 'grid world' environments, teach your agent to buy and trade stocks, and find out how natural language models are driving the boom in chatbots.
Table of Contents (21 chapters)
close
close
20
Index

Basic DQN

By combining all the above, we can reimplement the same DQN agent in a much shorter, but still flexible, way, which will become handy later, when we'll start to modify and change various DQN parts to make the DQN better.

In the basic DQN implementation we have three modules:

  • Chapter07/lib/dqn_model.py: The DQN neural network, which is the same as we've seen in the previous chapter
  • Chapter07/lib/common.py: Common functions used in this chapter's examples, but too specialized to be moved to PTAN
  • Chapter07/01_dqn_basic.py: The creation of all used pieces and the training loop

Let's start with the contents of lib/common.py. First of all, we have here hyperparameters for our Pong environment, that was introduced in the previous chapter. The hyperparameters are stored in the dict, with keys as the configuration name and values as a dict of parameters. This makes it easy to add another configuration set for more complicated Atari games.

HYPERPARAMS = {
    'pong...
CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Deep Reinforcement Learning Hands-On
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon