Chapter 3
Robot Control System Using Deep Reinforcement Learning
Section 6
DQN to Control a Robot's Mobility
A general approach to the reinforcement learning problem is to estimate an evaluation function during the learning process. This function evaluates, through the sum of the rewards, how good or bad a particular policy is. Q-learning, in particular, tries to maximize the value of the Q function (the action-value function), which represents the maximum discounted future reward obtained when we perform action a in state s.

Here are the topics that we will cover now:
- DQN to Control a Robot's Mobility
- OpenAI Gym Installation and Methods
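
To make the action-value function described above more concrete, here is a minimal sketch of a single tabular Q-learning update. It is only an illustration under stated assumptions: the function name q_update, the learning rate alpha, the discount factor gamma, and the tiny 3-state, 2-action table are all hypothetical and not taken from the chapter. A DQN replaces the table with a neural network, but the target it learns from has the same form.

```python
def q_update(q, state, action, reward, next_state, done, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the TD error.

    q is a list of rows, one row per state, one entry per action.
    alpha (learning rate) and gamma (discount factor) are assumed values.
    """
    # Value of the best action in the next state; zero if the episode ended.
    best_next = 0.0 if done else max(q[next_state])
    # Target: immediate reward plus discounted maximum future value.
    target = reward + gamma * best_next
    td_error = target - q[state][action]
    # Move the current estimate a small step toward the target.
    q[state][action] += alpha * td_error
    return td_error


# Hypothetical table: 3 states, 2 actions, all estimates start at zero.
q_table = [[0.0, 0.0] for _ in range(3)]
q_table[1] = [0.5, 2.0]  # pretend state 1 already has some learned values

error = q_update(q_table, state=0, action=1, reward=1.0, next_state=1, done=False)
print(q_table[0][1], error)
```

Repeating this update over many transitions is what drives the estimates toward the maximum discounted future reward for each state-action pair.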