Let's assess our understanding of the actor-critic method by answering the following questions:
- What is the actor-critic method?
- What is the role of the actor and critic networks?
- How does the actor-critic method differ from the policy gradient with the baseline method?
- What is the gradient update equation of the actor network?
- How does A2C work?
- What does asynchronous mean in A3C?
- How does A2C differ from A3C?