PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods.
WooJae JeonKangJun LeeJeewoo LeePublished in: CoRR (2024)
Keyphrases
- policy gradient methods
- reinforcement learning
- natural actor critic
- policy gradient
- actor critic
- robot arm
- function approximation
- reinforcement learning algorithms
- function approximators
- reinforcement learning methods
- reinforcement learning problems
- temporal difference
- machine learning
- model free
- markov decision processes
- state space
- learning algorithm
- optimal control