PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods.

WooJae Jeon KangJun Lee Jeewoo Lee

Published in: CoRR (2024)

Keyphrases

policy gradient methods
reinforcement learning
natural actor critic
policy gradient
actor critic
robot arm
function approximation
reinforcement learning algorithms
function approximators
reinforcement learning methods
reinforcement learning problems
temporal difference
machine learning
model free
markov decision processes
state space
learning algorithm
optimal control