Sample-based Distributional Policy Gradient.
Rahul Singh
Keuntaek Lee
Yongxin Chen
Published in:
CoRR (2020)
Keyphrases
policy gradient
parametric optimization
model-free reinforcement learning
actor-critic
reinforcement learning
gradient method
optimal control
approximation methods
reinforcement learning algorithms
Monte Carlo
function approximation
variance reduction