Login / Signup
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
Published in:
AAMAS (2020)
Keyphrases
</>
policy gradient
gradient method
reinforcement learning
actor critic
parametric optimization
function approximation
optimal control
reinforcement learning algorithms
variance reduction
partially observable markov decision processes
model free reinforcement learning
average reward
approximation methods