The Role of Baselines in Policy Gradient Optimization.
Jincheng Mei
Wesley Chung
Valentin Thomas
Bo Dai
Csaba Szepesvári
Dale Schuurmans
Published in:
NeurIPS (2022)
Keyphrases
parametric optimization
policy gradient
optimization problems
optimization algorithm
neural network
actor critic
reinforcement learning
real valued
function approximation
gradient method
variance reduction