C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
The Role of Baselines in Policy Gradient Optimization.
Jincheng Mei
Wesley Chung
Valentin Thomas
Bo Dai
Csaba Szepesvári
Dale Schuurmans
Published in:
NeurIPS (2022)
Keyphrases
</>
parametric optimization
policy gradient
optimization problems
optimization algorithm
neural network
actor critic
reinforcement learning
real valued
function approximation
gradient method
variance reduction