Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization.
Wesley Chung
Valentin Thomas
Marlos C. Machado
Nicolas Le Roux
Published in:
ICML (2021)
Keyphrases
variance reduction
gradient estimation
monte carlo
sample size
policy gradient
least squares
random numbers
quasi monte carlo
confidence intervals
importance sampling