Login / Signup
Beyond variance reduction: Understanding the true impact of baselines on policy optimization.
Wesley Chung
Valentin Thomas
Marlos C. Machado
Nicolas Le Roux
Published in:
CoRR (2020)
Keyphrases
</>
variance reduction
gradient estimation
monte carlo
policy gradient
sample size
random numbers
stability of feature selection
trade off
error rate
bias variance decomposition