Sample Efficient Policy Gradient Methods with Recursive Variance Reduction.
Pan Xu
Felicia Gao
Quanquan Gu
Published in:
ICLR (2020)
Keyphrases
variance reduction
sample size
policy gradient
policy gradient methods
decision trees
monte carlo