Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation.
Tingting Zhao
Gang Niu
Ning Xie
Jucheng Yang
Masashi Sugiyama
Published in:
ACML (2015)
Keyphrases
gradient estimation
variance reduction
policy gradient
Monte Carlo
sample size
optimal policy
least squares
particle filter
confidence intervals