Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods.
Ching-An Cheng
Xinyan Yan
Byron Boots
Published in:
CoRR (2019)
Keyphrases
variance reduction
policy gradient
policy gradient methods
sample size
control system
natural actor critic
bayesian networks
control strategies
importance sampling