Login / Signup
Low-Variance Policy Gradient Estimation with World Models.
Michal Nauman
Floris den Hengst
Published in:
CoRR (2020)
Keyphrases
</>
gradient estimation
model selection
probabilistic model
optimal policy
dynamic programming
bayesian framework
low variance
unbiased estimator