Login / Signup
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment.
Jiaming Guo
Rui Zhang
Xishan Zhang
Shaohui Peng
Qi Yi
Zidong Du
Xing Hu
Qi Guo
Yunji Chen
Published in:
CoRR (2021)
Keyphrases
</>
dynamic environments
variance reduction
monte carlo
mobile robot
gradient estimation
path planning
sample size
changing environment
classification accuracy
importance sampling
bias variance decomposition
machine learning
image sequences
reinforcement learning
probabilistic model
confidence intervals