Login / Signup
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment.
Jiaming Guo
Rui Zhang
Xishan Zhang
Shaohui Peng
Qi Yi
Zidong Du
Xing Hu
Qi Guo
Yunji Chen
Published in:
IJCAI (2021)
Keyphrases
</>
dynamic environments
variance reduction
monte carlo
mobile robot
gradient estimation
sample size
path planning
changing environment
bias variance decomposition
real environment
policy gradient
image sequences
optimal solution
semi supervised
density function
importance sampling