Login / Signup
An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search.
Gongbo Zhang
Yijie Peng
Yilong Xu
Published in:
CoRR (2022)
Keyphrases
</>
monte carlo tree search
monte carlo
bayesian reinforcement learning
dynamic environments
tree search algorithm
monte carlo search
optimal policy
evaluation function
learning algorithm
temporal difference learning