Login / Signup
Quality-based Rewards for Monte-Carlo Tree Search Simulations.
Tom Pepels
Mandy J. W. Tak
Marc Lanctot
Mark H. M. Winands
Published in:
ECAI (2014)
Keyphrases
</>
monte carlo tree search
monte carlo
evaluation function
tree search algorithm
reinforcement learning
decision making
temporal difference
game tree
mathematical models
bayesian reinforcement learning