Login / Signup
Adaptive Playouts in Monte-Carlo Tree Search with Policy-Gradient Reinforcement Learning.
Tobias Graf
Marco Platzner
Published in:
ACG (2015)
Keyphrases
</>
monte carlo tree search
monte carlo
tree search algorithm
evaluation function
game tree
neural network
multi agent
constraint satisfaction
adaptive control
temporal difference learning