Adaptive Playouts in Monte-Carlo Tree Search with Policy-Gradient Reinforcement Learning.

Tobias Graf, Marco Platzner
Published in: ACG (2015)
Keyphrases
  • monte carlo tree search
  • monte carlo
  • tree search algorithm
  • evaluation function
  • game tree
  • neural network
  • multi agent
  • constraint satisfaction
  • adaptive control
  • temporal difference learning