Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups.
Marc LanctotMark H. M. WinandsTom PepelsNathan R. SturtevantPublished in: CoRR (2014)
Keyphrases
- monte carlo tree search
- game tree
- tree search algorithm
- evaluation function
- monte carlo
- game playing
- search algorithm
- minimax search
- bayesian reinforcement learning
- tree search
- search tree
- optimal strategy
- temporal difference learning
- reinforcement learning methods
- temporal difference
- alpha beta search
- neural network
- vehicle routing problem
- reinforcement learning