Split Moves for Monte-Carlo Tree Search.
Jakub KowalskiMaksymilian MikaWojciech PawlikJakub SutowiczMarek SzykulaMark H. M. WinandsPublished in: AAAI (2022)
Keyphrases
- monte carlo tree search
- monte carlo
- tree search algorithm
- evaluation function
- bayesian reinforcement learning
- monte carlo search
- temporal difference
- alpha beta search
- markov chain
- game tree
- temporal difference learning
- neural network
- fixed point
- reinforcement learning methods
- particle filter
- lower bound
- reinforcement learning