Monte-Carlo Tree Search for Constrained POMDPs.
Jongmin LeeGeon-hyeong KimPascal PoupartKee-Eung KimPublished in: NeurIPS (2018)
Keyphrases
- monte carlo tree search
- bayesian reinforcement learning
- monte carlo
- tree search algorithm
- evaluation function
- reinforcement learning
- belief state
- reinforcement learning methods
- optimal policy
- partially observable markov decision processes
- dynamic programming
- alpha beta search
- learning algorithm
- temporal difference
- markov decision processes
- temporal difference learning
- partially observable
- function approximation
- computational complexity