Bayesian Residual Policy Optimization: : Scalable Bayesian Reinforcement Learning with Clairvoyant Experts.
Gilwoo LeeBrian HouSanjiban ChoudhurySiddhartha S. SrinivasaPublished in: IROS (2021)
Keyphrases
- bayesian reinforcement learning
- optimal policy
- monte carlo tree search
- reinforcement learning
- state space
- markov decision processes
- partially observable markov decision processes
- infinite horizon
- decision problems
- markov decision process
- sufficient conditions
- finite state
- np hard
- monte carlo
- evaluation function
- average reward