Model-based Bayesian Reinforcement Learning in Factored Markov Decision Process.
Bo WuYan-Peng FengHong-Yan ZhengPublished in: J. Comput. (2014)
Keyphrases
- markov decision process
- bayesian reinforcement learning
- optimal policy
- state space
- reinforcement learning
- markov decision processes
- infinite horizon
- decision problems
- dynamic programming
- temporal difference learning
- policy iteration
- initial state
- model free
- finite state
- partially observable markov decision processes
- long run
- heuristic search
- state variables
- reinforcement learning algorithms
- average cost
- markov decision problems
- markov chain
- partially observable
- belief state
- action space
- average reward
- monte carlo tree search
- sufficient conditions