Login / Signup
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning.
Jonathan Sorg
Satinder P. Singh
Richard L. Lewis
Published in:
UAI (2010)
Keyphrases
</>
bayesian reinforcement learning
reinforcement learning
monte carlo tree search
optimal policy
markov decision processes
partially observable markov decision processes
monte carlo
state space
objective function
computational complexity
exact solution
control problems