Login / Signup
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning
Jonathan Sorg
Satinder P. Singh
Richard L. Lewis
Published in:
CoRR (2012)
Keyphrases
</>
bayesian reinforcement learning
reinforcement learning
optimal policy
monte carlo tree search
markov decision processes
monte carlo
reinforcement learning algorithms
np hard
dynamic programming
state space