Login / Signup

Robust and explorative behavior in model-based Bayesian reinforcement learning.

Toru HishinumaKei Senda
Published in: SSCI (2016)
Keyphrases
  • bayesian reinforcement learning
  • optimal policy
  • machine learning
  • learning algorithm
  • markov chain