Login / Signup

Regret bounds for restless Markov bandits.

Ronald OrtnerDaniil RyabkoPeter AuerRémi Munos
Published in: Theor. Comput. Sci. (2014)
Keyphrases
  • regret bounds
  • semi markov
  • online learning
  • lower bound
  • linear regression
  • multi armed bandit
  • optimal control
  • upper bound
  • markov chain
  • bregman divergences
  • active learning
  • knn