Login / Signup
Regret bounds for restless Markov bandits.
Ronald Ortner
Daniil Ryabko
Peter Auer
Rémi Munos
Published in:
Theor. Comput. Sci. (2014)
Keyphrases
</>
regret bounds
semi markov
online learning
lower bound
linear regression
multi armed bandit
optimal control
upper bound
markov chain
bregman divergences
active learning
knn