Bounded regret in stochastic multi-armed bandits

Sébastien Bubeck, Vianney Perchet, Philippe Rigollet

Published in: CoRR (2013)

Keyphrases

multi-armed bandits
multi-armed bandit
bandit problems
regret bounds
decision problems
reinforcement learning
Monte Carlo
Bayesian networks
multi-class