Login / Signup
Bounded regret in stochastic multi-armed bandits
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
Published in:
CoRR (2013)
Keyphrases
</>
multi armed bandits
multi armed bandit
bandit problems
regret bounds
decision problems
reinforcement learning
monte carlo
bayesian networks
multi class