Login / Signup
Pure Exploration in Multi-armed Bandits Problems.
Sébastien Bubeck
Rémi Munos
Gilles Stoltz
Published in:
ALT (2009)
Keyphrases
</>
multi armed bandits
optimization problems
reinforcement learning
lower bound
bandit problems