Pure Exploration in Multi-armed Bandits Problems.

Sébastien Bubeck Rémi Munos Gilles Stoltz

Published in: ALT (2009)

Keyphrases

multi armed bandits
optimization problems
reinforcement learning
lower bound
bandit problems