Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits.

Published in: ALT (2011)

Keyphrases