Lower bounds and selectivity of weak-consistent policies in stochastic multi-armed bandit problem.

Published in: J. Mach. Learn. Res. (2013)

Keyphrases