An asymptotically optimal policy for finite support models in the multiarmed bandit problem.

Junya Honda, Akimichi Takemura
Published in: Mach. Learn. (2011)