Login / Signup

Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting.

Kevin G. JamiesonRobert D. Nowak
Published in: CISS (2014)
Keyphrases
  • multi armed bandits
  • learning algorithm
  • multi armed bandit
  • least squares
  • mutual information
  • optimization problems