Login / Signup

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits.

Alexandra CarpentierAlessandro LazaricMohammad GhavamzadehRémi MunosPeter Auer
Published in: ALT (2011)
Keyphrases
  • active learning
  • upper confidence bound
  • contextual bandit
  • multi armed bandits
  • learning algorithm
  • multi class
  • training set
  • data mining
  • decision making
  • probability distribution
  • supervised learning