Login / Signup

On Upper-Confidence Bound Policies for Switching Bandit Problems.

Aurélien GarivierEric Moulines
Published in: ALT (2011)
Keyphrases
  • bandit problems
  • upper confidence bound
  • multi armed bandit problems
  • contextual bandit
  • decision problems
  • multi armed bandits
  • optimal policy
  • news recommendation
  • social networks
  • natural language
  • text mining