On Upper-Confidence Bound Policies for Switching Bandit Problems.

Aurélien Garivier, Eric Moulines

Published in: ALT (2011)

Keyphrases

bandit problems
upper confidence bound
multi-armed bandit problems
contextual bandit
decision problems
multi-armed bandits
optimal policy
news recommendation
social networks
natural language
text mining