Improving Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms and Its Applications.

Qinshi Wang, Wei Chen

Published in: NIPS (2017)

Keyphrases

regret bounds
multi-armed bandit
multi-armed bandits
lower bound
online learning
linear regression
upper bound
reinforcement learning
maximum entropy
multi-armed bandit problems
special case
text classification
Bregman divergences
bandit problems