Improving Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms and Its Applications.
Qinshi Wang
Wei Chen
Published in:
NIPS (2017)
Keyphrases
regret bounds
multi-armed bandit
multi-armed bandits
lower bound
online learning
linear regression
upper bound
reinforcement learning
maximum entropy
multi-armed bandit problems
special case
text classification
Bregman divergences
bandit problems