Improved Path-length Regret Bounds for Bandits.
Sébastien Bubeck
Yuanzhi Li
Haipeng Luo
Chen-Yu Wei
Published in:
CoRR (2019)
Keyphrases
path length
regret bounds
multi-armed bandit
shortest path
lower bound
online learning
small world
linear regression
machine learning
pairwise
special case
upper bound