Improved Path-length Regret Bounds for Bandits.

Sébastien Bubeck, Yuanzhi Li, Haipeng Luo, Chen-Yu Wei

Published in: CoRR (2019)

Keyphrases

path length
regret bounds
multi-armed bandit
shortest path
lower bound
online learning
small world
linear regression
machine learning
pairwise
special case
upper bound