Login / Signup
Bandits with Switching Costs: T^{2/3} Regret.
Ofer Dekel
Jian Ding
Tomer Koren
Yuval Peres
Published in:
CoRR (2013)
Keyphrases
</>
switching costs
regret bounds
multi armed bandit problems
multi armed bandit
multi armed bandits
prior studies
online learning
lower bound
bandit problems
individual level
online services
linear regression
network effects
reinforcement learning
website