Login / Signup
Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits.
Yingkai Li
Yining Wang
Yuan Zhou
Published in:
CoRR (2019)
Keyphrases
</>
worst case
regret bounds
multi armed bandit
lower bound
minimax regret
online learning
multi armed bandits
dynamic programming
upper bound
loss function
closed form
regret minimization
multi armed bandit problems