Login / Signup
Regret Bounds for Batched Bandits.
Hossein Esfandiari
Amin Karbasi
Abbas Mehrabian
Vahab S. Mirrokni
Published in:
AAAI (2021)
Keyphrases
</>
regret bounds
online learning
lower bound
linear regression
upper bound
multi armed bandit
training data
bregman divergences