Best arm identification in multi-armed bandits with delayed feedback.
Aditya Grover
Todor M. Markov
Peter M. Attia
Norman Jin
Nicolas Perkins
Bryan Cheong
Michael H. Chen
Zi Yang
Stephen J. Harris
William C. Chueh
Stefano Ermon
Published in:
AISTATS (2018)
Keyphrases
multi-armed bandits
delayed feedback
bandit problems
least squares
loss function