Nonstochastic Bandits and Experts with Arm-Dependent Delays.

Dirk van der Hoeven Nicolò Cesa-Bianchi

Published in: CoRR (2021)

Keyphrases

multi armed bandit problems
lower bound
stochastic systems
multi armed bandits
active learning
domain experts
data sets
case study
multiscale
reinforcement learning
relational databases
multiresolution
round trip