Login / Signup
Nonstochastic Bandits and Experts with Arm-Dependent Delays.
Dirk van der Hoeven
Nicolò Cesa-Bianchi
Published in:
CoRR (2021)
Keyphrases
</>
multi armed bandit problems
lower bound
stochastic systems
multi armed bandits
active learning
domain experts
data sets
case study
multiscale
reinforcement learning
relational databases
multiresolution
round trip