Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.
Branislav Kveton
Csaba Szepesvári
Sharan Vaswani
Zheng Wen
Tor Lattimore
Mohammad Ghavamzadeh
Published in: ICML (2019)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit
decision problems
reinforcement learning
information extraction
mutual information
long run
action selection
pairwise
multi objective
dynamical systems
information theoretic