Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.
Branislav Kveton
Csaba Szepesvári
Zheng Wen
Mohammad Ghavamzadeh
Tor Lattimore
Published in:
CoRR (2018)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit
decision problems
reinforcement learning
information extraction
action selection
optimal policy