Login / Signup
Are sample means in multi-armed bandits positively or negatively biased?
Jaehyeok Shin
Aaditya Ramdas
Alessandro Rinaldo
Published in:
NeurIPS (2019)
Keyphrases
</>
multi armed bandits
bandit problems
negatively correlated
reinforcement learning
linear programming
sample size