Login / Signup
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards.
Anmol Kagrecha
Jayakrishnan Nair
Krishna P. Jagannathan
Published in:
NeurIPS (2019)
Keyphrases
</>
multi armed bandits
bandit problems
learning algorithm
reinforcement learning
probability distribution