Login / Signup
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards.
Anmol Kagrecha
Jayakrishnan Nair
Krishna P. Jagannathan
Published in:
CoRR (2019)
Keyphrases
</>
multi armed bandits
bandit problems
learning algorithm
bayesian networks
reinforcement learning
multi armed bandit