Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory.
Arghya Roy Chaudhuri
Shivaram Kalyanakrishnan
Published in:
AAAI (2020)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit problems
multi armed bandit
decision problems
machine learning
expected utility