Login / Signup
Learning in Restless Multi-Armed Bandits via Adaptive Arm Sequencing Rules.
Tomer Gafni
Kobi Cohen
Published in:
CoRR (2019)
Keyphrases
</>
multi armed bandits
learning algorithm
learning process
learning tasks
online learning
reinforcement learning
dynamic programming
sufficient conditions