Login / Signup
Optimal Exploitation of Clustering and History Information in Multi-armed Bandit.
Djallel Bouneffouf
Srinivasan Parthasarathy
Horst Samulowitz
Martin Wistuba
Published in:
IJCAI (2019)
Keyphrases
</>
reinforcement learning
nearest neighbor
clustering method