Optimal Exploitation of Clustering and History Information in Multi-armed Bandit.

Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistuba
Published in: IJCAI (2019)
Keyphrases
  • reinforcement learning
  • nearest neighbor
  • clustering method