Login / Signup
Incentivized Exploration for Multi-Armed Bandits under Reward Drift.
Zhiyuan Liu
Huazheng Wang
Fan Shen
Kai Liu
Lijun Chen
Published in:
CoRR (2019)
Keyphrases
</>
multi armed bandits
bandit problems
multi armed bandit
decision problems
reinforcement learning
lower bound
decision making
active learning
online learning