Login / Signup
Incentivized Exploration for Multi-Armed Bandits under Reward Drift.
Zhiyuan Liu
Huazheng Wang
Fan Shen
Kai Liu
Lijun Chen
Published in:
AAAI (2020)
Keyphrases
</>
multi armed bandits
bandit problems
multi armed bandit
decision problems
reinforcement learning
bayesian networks
active learning
markov chain
long run