Login / Signup
Incentivized Bandit Learning with Self-Reinforcing User Preferences.
Tianchen Zhou
Jia Liu
Chaosheng Dong
Jingyuan Deng
Published in:
CoRR (2021)
Keyphrases
</>
user preferences
hierarchical task networks
learning algorithm
reinforcement learning
learning process
active learning
online learning
search space
prior knowledge
learning tasks
user interests
search engine
domain knowledge
collaborative filtering
user feedback
user behaviour