Login / Signup
Adaptively Optimize Content Recommendation Using Multi Armed Bandit Algorithms in E-commerce.
Ding Xiang
Becky West
Jiaqi Wang
Xiquan Cui
Jinzhou Huang
Published in:
CoRR (2021)
Keyphrases
</>
multi armed bandit
learning algorithm
recommender systems
reinforcement learning
computational complexity
worst case
multi agent
loss function