Large-scale Interactive Recommendation with Tree-structured Policy Gradient.
Haokun Chen
Xinyi Dai
Han Cai
Weinan Zhang
Xuejian Wang
Ruiming Tang
Yuzhou Zhang
Yong Yu
Published in:
CoRR (2018)
Keyphrases
policy gradient
collaborative filtering
parametric optimization
recommender systems
actor-critic
model-free reinforcement learning
gradient method
decision trees
reinforcement learning
variance reduction