Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation.
Chongming Gao, Kexin Huang, Jiawei Chen, Yuan Zhang, Biao Li, Peng Jiang, Shiqi Wang, Zhong Zhang, Xiangnan He
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- real time
- reinforcement learning algorithms
- state space
- markov decision processes
- recommender systems
- computer graphics
- temporal difference learning
- user friendly
- optimal policy
- collaborative filtering
- virtual reality
- user interaction
- active learning
- machine learning
- markov decision process
- action space
- graphical interface
- neural network