Explore, Filter and Distill: Distilled Reinforcement Learning in Recommendation.
Ruobing XieShaoliang ZhangRui WangFeng XiaLeyu LinPublished in: CIKM (2021)
Keyphrases
- reinforcement learning
- function approximation
- model free
- learning algorithm
- machine learning
- user preferences
- recommender systems
- learning process
- multi agent
- filtering algorithm
- reinforcement learning algorithms
- temporal difference learning
- state space
- data sets
- temporal difference
- function approximators
- nonlinear filters
- customer preferences