Supervised Advantage Actor-Critic for Recommender Systems.
Xin XinAlexandros KaratzoglouIoannis ArapakisJoemon M. JosePublished in: CoRR (2021)
Keyphrases
- recommender systems
- actor critic
- reinforcement learning
- policy gradient
- collaborative filtering
- optimal control
- temporal difference
- learning algorithm
- function approximation
- gradient method
- supervised learning
- semi supervised
- reinforcement learning algorithms
- matrix factorization
- cost function
- policy iteration
- average reward
- approximate dynamic programming
- machine learning
- evaluation function
- neuro fuzzy
- active learning
- decision making