Learning from Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning.
Ziwen DuNing YangZhonghua YuPhilip S. YuPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- learning problems
- learning algorithm
- knowledge acquisition
- learning process
- prior knowledge
- supervised learning
- learning mechanism
- online learning
- autonomous robots
- temporal difference learning
- autonomous learning
- active learning
- transfer learning
- function approximation
- optimal control
- learning agent
- learning agents