Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning.
Sen ZhaoWei WeiYifan LiuZiyang WangWendi LiXian-Ling MaoShuai ZhuMinghui YangZujie WenPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- action selection
- online learning
- learning algorithm
- learning process
- partially observable environments
- machine learning
- temporal difference learning
- unsupervised learning
- optimal policy
- supervised learning
- learning tasks
- reinforcement learning problems
- active exploration
- learning systems
- transfer learning
- infinite horizon
- collaborative filtering
- partially observable
- reinforcement learning methods
- policy gradient
- hierarchical reinforcement learning
- eligibility traces
- active learning