Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation.
Luo JiQin QiBingqing HanHongxia YangPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- state space
- neural network
- supervised learning
- action selection
- learning algorithm
- reinforcement learning algorithms
- dynamic programming
- machine learning
- control problems
- temporal difference
- markov decision processes
- learning problems
- optimal control
- learning classifier systems
- learning process
- multi agent
- case study
- evolutionary learning
- relational reinforcement learning