Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation.

Luo Ji Qin Qi Bingqing Han Hongxia Yang

Published in: CoRR (2021)

Keyphrases

reinforcement learning
function approximation
state space
neural network
supervised learning
action selection
learning algorithm
reinforcement learning algorithms
dynamic programming
machine learning
control problems
temporal difference
markov decision processes
learning problems
optimal control
learning classifier systems
learning process
multi agent
case study
evolutionary learning
relational reinforcement learning