Generative Slate Recommendation with Reinforcement Learning.
Romain DeffayetThibaut ThonetJean-Michel RendersMaarten de RijkePublished in: WSDM (2023)
Keyphrases
- reinforcement learning
- function approximation
- recommender systems
- generative model
- recommendation systems
- unsupervised learning
- state space
- collaborative filtering
- user preferences
- reinforcement learning algorithms
- machine learning
- model free
- data driven
- robotic control
- multi agent
- markov decision processes
- website
- personalized recommendation
- data sets
- temporal difference
- learning algorithm
- contextual bandit
- reinforcement learning methods
- learning process
- dynamic programming
- action selection
- information overload
- recommendation algorithms
- search engine
- temporal difference learning
- transfer learning