Debiased Off-Policy Evaluation for Recommendation Systems.
Yusuke NaritaShota YasuiKohei YataPublished in: RecSys (2021)
Keyphrases
- recommendation systems
- policy evaluation
- least squares
- reinforcement learning
- temporal difference
- monte carlo
- markov decision processes
- model free
- collaborative filtering
- policy iteration
- variance reduction
- function approximation
- recommender systems
- web search
- user preferences
- semi parametric
- search engine
- collaborative filtering recommendation algorithm
- user feedback
- gaussian process
- partially observable markov decision processes
- markov decision problems
- recommendation algorithms
- evaluation function
- optimal policy
- statistical inference
- step size
- state space
- multi agent