Login / Signup
When People Change their Mind: Off-Policy Evaluation in Non-stationary Recommendation Environments.
Rolf Jagerman
Ilya Markov
Maarten de Rijke
Published in:
WSDM (2019)
Keyphrases
</>
non stationary
policy evaluation
least squares
reinforcement learning
markov decision processes
monte carlo
policy iteration
model free
temporal difference
variance reduction
collaborative filtering
statistical inference
semi parametric
empirical mode decomposition
data mining