Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis.
Siyu Wang
Xiaocong Chen
Lina Yao
Julian J. McAuley
Published in:
CoRR (2022)
Keyphrases
reinforcement learning
computational model
mathematical model
experimental data
similarity measure
objective function
learning process
probabilistic model
state space
management system
learning algorithm
statistical model
Markov decision processes
function approximation
model-free
transition model