Login / Signup
Triply Robust Off-Policy Evaluation.
Anqi Liu
Hao Liu
Anima Anandkumar
Yisong Yue
Published in:
CoRR (2019)
Keyphrases
</>
policy evaluation
least squares
reinforcement learning
monte carlo
machine learning
learning algorithm
markov decision processes
model free