Login / Signup
State Relevance for Off-Policy Evaluation.
Simon P. Shen
Yecheng Jason Ma
Omer Gottesman
Finale Doshi-Velez
Published in:
CoRR (2021)
Keyphrases
</>
policy evaluation
least squares
temporal difference
model free
machine learning
training data
monte carlo