Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation.
Ramtin Keramati
Omer Gottesman
Leo Anthony Celi
Finale Doshi-Velez
Emma Brunskill
Published in:
CoRR (2021)
Keyphrases
policy evaluation
least squares
Monte Carlo
machine learning
variance reduction
temporal difference
text classification
Markov decision processes