Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation.
Ramtin Keramati
Omer Gottesman
Leo Anthony Celi
Finale Doshi-Velez
Emma Brunskill
Published in: CHIL (2022)
Keyphrases
policy evaluation
least squares
temporal difference
reinforcement learning
model free
monte carlo
variance reduction
matrix inversion
decision trees
support vector
linear programming
markov decision processes
policy iteration