Towards Robust Off-Policy Evaluation via Human Inputs.
Harvineet Singh
Shalmali Joshi
Finale Doshi-Velez
Himabindu Lakkaraju
Published in:
AIES (2022)
Keyphrases
policy evaluation
least squares
neural network
reinforcement learning
computer vision
objective function
regression model
basis functions
temporal difference