Login / Signup
Doubly robust off-policy evaluation with shrinkage.
Yi Su
Maria Dimakopoulou
Akshay Krishnamurthy
Miroslav Dudík
Published in:
ICML (2020)
Keyphrases
</>
policy evaluation
least squares
temporal difference
monte carlo
matrix inversion
computer vision
kalman filter
variance reduction