Login / Signup
Deeply-Debiased Off-Policy Interval Estimation.
Chengchun Shi
Runzhe Wan
Victor Chernozhukov
Rui Song
Published in:
ICML (2021)
Keyphrases
</>
interval estimation
collaborative filtering
markov decision processes
reinforcement learning
dynamic programming