Login / Signup
Distributional Offline Policy Evaluation with Predictive Error Guarantees.
Runzhe Wu
Masatoshi Uehara
Wen Sun
Published in:
ICML (2023)
Keyphrases
</>
policy evaluation
variance reduction
least squares
monte carlo
temporal difference
reinforcement learning
markov decision processes
model free
matrix inversion
semi parametric
policy iteration
statistical inference
function approximation
machine learning
optimal solution
computational complexity