Login / Signup
Off-Policy Evaluation in Embedded Spaces.
Jaron J. R. Lee
David Arbour
Georgios Theocharous
Published in:
CoRR (2022)
Keyphrases
</>
policy evaluation
least squares
temporal difference
reinforcement learning
model free
monte carlo
markov decision processes
policy iteration
variance reduction
function approximation
matrix inversion
semi parametric
machine learning
fixed point