Login / Signup
Policy Evaluation Networks.
Jean Harb
Tom Schaul
Doina Precup
Pierre-Luc Bacon
Published in:
CoRR (2020)
Keyphrases
</>
policy evaluation
least squares
monte carlo
temporal difference
model free
variance reduction
matrix inversion
reinforcement learning
function approximation
policy iteration
np hard
linear regression
semi parametric