Login / Signup
On a Variance Reduction Correction of the Temporal Difference for Policy Evaluation in the Stochastic Continuous Setting.
Ziad Kobeissi
Francis Bach
Published in:
CoRR (2022)
Keyphrases
</>
variance reduction
policy evaluation
monte carlo
temporal difference
td learning
sample size
importance sampling
particle filter
naive bayes classifier
decision trees
markov chain
fixed point
function approximation
markov chain monte carlo