Sign in
Expected Sarsa(λ) with Control Variate for Variance Reduction.
Long Yang
Yu Zhang
Published in:
CoRR (2019)
Keyphrases
</>
variance reduction
importance sampling
gradient estimation
monte carlo
random numbers
reinforcement learning
machine learning
sample size
support vector
function approximation
optimal control
bias variance decomposition
quasi monte carlo