Login / Signup
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation.
Matilde Gargiani
Andrea Zanelli
Andrea Martinelli
Tyler H. Summers
John Lygeros
Published in:
CoRR (2022)
Keyphrases
</>
gradient estimation
gradient method
variance reduction
actor critic
bayesian networks
convergence rate
policy gradient
keywords
evolutionary algorithm
optimization methods
step size