Login / Signup
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method.
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
Published in:
CoRR (2021)
Keyphrases
</>
gradient method
convergence rate
policy gradient
actor critic
log likelihood function
convergence speed
optimal policy
optimization methods
data sets
convex formulation
high dimensional
cost function
support vector machine
image compression
image representation
learning rate