Login / Signup
A Note on the Linear Convergence of Policy Gradient Methods.
Jalaj Bhandari
Daniel Russo
Published in:
CoRR (2020)
Keyphrases
</>
policy gradient methods
natural actor critic
cost function
convergence speed
learning rate
policy gradient