A Note on the Linear Convergence of Policy Gradient Methods.

Jalaj Bhandari Daniel Russo

Published in: CoRR (2020)

Keyphrases

policy gradient methods
natural actor critic
cost function
convergence speed
learning rate
policy gradient