Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings.

Matthew Shunshi Zhang Murat A. Erdogdu Animesh Garg

Published in: AAAI (2022)

Keyphrases

policy gradient methods
natural actor critic
optimal solution
convergence speed
policy gradient
genetic algorithm
cost function
learning problems