Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings.

Matthew Shunshi Zhang Murat Erdogdu Animesh Garg

Published in: CoRR (2021)

Keyphrases

policy gradient methods
natural actor critic
convergence speed
convergence rate
optimal solution
machine learning
genetic algorithm
reinforcement learning
markov decision processes
policy gradient