Login / Signup
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings.
Matthew Shunshi Zhang
Murat A. Erdogdu
Animesh Garg
Published in:
AAAI (2022)
Keyphrases
</>
policy gradient methods
natural actor critic
optimal solution
convergence speed
policy gradient
genetic algorithm
cost function
learning problems