Login / Signup
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings.
Matthew Shunshi Zhang
Murat Erdogdu
Animesh Garg
Published in:
CoRR (2021)
Keyphrases
</>
policy gradient methods
natural actor critic
convergence speed
convergence rate
optimal solution
machine learning
genetic algorithm
reinforcement learning
markov decision processes
policy gradient