Geometry and convergence of natural policy gradient methods.

Johannes Müller, Guido Montúfar

Published in: CoRR (2022)

Keyphrases

policy gradient methods
natural actor critic
convergence rate
search space
convergence speed
neural network
cost function
robot arm