Login / Signup

Geometry and convergence of natural policy gradient methods.

Johannes MüllerGuido Montúfar
Published in: CoRR (2022)
Keyphrases
  • policy gradient methods
  • natural actor critic
  • convergence rate
  • search space
  • convergence speed
  • neural network
  • cost function
  • robot arm