Login / Signup

On the Convergence of Discounted Policy Gradient Methods.

Chris Nota
Published in: CoRR (2022)
Keyphrases
  • policy gradient methods
  • natural actor critic
  • dynamic programming
  • convergence speed
  • markov decision processes
  • robot arm
  • policy gradient
  • convergence rate
  • finite state
  • markov decision process