Login / Signup
On the Convergence of Discounted Policy Gradient Methods.
Chris Nota
Published in:
CoRR (2022)
Keyphrases
</>
policy gradient methods
natural actor critic
dynamic programming
convergence speed
markov decision processes
robot arm
policy gradient
convergence rate
finite state
markov decision process