Login / Signup
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods.
Sara Klein
Simon Weissmann
Leif Döring
Published in:
ICLR (2024)
Keyphrases
</>
convergence analysis
policy gradient methods
natural actor critic
global convergence
optimality conditions
monte carlo
neural network
reinforcement learning
learning tasks
convergence rate
policy gradient