Login / Signup
On the Linear Convergence of Natural Policy Gradient Algorithm.
Sajad Khodadadian
Prakirt Raj Jhunjhunwala
Sushil Mahavir Varma
Siva Theja Maguluri
Published in:
CDC (2021)
Keyphrases
</>
learning algorithm
simulated annealing
dynamic programming
convergence rate
objective function
computational complexity
worst case
cost function
support vector
state space
particle swarm optimization
particle filter
policy gradient