Login / Signup
Convergence and optimality of policy gradient primal-dual method for constrained Markov decision processes.
Dongsheng Ding
Kaiqing Zhang
Tamer Basar
Mihailo R. Jovanovic
Published in:
ACC (2022)
Keyphrases
</>
markov decision processes
dynamic programming
cost function
state space
linear programming
optimal solution
primal dual
convergence rate
average reward
computational complexity
support vector machine
mathematical model
linear program
finite state
negative matrix factorization