Login / Signup
Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes.
Dongsheng Ding
Kaiqing Zhang
Tamer Basar
Mihailo R. Jovanovic
Published in:
NeurIPS (2020)
Keyphrases
</>
markov decision processes
primal dual
dynamic programming
reinforcement learning
cost function
support vector machine
reinforcement learning algorithms
objective function
state space
sufficient conditions
optimal policy
mathematical model
finite state
average reward