Login / Signup
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs.
Dongsheng Ding
Kaiqing Zhang
Jiali Duan
Tamer Basar
Mihailo R. Jovanovic
Published in:
CoRR (2022)
Keyphrases
</>
primal dual
sample complexity
linear programming
theoretical analysis
markov decision processes
convergence rate
convex optimization
machine learning
reinforcement learning
pairwise
active learning
cross validation
linear program
optimization methods
optimal control
approximation methods