A constrained optimization perspective on actor-critic algorithms and application to network routing.

Prashanth L. A.H. L. PrasadShalabh BhatnagarPrakash Chandra
Published in: Syst. Control. Lett. (2016)