A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees.

Toshinori Kitamura, Tadashi Kozuno, Masahiro Kato, Yuki Ichihara, Soichiro Nishimori, Akiyoshi Sannai, Sho Sonoda, Wataru Kumagai, Yutaka Matsuo
Published in: CoRR (2024)
Keyphrases