Login / Signup

Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators.

Yinbin HanMeisam RazaviyaynRenyuan Xu
Published in: CoRR (2023)
Keyphrases