On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs.

Yashaswini Murthy, R. Srikant
Published in: CDC (2023)