On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes.

Published in: CoRR (2024)

Keyphrases