On the convergence of policy gradient methods to Nash equilibria in general stochastic games.

Published in: NeurIPS (2022)
