Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property.

Published in: AAAI (2024)

Keyphrases