Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence.

Published in: CoRR (2022)

Keyphrases