Login / Signup
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence.
Sarath Pattathil
Kaiqing Zhang
Asuman E. Ozdaglar
Published in:
CoRR (2022)
Keyphrases
</>
multi agent learning
policy gradient
game theory
gradient method
convergence rate
multi agent
function approximation
artificial intelligence
reinforcement learning
optimal control
convergence speed
complex domains
variance reduction
cooperative
reinforcement learning algorithms
function approximators