Login / Signup
Multi-agent cooperation policy gradient method based on enhanced exploration for cooperative tasks.
Li-yang Zhao
Tian-qing Chang
Lei Zhang
Xin-lu Zhang
Jiang-feng Wang
Published in:
Int. J. Mach. Learn. Cybern. (2024)
Keyphrases
</>
gradient method
cooperative
multi agent cooperation
multi agent systems
policy gradient
multi agent
convergence rate
actor critic
step size
optimal policy
optimization methods
cost function
negative matrix factorization
information retrieval
learning algorithm
convex formulation