An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.

Published in: Neural Networks (2024)

Keyphrases