An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.
Delin Guo
Lan Tang
Xinggan Zhang
Ying-Chang Liang
Published in:
Neural Networks (2024)
Keyphrases
cooperative
multi-agent
computational complexity
dynamic programming
learning algorithm
worst case
NP-hard
search space
control system
cost function
reinforcement learning
optimal solution
path planning
natural gradient