Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning.

Published in: Auton. Intell. Syst. (2022)

Keyphrases