A Policy Gradient Algorithm to Alleviate the Multi-Agent Value Overestimation Problem in Complex Environments.

Yang Yang, Jiang Li, Jinyong Hou, Ye Wang, Huadong Zhao
Published in: Sensors (2023)
Keyphrases