An Overestimation Reduction Method Based on the Multi-step Weighted Double Estimation Using Value-Decomposition Multi-agent Reinforcement Learning.
Li-yang ZhaoTian-qing ChangLi-bin GuoJie ZhangLei ZhangJin-dun MaPublished in: Neural Process. Lett. (2024)