Multi-agent hierarchical policy gradient for Air Combat Tactics emergence via self-play.
Zhixiao SunHaiyin PiaoZhen YangYiyang ZhaoGuang ZhanDeyun ZhouGuanglei MengHechang ChenXing ChenBohao QuYuanjie LuPublished in: Eng. Appl. Artif. Intell. (2021)