Efficient policy detecting and reusing for non-stationarity in Markov games.

Yan ZhengJianye HaoZongzhang ZhangZhaopeng MengTianpei YangYanran LiChangjie Fan
Published in: Auton. Agents Multi Agent Syst. (2021)