Login / Signup
Measuring Policy Distance for Multi-Agent Reinforcement Learning.
Tianyi Hu
Zhiqiang Pu
Xiaolin Ai
Tenghai Qiu
Jianqiang Yi
Published in:
CoRR (2024)
Keyphrases
</>
multi agent reinforcement learning
multi agent
multi agent learning
reinforcement learning
multi agent systems
optimal policy
learning agents
cooperative
stochastic games
distributed control
markov chain
sufficient conditions
markov decision process