POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning.
Chang Huang
Junqiao Zhao
Shatong Zhu
Hongtu Zhou
Chen Ye
Tiantian Feng
Changjun Jiang
Published in:
CoRR (2024)
Keyphrases
multi agent reinforcement learning
cooperative
joint action
multi agent
multi agent systems
multi agent learning
learning agent
learning agents
reinforcement learning
intelligent agents
dynamic programming
neural network
state space
worst case
stochastic games