RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning.
Wei QiuXiao MaBo AnSvetlana ObraztsovaShuicheng YanZhongwen XuPublished in: ICLR (2023)
Keyphrases
- multi agent reinforcement learning
- multi agent
- cooperative multi agent systems
- learning agents
- multi agent systems
- cooperative
- multi agent learning
- multiagent systems
- partially observable markov decision processes
- reinforcement learning
- optimal policy
- multiple agents
- autonomous agents
- intelligent agents
- single agent
- stochastic games
- agent interactions
- artificial intelligence
- distributed control