Regularized Softmax Deep Multi-Agent Q-Learning.
Ling PanTabish RashidBei PengLongbo HuangShimon WhitesonPublished in: NeurIPS (2021)
Keyphrases
- multi agent
- reinforcement learning
- cooperative
- temporal difference learning
- multiagent systems
- single agent
- least squares
- intelligent agents
- multiple agents
- multi agent systems
- function approximation
- multi agent reinforcement learning
- autonomous agents
- total least squares
- learning rate
- coalition formation
- action selection
- reinforcement learning algorithms
- agent oriented
- risk minimization
- heterogeneous agents
- cooperative agents
- multiagent reinforcement learning