Actor-Critic for Multi-Agent Reinforcement Learning with Self-Attention.
Juan Zhao, Tong Zhu, Shuo Xiao, Zongqian Gao, Hao Sun
Published in: Int. J. Pattern Recognit. Artif. Intell. (2022)
Keyphrases
- multi agent reinforcement learning
- actor critic
- reinforcement learning
- multi agent
- temporal difference
- learning agents
- optimal control
- multi agent learning
- multi agent systems
- reinforcement learning algorithms
- neuro fuzzy
- stochastic games
- policy gradient
- gradient method
- function approximation
- policy iteration
- evaluation function
- machine learning
- optimal policy
- artificial intelligence