Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning.
Yuchen XiaoWeihao TanChristopher AmatoPublished in: CoRR (2022)
Keyphrases
- multi agent reinforcement learning
- actor critic
- reinforcement learning
- policy gradient
- reinforcement learning algorithms
- temporal difference
- average reward
- optimal control
- stochastic games
- function approximation
- learning agents
- neuro fuzzy
- policy iteration
- state space
- gradient method
- model free
- multi agent
- learning agent
- adaptive control
- rl algorithms
- single agent
- learning algorithm
- learning problems
- mobile robot
- multi agent systems
- function approximators
- infinite horizon
- markov decision processes
- transfer learning
- temporal difference learning
- least squares
- multi agent learning
- artificial intelligence