Option-Critic in Cooperative Multi-agent Systems.
Jhelum ChakravortyPatrick Nadeem WardJulien RoyMaxime Chevalier-BoisvertSumana BasuAndrei LupuDoina PrecupPublished in: AAMAS (2020)
Keyphrases
- cooperative multi agent systems
- reinforcement learning
- multi agent reinforcement learning
- multi agent systems
- multi agent
- function approximation
- reinforcement learning algorithms
- temporal difference
- cooperative
- optimal policy
- complex domains
- stochastic games
- policy gradient
- multi agent learning
- average reward
- learning agents
- learning algorithm