QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning.
Hafiz Raza ur RehmanByung-Won OnDevarani Devi NingombamSungwon YiGyu Sang ChoiPublished in: IEEE Access (2021)
Keyphrases
- multi agent reinforcement learning
- policy gradient
- reinforcement learning
- multi agent
- multi agent learning
- single agent
- learning agents
- function approximation
- stochastic games
- optimal control
- reinforcement learning algorithms
- gradient method
- multi agent systems
- machine learning
- cooperative
- dynamic programming
- model free
- learning agent