Decentralized Multi-Agent Reinforcement Learning in Average-Reward Dynamic DCOPs.
Duc Thien NguyenWilliam YeohHoong Chuin LauShlomo ZilbersteinChongjie ZhangPublished in: AAAI (2014)
Keyphrases
- multi agent reinforcement learning
- average reward
- stochastic games
- multi agent
- markov decision processes
- reinforcement learning
- long run
- distributed constraint optimization
- model free
- cooperative
- dynamic environments
- optimal policy
- multi agent systems
- state space
- nash equilibrium
- learning agents
- multi agent learning