Multi-Agent Deep Reinforcement Learning with Adaptive Policies.
Yixiang WangFeng WuPublished in: CoRR (2019)
Keyphrases
- reinforcement learning
- multi agent
- optimal policy
- reinforcement learning agents
- cooperative multi agent systems
- adaptive control
- function approximation
- state space
- control policies
- markov decision process
- policy search
- multi agent reinforcement learning
- multiagent reinforcement learning
- reinforcement learning algorithms
- intelligent agents
- learning capabilities
- cooperative
- reward function
- partially observable markov decision processes
- multi agent systems
- multi agent environments
- temporal difference
- multiple agents
- control policy
- markov decision problems
- supervised learning
- learning agents
- single agent
- learning process
- multiagent systems
- dynamic programming
- sufficient conditions
- traffic signal control
- fitted q iteration
- learning algorithm