Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium.
Qiaomin XieYudong ChenZhaoran WangZhuoran YangPublished in: CoRR (2020)
Keyphrases
- function approximation
- reinforcement learning
- learning tasks
- temporal difference learning
- markov games
- learning process
- reinforcement learning algorithms
- learning algorithm
- function approximators
- temporal difference
- game theory
- training data
- artificial neural networks
- supervised learning
- multi agent
- learning capabilities
- stochastic games
- machine learning