Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium.
Qiaomin XieYudong ChenZhaoran WangZhuoran YangPublished in: Math. Oper. Res. (2023)
Keyphrases
- function approximation
- reinforcement learning
- markov games
- learning tasks
- reinforcement learning algorithms
- temporal difference learning
- learning algorithm
- learning process
- markov decision processes
- active learning
- model free
- markov decision process
- function approximators
- stochastic games
- genetic algorithm
- e learning
- markov chain
- machine learning