Policy Optimization for Markov Games: Unified Framework and Faster Convergence.
Runyu ZhangQinghua LiuHuan WangCaiming XiongNa LiYu BaiPublished in: NeurIPS (2022)
Keyphrases
- faster convergence
- unified framework
- markov games
- convergence speed
- markov decision processes
- step size
- global optimum
- global optimization
- markov decision process
- reinforcement learning algorithms
- control problems
- pso algorithm
- optimization problems
- reinforcement learning
- convergence rate
- optimal policy
- optimization method
- genetic programming
- differential evolution
- multiscale
- probabilistic model
- dynamic programming
- multi agent