Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning.
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman
Published in: Proc. ACM Meas. Anal. Comput. Syst. (2023)
Keyphrases
- global convergence
- multi-agent reinforcement learning
- policy iteration
- convergence rate
- reinforcement learning
- Markov decision processes
- stochastic games
- convergence speed
- convergence analysis
- average reward
- global optimum
- model-free
- multi-agent
- optimization methods
- fixed point
- temporal difference
- optimal policy
- step size
- Markov decision process
- least squares
- finite state
- machine learning
- linear programming
- action selection
- optimal control
- multi-agent systems
- state space
- function approximation
- support vector
- learning problems
- transfer learning