Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games.
Yang CaiHaipeng LuoChen-Yu WeiWeiqiang ZhengPublished in: CoRR (2024)
Keyphrases
- optimal policy
- markov decision processes
- state space
- reinforcement learning
- finite horizon
- decision problems
- dynamic programming
- state dependent
- special case
- infinite horizon
- multistage
- long run
- reinforcement learning algorithms
- sufficient conditions
- finite state
- markov decision process
- policy iteration
- correlated equilibrium
- reward function
- average cost
- average reward
- markov decision problems