On Reinforcement Learning for Turn-based Zero-sum Markov Games.
Devavrat ShahVarun SomaniQiaomin XieZhi XuPublished in: CoRR (2020)
Keyphrases
- markov games
- reinforcement learning
- reinforcement learning algorithms
- markov decision processes
- markov decision process
- multiagent reinforcement learning
- control problems
- state space
- stochastic games
- function approximation
- model free
- learning algorithm
- multi agent
- temporal difference
- multiagent systems
- supervised learning
- optimal policy
- learning process
- temporal difference learning
- cooperative
- machine learning
- nash equilibrium
- infinite horizon
- dynamic programming
- learning capabilities
- action space
- finite horizon
- policy iteration
- partially observable
- autonomous agents
- reward function
- optimal control
- convergence rate