Best-response dynamics in zero-sum stochastic games.
David S. LeslieSteven PerkinsZibo XuPublished in: J. Econ. Theory (2020)
Keyphrases
- stochastic games
- nash equilibria
- games with incomplete information
- markov decision processes
- nash equilibrium
- multiagent reinforcement learning
- repeated games
- learning automata
- multi agent
- reinforcement learning algorithms
- imperfect information
- average reward
- robust optimization
- neural network
- state space
- dynamical systems
- monte carlo
- reinforcement learning