On the convergence of policy gradient methods to Nash equilibria in general stochastic games.
Angeliki Giannou
Kyriakos Lotidis
Panayotis Mertikopoulos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Published in:
NeurIPS (2022)
Keyphrases
stochastic games
Nash equilibria
game theory
incomplete information
Nash equilibrium
game theoretic
special case
Markov decision processes
fixed point
robust optimization
state action
cost function
learning automata
imperfect information