Login / Signup
On the convergence of policy gradient methods to Nash equilibria in general stochastic games.
Angeliki Giannou
Kyriakos Lotidis
Panayotis Mertikopoulos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Published in:
CoRR (2022)
Keyphrases
</>
stochastic games
nash equilibria
incomplete information
game theory
nash equilibrium
special case
game theoretic
markov decision processes
average reward
state action
learning automata
robust optimization
convergence rate
multi agent
reinforcement learning algorithms
state transitions
objective function