Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games.
Yuepeng Yang
Cong Ma
Published in:
ICLR (2023)
Keyphrases
markov games
markov decision processes
multiagent reinforcement learning
reinforcement learning algorithms
convergence rate
machine learning
least squares
objective function
multiagent systems
search algorithm