Login / Signup
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games.
Yulai Zhao
Yuandong Tian
Jason D. Lee
Simon S. Du
Published in:
AISTATS (2022)
Keyphrases
</>
multiagent reinforcement learning
objective function
markov games
markov decision processes
convergence rate