Login / Signup
Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games.
Yulai Zhao
Yuandong Tian
Jason D. Lee
Simon S. Du
Published in:
CoRR (2021)
Keyphrases
</>
search algorithm
machine learning
reinforcement learning algorithms