Model-Free Algorithm with Improved Sample Efficiency for Zero-Sum Markov Games.
Songtao Feng
Ming Yin
Yu-Xiang Wang
Jing Yang
Yingbin Liang
Published in:
CoRR (2023)
Keyphrases
model free
learning algorithm
optimal solution
neural network
search space
dynamic programming
reinforcement learning
convergence rate
reinforcement learning algorithms
state space
machine learning
dynamic environments
path planning
policy iteration