Model-Free Algorithm with Improved Sample Efficiency for Zero-Sum Markov Games.

Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang

Published in: CoRR (2023)

Keyphrases

model free
learning algorithm
optimal solution
neural network
search space
dynamic programming
reinforcement learning
convergence rate
reinforcement learning algorithms
state space
machine learning
dynamic environments
path planning
policy iteration