Login / Signup
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach.
Donghwan Lee
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
cooperative
learning algorithm
linear programming
dynamic environments
incomplete information
reinforcement learning algorithms