Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach.

Donghwan Lee
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • cooperative
  • learning algorithm
  • linear programming
  • dynamic environments
  • incomplete information
  • reinforcement learning algorithms