Login / Signup
A Two-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games.
Shreyas Sumithra Rudresha
Villavarayan Antony Vijesh
Published in:
CoRR (2024)
Keyphrases
</>
learning algorithm
reinforcement learning algorithms
markov games
reinforcement learning
markov decision processes
multiagent reinforcement learning
machine learning
training data
supervised learning
machine learning algorithms
cooperative
active learning