A Two-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games.

Shreyas Sumithra Rudresha Villavarayan Antony Vijesh

Published in: CoRR (2024)

Keyphrases

learning algorithm
reinforcement learning algorithms
markov games
reinforcement learning
markov decision processes
multiagent reinforcement learning
machine learning
training data
supervised learning
machine learning algorithms
cooperative
active learning