Login / Signup
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization.
Luca D'Amico-Wong
Hugh Zhang
Marc Lanctot
David C. Parkes
Published in:
CoRR (2024)
Keyphrases
</>
regret minimization
reinforcement learning
state space
nash equilibrium
function approximation
cooperative
learning algorithm
multi agent
model free
action selection
game theoretic
multi agent reinforcement learning
multi agent systems
sufficient conditions
reinforcement learning algorithms
multi agent learning