Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization.

Tanvi Verma Pradeep Varakantham

Published in: CoRR (2022)

Keyphrases

multi agent systems
learning algorithm
learning process
reinforcement learning
objective function
machine learning
learning scenarios
learning tasks
learning scheme
autonomous agents
background knowledge
learning systems
knowledge acquisition
online learning
supervised learning
active learning
support vector
cooperative