Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization.
Tanvi VermaPradeep VarakanthamPublished in: CoRR (2022)
Keyphrases
- multi agent systems
- learning algorithm
- learning process
- reinforcement learning
- objective function
- machine learning
- learning scenarios
- learning tasks
- learning scheme
- autonomous agents
- background knowledge
- learning systems
- knowledge acquisition
- online learning
- supervised learning
- active learning
- support vector
- cooperative