Embracing Relational Reasoning in Multi-Agent Actor-Critic.
Sharlin UtkeJeremie HoussineauGiovanni MontanaPublished in: AAMAS (2024)
Keyphrases
- multi agent
- actor critic
- reinforcement learning
- policy gradient
- temporal difference
- optimal control
- reinforcement learning algorithms
- single agent
- approximate dynamic programming
- knowledge base
- multi agent systems
- neuro fuzzy
- gradient method
- cooperative
- multiple agents
- function approximation
- neural network
- markov decision processes
- markov chain
- machine learning