Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation.
Mohammad SalimibeniArash MohammadiParvin MalekzadehKonstantinos N. PlataniotisPublished in: Sensors (2022)
Keyphrases
- multi agent reinforcement learning
- temporal difference
- reinforcement learning
- function approximation
- evaluation function
- td learning
- multi agent
- reinforcement learning algorithms
- action selection
- monte carlo
- model free
- temporal difference learning
- training set
- dynamic environments
- function approximators
- distributed control