Global rewards in multi-agent deep reinforcement learning for autonomous mobility on demand systems.
Heiko Hoppe, Tobias Enders, Quentin Cappart, Maximilian Schiffer
Published in: L4DC (2024)
Keyphrases
- reinforcement learning
- multi agent
- cooperative
- Markov decision processes
- learning algorithm
- function approximation
- state space
- computer systems
- machine learning
- dynamic programming
- expert systems
- temporal difference
- decentralized control
- sensor networks
- intelligent systems
- complex systems
- robotic systems
- database systems
- autonomous systems
- learning agents
- multiple autonomous
- multi agent environments
- reinforcement learning agents