SMAC-tuned Deep Q-learning for Ramp Metering.
Omar ElSamadisy, Yazeed Abdulhai, Haoyuan Xue, Ilia Smirnov, Elias B. Khalil, Baher Abdulhai
Published in: SM (2023)
Keyphrases
- reinforcement learning
- function approximation
- multi agent
- learning algorithm
- cooperative
- state space
- stochastic approximation
- optimal policy
- action selection
- bucket brigade
- temporal difference learning
- learning agent
- model free
- learning rate
- real time
- markov chain
- case study
- neural network
- multiagent learning
- relational reinforcement learning