A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems.
Honghao Wei, Zixian Yang, Xin Liu, Zhiwei (Tony) Qin, Xiaocheng Tang, Lei Ying
Published in: IEEE Trans. Intell. Transp. Syst. (2024)
Keyphrases
- reinforcement learning
- transport systems
- online learning
- real time
- optimal policy
- learning algorithm
- e-learning
- management system
- distributed systems
- computer systems
- complex systems
- partially observable domains
- reinforcement learning problems
- intelligent transportation systems
- action space
- Markov decision process
- action selection
- human users
- optimal control
- pedestrian detection
- function approximation
- state space
- neural network