A City-Wide Crowdsourcing Delivery System with Reinforcement Learning.
Yi DingBaoshen GuoLin ZhengMingming LuDesheng ZhangShuai WangSang Hyuk SonTian HePublished in: Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. (2021)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- wide range
- reinforcement learning algorithms
- temporal difference
- machine learning
- learning algorithm
- state space
- policy search
- temporal difference learning
- model free
- human computation
- function approximators
- partially observable
- urban areas
- action selection
- neural network
- optimal control
- robotic control