Login / Signup
Time-Constrained Actor-Critic Reinforcement Learning for Concurrent Order Dispatch in On-Demand Delivery.
Shuai Wang
Baoshen Guo
Yi Ding
Guang Wang
Suining He
Desheng Zhang
Tian He
Published in:
IEEE Trans. Mob. Comput. (2024)
Keyphrases
</>
reinforcement learning
actor critic
machine learning
function approximation
temporal difference
neural network
objective function
reinforcement learning algorithms
learning algorithm
dynamic programming
fixed point
policy iteration