C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Concurrent Order Dispatch for Instant Delivery with Time-Constrained Actor-Critic Reinforcement Learning.
Baoshen Guo
Shuai Wang
Yi Ding
Guang Wang
Suining He
Desheng Zhang
Tian He
Published in:
RTSS (2021)
Keyphrases
</>
reinforcement learning
actor critic
optimal control
multi agent
machine learning
function approximation
state space
approximate dynamic programming