Login / Signup

Concurrent Order Dispatch for Instant Delivery with Time-Constrained Actor-Critic Reinforcement Learning.

Baoshen GuoShuai WangYi DingGuang WangSuining HeDesheng ZhangTian He
Published in: RTSS (2021)
Keyphrases
  • reinforcement learning
  • actor critic
  • optimal control
  • multi agent
  • machine learning
  • function approximation
  • state space
  • approximate dynamic programming