Concurrent Order Dispatch for Instant Delivery with Time-Constrained Actor-Critic Reinforcement Learning.

Baoshen Guo, Shuai Wang, Yi Ding, Guang Wang, Suining He, Desheng Zhang, Tian He
Published in: RTSS (2021)
Keyphrases
  • reinforcement learning
  • actor critic
  • optimal control
  • multi agent
  • machine learning
  • function approximation
  • state space
  • approximate dynamic programming