Login / Signup

NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching.

Hongbo ZhangGuang WangXu WangZhengyang ZhouChen ZhangZheng DongYang Wang
Published in: AAAI (2024)
Keyphrases
  • reinforcement learning
  • real time
  • scheduling problem
  • data sets
  • real world
  • information retrieval
  • learning algorithm
  • decision making
  • decision trees
  • learning environment
  • mobile robot
  • optimal policy