Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace.
Soheil Sadeghi Eshkevari, Xiaocheng Tang, Zhiwei Qin, Jinhan Mei, Cheng Zhang, Qianying Meng, Jia Xu
Published in: CoRR (2022)
Keyphrases
- reinforcement learning
- learning algorithm
- dynamic programming
- single pass
- detection algorithm
- evolutionary algorithm
- worst case
- optimization algorithm
- preprocessing
- cost function
- probabilistic model
- model free
- k-means
- NP-hard
- memory efficient
- simulated annealing
- convergence rate
- policy iteration
- control policy
- Markov decision processes
- expectation maximization
- state space
- multi agent
- image segmentation