Real-Time Rideshare Driver Supply Values Using Online Reinforcement Learning.
Benjamin HanHyungjun LeeSébastien MartinPublished in: KDD (2022)
Keyphrases
- real time
- reinforcement learning
- robotic control
- online learning
- function approximation
- cellular phone
- model free
- learning process
- control system
- temporal difference
- intelligent vehicles
- low cost
- state space
- multi agent
- neural network
- policy search
- optimal policy
- action selection
- learning tasks
- markov decision processes
- data sets