Real-Time Rideshare Driver Supply Values Using Online Reinforcement Learning.

Benjamin Han Hyungjun Lee Sébastien Martin

Published in: KDD (2022)

Keyphrases

real time
reinforcement learning
robotic control
online learning
function approximation
cellular phone
model free
learning process
control system
temporal difference
intelligent vehicles
low cost
state space
multi agent
neural network
policy search
optimal policy
action selection
learning tasks
markov decision processes
data sets