H-TD2: Hybrid Temporal Difference Learning for Adaptive Urban Taxi Dispatch.

Benjamin Rivière Soon-Jo Chung

Published in: CoRR (2021)

Keyphrases

temporal difference learning
function approximation
temporal difference
fixed point
evaluation function
reinforcement learning
game playing
approximate value iteration
reinforcement learning algorithms
markov decision process
monte carlo
real valued
model free
machine learning
kernel methods
belief propagation
artificial neural networks