An application of the temporal difference algorithm to the truck backer-upper problem.

Christopher J. Gatti Mark J. Embrechts

Published in: ESANN (2014)

Keyphrases

learning algorithm
dynamic programming
optimization algorithm
temporal difference
reinforcement learning
monte carlo
cost function
td learning
artificial neural networks
model free
multi step