Login / Signup
An application of the temporal difference algorithm to the truck backer-upper problem.
Christopher J. Gatti
Mark J. Embrechts
Published in:
ESANN (2014)
Keyphrases
</>
learning algorithm
dynamic programming
optimization algorithm
temporal difference
reinforcement learning
monte carlo
cost function
td learning
artificial neural networks
model free
multi step