Efficient Asymptotic Approximation in Temporal Difference Learning.
Frédérick Garcia
Florent Serre
Published in:
ECAI (2000)
Keyphrases
neural network
temporal difference learning
function approximation
closed form
evaluation function
approximate value iteration
reinforcement learning
loss bounds
learning algorithm
machine learning algorithms