Login / Signup
TD(lambda) networks: temporal-difference networks with eligibility traces.
Brian Tanner
Richard S. Sutton
Published in:
ICML (2005)
Keyphrases
</>
temporal difference
eligibility traces
reinforcement learning algorithms
reinforcement learning
td learning
policy evaluation
evaluation function
machine learning
decision making
learning process
monte carlo
function approximation
active learning
markov decision processes
long run