Actual Return Reinforcement Learning versus Temporal Differences: Some Theoretical and Experimental Results.

Mark D. Pendrith Malcolm R. K. Ryan

Published in: ICML (1996)

Keyphrases

temporal difference
reinforcement learning
function approximation
model free
reinforcement learning algorithms
neural network
state space
td learning
machine learning
learning algorithm
decision making
particle swarm optimization
optimal policy
evaluation function
step size
action selection