Toward Off-Policy Learning Control with Function Approximation.

Hamid Reza Maei Csaba Szepesvári Shalabh Bhatnagar Richard S. Sutton

Published in: ICML (2010)

Keyphrases

function approximation
reinforcement learning
learning tasks
temporal difference learning
td learning
learning algorithm
supervised learning
actor critic
learning process
prior knowledge
machine learning
adaptive control
radial basis function
function approximators
control system
neural network
temporal difference learning algorithms