Publication: Restricted gradient-descent algorithm for value-function approximation in reinforcement learning.