Publication: Approximate Policy Iteration using Large-Margin Classifiers.