Publication: Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation.