Publication: On the Sample Complexity of Reinforcement Learning with Policy Space Generalization.