Publication: Policy Optimization with Augmented Value Targets for Generalization in Reinforcement Learning.