Publication: Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning.