Publication: Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models.