Publication: Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations.