Publication: Understanding Failures of Deterministic Actor-Critic with Continuous Action Spaces and Sparse Rewards.