Publication: Imitation-Projected Policy Gradient for Programmatic Reinforcement Learning.