Publication: Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy.