Publication: A performance gradient perspective on gradient-based policy iteration and a modified value iteration.