Login / Signup

On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case.

Arie HordijkMartin L. Puterman
Published in: Math. Oper. Res. (1987)
Keyphrases