Login / Signup

Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains.

Rolando Cavazos-Cadena
Published in: Math. Methods Oper. Res. (2002)
Keyphrases