Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards.

Rommert Dekker, Arie Hordijk
Published in: Math. Oper. Res. (1988)