Login / Signup

Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes.

Gerhard HübnerManfred Schäl
Published in: ZOR Methods Model. Oper. Res. (1991)
Keyphrases