Login / Signup
A policy gradient method for semi-Markov decision processes with application to call admission control.
Sumeetpal S. Singh
Vladislav Z. B. Tadic
Arnaud Doucet
Published in:
Eur. J. Oper. Res. (2007)
Keyphrases
</>
gradient method
semi markov decision processes
call admission control
optimal policy
average reward