Login / Signup

A policy gradient method for semi-Markov decision processes with application to call admission control.

Sumeetpal S. SinghVladislav Z. B. TadicArnaud Doucet
Published in: Eur. J. Oper. Res. (2007)
Keyphrases
  • gradient method
  • semi markov decision processes
  • call admission control
  • optimal policy
  • average reward