Login / Signup
Maximizing the long-run average expected profit of a periodic-review assemble-to-order system.
Yaping Zhao
Xiaoyun Xu
Haidong Li
Published in:
CASE (2017)
Keyphrases
</>
long run
periodic review
infinite horizon
average cost
inventory policy
inventory level
optimal policy
expected cost
queueing networks
finite horizon
reinforcement learning
markov decision processes
heavy traffic
average reward
inventory systems