Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach.
Yan Cheng
Published in: Expert Syst. Appl. (2009)
Keyphrases
stochastic demand
infinite horizon
optimal policy
reinforcement learning
state space
learning algorithm
lead time
finite horizon
markov decision processes
finite number
inventory control
special case
dynamic programming
sufficient conditions
multistage