Optimal Policy for Dynamic Assortment Planning Under Multinomial Logit Models.
Xi Chen, Yining Wang, Yuan Zhou
Published in: Math. Oper. Res. (2021)
Keyphrases
- optimal policy
- multinomial logit
- stochastic inventory control
- decision problems
- Markov decision processes
- reinforcement learning
- Markov decision process
- finite horizon
- state dependent
- dynamic programming
- state space
- infinite horizon
- initial state
- inventory level
- lost sales
- multistage
- sufficient conditions
- finite state
- average cost
- feature selection