Reinforcement Learning Approach for Multi-period Inventory with Stochastic Demand.
Manoj ShakyaHuey Yuen NgDarrell Joshua OngBu-Sung LeePublished in: AIAI (1) (2022)
Keyphrases
- stochastic demand
- multi period
- infinite horizon
- production planning
- reinforcement learning
- planning horizon
- lead time
- optimal policy
- inventory control
- markov decision processes
- lot sizing
- lost sales
- optimal control
- single item
- multi item
- markov decision process
- finite horizon
- state space
- dynamic programming
- supply chain
- lot size
- total cost
- fixed cost
- long run
- inventory systems
- routing problem
- production cost
- holding cost
- mixed integer programming
- inventory level
- learning algorithm
- service level
- setup cost
- resource allocation
- integer programming
- production process
- markov chain
- random variables
- asymptotically optimal
- customer demand
- order quantity
- finite state