The Finite-Horizon Two-Armed Bandit Problem with Binary Responses: A Multidisciplinary Survey of the History, State of the Art, and Myths.
Peter JackoPublished in: CoRR (2019)
Keyphrases
- finite horizon
- infinite horizon
- optimal policy
- markov decision processes
- optimal stopping
- single product
- inventory models
- inventory control
- multistage
- yield management
- average cost
- markov decision process
- long run
- control policies
- single item
- fixed cost
- stochastic demand
- lost sales
- inventory policy
- dynamic programming
- mathematical model
- non stationary