An Improved Greedy Curvature Bound in Finite-Horizon String Optimization with an Application to a Sensor Coverage Problem.
Brandon Van OverBowen LiEdwin K. P. ChongAli PezeshkiPublished in: CDC (2023)
Keyphrases
- finite horizon
- optimal policy
- infinite horizon
- optimal stopping
- markov decision processes
- inventory control
- inventory models
- single product
- multistage
- search algorithm
- upper bound
- sensor networks
- greedy algorithm
- markov decision process
- yield management
- real time
- worst case
- average cost
- non stationary
- single item
- search space
- objective function
- reinforcement learning
- learning algorithm
- machine learning