Optimal Policy for Software Vulnerability Disclosure.
Ashish AroraRahul TelangHao XuPublished in: Manag. Sci. (2008)
Keyphrases
- optimal policy
- finite horizon
- markov decision processes
- decision problems
- reinforcement learning
- state space
- multistage
- dynamic programming
- finite state
- state dependent
- long run
- infinite horizon
- bayesian reinforcement learning
- average cost
- sufficient conditions
- markov decision process
- policy iteration
- lost sales
- control policies
- serial inventory systems
- production system
- machine learning
- partially observable markov decision processes
- control system
- inventory level
- markov decision problems