Optimal policies with decreasing probability of imperfect maintenance.
Shey-Huei Sheu, Yuh-Bin Lin, Gwo-Liang Liao
Published in: IEEE Trans. Reliab. (2005)
Keyphrases
- optimal policy
- markov decision processes
- decision problems
- finite horizon
- dynamic programming
- reinforcement learning
- state space
- long run
- serial inventory systems
- infinite horizon
- multistage
- finite state
- state dependent
- average reward
- dynamic programming algorithms
- sample path
- probability distribution
- average cost
- sufficient conditions
- initial state
- control policies
- average reward reinforcement learning
- markov decision process
- policy iteration
- markov chain
- partially observable markov decision processes
- large deviations
- expected reward
- bayesian reinforcement learning
- single stage
- optimality criterion