A comprehensive analysis of warranty claims and optimal policies.
Ming LuoShaomin WuPublished in: Eur. J. Oper. Res. (2019)
Keyphrases
- comprehensive analysis
- optimal policy
- markov decision processes
- decision problems
- state space
- finite horizon
- dynamic programming
- reinforcement learning
- infinite horizon
- state dependent
- long run
- multistage
- finite state
- initial state
- average reward reinforcement learning
- sufficient conditions
- average reward
- markov decision process
- bayesian reinforcement learning
- serial inventory systems
- partially observable markov decision processes
- production system
- learning algorithm
- policy iteration
- average cost
- control policies
- markov decision problems
- reward function