Keyphrases
- infinite horizon
- optimal control
- linear quadratic
- dynamic programming
- finite horizon
- control strategy
- single item
- production planning
- optimal policy
- multiple output
- stochastic demand
- markov decision processes
- long run
- markov decision process
- average cost
- partially observable
- dec pomdps
- fixed cost
- state space
- reinforcement learning
- policy iteration