Optimal control in light traffic Markov decision processes.
Ger KooleOlaf PasschierPublished in: Math. Methods Oper. Res. (1997)
Keyphrases
- optimal control
- markov decision processes
- dynamic programming
- reinforcement learning
- risk sensitive
- policy iteration
- infinite horizon
- average cost
- optimal policy
- control problems
- finite state
- state space
- decision theoretic planning
- transition matrices
- control strategy
- optimal control problems
- finite horizon
- real time
- partially observable
- reinforcement learning algorithms
- neural network
- approximate dynamic programming
- stochastic shortest path
- production planning
- multistage
- markov decision process
- partially observable markov decision processes
- temporal difference
- function approximation
- dynamical systems
- policy iteration algorithm
- machine learning