An application of simulation for large-scale Markov decision processes to a problem in telephone network routing.
Christopher W. ZobelWilliam T. SchererPublished in: SMC (1998)
Keyphrases
- markov decision processes
- network routing
- state space
- optimal policy
- finite state
- policy iteration
- transition matrices
- reinforcement learning
- dynamic programming
- action sets
- decision theoretic planning
- model based reinforcement learning
- dynamic optimization
- routing algorithm
- reward function
- average reward
- infinite horizon
- markov decision process
- long run
- mathematical model
- average cost
- convergence speed
- dynamical systems
- ant colony optimization
- stochastic shortest path