Obtaining a Continuous Time Markov Decision Process from Statecharts.
Marcelino S. da SilvaÁdamo L. de SantanaCarlos Renato Lisboa FrancêsNandamudi Lankalapalli VijaykumarSolon V. CarvalhoPublished in: NaBIC (2009)
Keyphrases
- markov decision process
- state space
- semi markov decision process
- markov decision processes
- infinite horizon
- stationary policies
- optimal control
- reinforcement learning
- optimal policy
- markov chain
- finite horizon
- policy iteration
- transition matrices
- transition probabilities
- initial state
- dynamical systems
- dynamic programming
- stochastic processes
- reward function
- planning problems
- long run
- finite state