Observing Continuous-Time MDPs by 1-Clock Timed Automata.
Taolue ChenTingting HanJoost-Pieter KatoenAlexandru MereacrePublished in: RP (2011)
Keyphrases
- timed automata
- reachability analysis
- markov decision processes
- state space
- model checking
- theorem prover
- high speed
- reinforcement learning
- markov chain
- theorem proving
- factored mdps
- optimal policy
- markov processes
- power consumption
- finite state
- optimal control
- temporal logic
- dynamical systems
- first order logic
- real time systems
- markov decision process
- planning under uncertainty
- policy iteration
- stationary policies
- decision theoretic planning
- low cost
- iterative learning control
- duty cycle
- infinite horizon
- stochastic processes
- initial state
- learning algorithm
- dynamic programming
- factored markov decision processes