Verification of Markov Decision Processes using Learning Algorithms.
Tomás BrázdilKrishnendu ChatterjeeMartin ChmelikVojtech ForejtJan KretínskýMarta Z. KwiatkowskaDavid ParkerMateusz UjmaPublished in: CoRR (2014)
Keyphrases
- markov decision processes
- learning algorithm
- reinforcement learning algorithms
- reinforcement learning
- state space
- finite state
- optimal policy
- transition matrices
- policy iteration
- dynamic programming
- machine learning algorithms
- model checking
- learning problems
- policy evaluation
- planning under uncertainty
- model based reinforcement learning
- machine learning
- reachability analysis
- infinite horizon
- partially observable
- average cost
- factored mdps
- decision processes
- finite horizon
- average reward
- markov decision process
- decision theoretic planning
- supervised learning
- least squares
- learning tasks
- search algorithm
- action space
- discounted reward