Online Model-free Safety Verification for Markov Decision Processes Without Safety Violation.
Abhijit MazumdarRafal WisniewskiManuela-Luminita BujorianuPublished in: CoRR (2023)
Keyphrases
- markov decision processes
- model free
- policy iteration
- reinforcement learning
- reinforcement learning algorithms
- average reward
- risk sensitive
- policy evaluation
- state space
- optimal policy
- finite state
- function approximation
- temporal difference
- markov decision process
- reachability analysis
- machine learning
- transition matrices
- fixed point
- model based reinforcement learning
- dynamic programming
- planning under uncertainty
- model checking
- learning algorithm
- infinite horizon
- action space
- average cost
- partially observable
- stochastic games
- markov decision problems
- least squares
- decision problems
- decision theoretic planning
- action sets
- search algorithm