A Taxonomy of Similarity Metrics for Markov Decision Processes.
Álvaro VisúsJavier GarcíaFernando FernándezPublished in: CoRR (2021)
Keyphrases
- similarity metrics
- markov decision processes
- similarity metric
- similarity measure
- state space
- optimal policy
- transition matrices
- finite state
- policy iteration
- reinforcement learning
- reachability analysis
- similarity measurement
- decision theoretic planning
- dynamic programming
- average cost
- planning under uncertainty
- finite horizon
- average reward
- infinite horizon
- reinforcement learning algorithms
- reward function
- partially observable
- action space
- factored mdps
- model based reinforcement learning