Of Cores: A Partial-Exploration Framework for Markov Decision Processes.
Jan Kretínský, Tobias Meggendorfer
Published in: CONCUR (2019)
Keyphrases
- Markov decision processes
- decision theoretic planning
- optimal policy
- model based reinforcement learning
- reinforcement learning
- state space
- factored MDPs
- dynamic programming
- transition matrices
- finite state
- infinite horizon
- interval estimation
- semi-Markov decision processes
- average reward
- policy iteration
- sufficient conditions
- NP-hard