Interval dominance based structural results for Markov decision process.
Vikram KrishnamurthyPublished in: Autom. (2023)
Keyphrases
- markov decision process
- state space
- markov decision processes
- optimal policy
- reinforcement learning
- infinite horizon
- finite horizon
- transition matrices
- temporal difference learning
- policy iteration
- initial state
- transition probabilities
- markov chain
- dynamic programming
- reward function
- optimal control
- partial observability
- computational complexity