Interval iteration algorithm for MDPs and IMDPs.
Serge HaddadBenjamin MonmegePublished in: Theor. Comput. Sci. (2018)
Keyphrases
- detection algorithm
- theoretical analysis
- preprocessing
- linear programming
- np hard
- optimal solution
- k means
- dynamic programming
- optimization algorithm
- experimental evaluation
- cost function
- computational complexity
- lower bound
- segmentation algorithm
- model free
- markov decision processes
- monte carlo
- policy iteration
- computational cost
- particle swarm optimization
- least squares
- reinforcement learning
- search space
- objective function