A PAC RL Algorithm for Episodic POMDPs.
Zhaohan Daniel GuoShayan DoroudiEmma BrunskillPublished in: AISTATS (2016)
Keyphrases
- learning algorithm
- reinforcement learning
- dynamic programming
- detection algorithm
- preprocessing
- search space
- computational complexity
- cost function
- markov decision processes
- objective function
- np hard
- k means
- mistake bound
- segmentation algorithm
- theoretical analysis
- state space
- optimal solution
- image segmentation
- clustering algorithm