A Geometric Traversal Algorithm for Reward-Uncertain MDPs
Eunsoo OhKee-Eung KimPublished in: CoRR (2012)
Keyphrases
- detection algorithm
- dynamic programming
- reinforcement learning
- learning algorithm
- objective function
- cost function
- expectation maximization
- simulated annealing
- worst case
- computational cost
- np hard
- significant improvement
- k means
- average reward
- tree structure
- neural network
- markov decision processes
- optimization algorithm
- convergence rate
- segmentation algorithm
- probabilistic model
- linear programming
- bayesian networks
- decision making
- search space
- preprocessing
- optimal solution
- image segmentation