Keyphrases
- discounted reward
- traveling salesman problem
- markov decision processes
- average reward
- policy iteration
- ant colony optimization
- optimal policy
- hierarchical reinforcement learning
- search space
- state and action spaces
- combinatorial optimization
- long run
- reinforcement learning
- optimality criterion
- optimal solution
- state space
- np hard
- finite state
- markov decision problems
- machine learning
- fixed point
- heuristic search
- dynamic programming