Keyphrases
- heuristic search
- dynamic programming
- upper bound
- markov decision processes
- control theory
- larger problems
- initial state
- lower bound
- reinforcement learning methods
- real time dynamic programming
- search space
- tree search
- beam search
- convergence rate
- search algorithm
- search problems
- policy iteration
- branch and bound
- optimal policy