Keyphrases
- heuristic search
- markov decision processes
- dynamic programming
- larger problems
- upper bound
- control theory
- real time dynamic programming
- initial state
- reinforcement learning methods
- state space
- beam search
- search space
- reinforcement learning
- optimal policy
- dynamical systems
- search algorithm
- average cost
- lower bound
- reinforcement learning algorithms
- heuristic function
- machine learning
- planning problems
- path finding
- control problems
- branch and bound
- policy iteration
- action space