Keyphrases
- reinforcement learning
- branch and bound
- reinforcement learning algorithms
- model-free
- state space
- multi-agent
- function approximation
- optimal policy
- robotic control
- search tree
- Markov decision processes
- learning algorithm
- machine learning
- learning problems
- dynamic programming
- search algorithm
- control problems
- stochastic approximation
- search space
- optimal control
- knowledge base
- temporal difference
- learning capabilities
- learning agent
- function approximators
- perceptual aliasing