Keyphrases
- reinforcement learning
- function approximation
- model free
- markov decision processes
- reinforcement learning algorithms
- higher level
- learning algorithm
- action selection
- state space
- learning problems
- direct policy search
- automatically extracting
- multi agent
- hierarchical structure
- robotic control
- dynamic programming
- learning process
- hierarchically organized
- lower level
- hierarchical organization
- temporal difference learning
- learning automata
- optimal policy
- search space