Keyphrases
- reinforcement learning
- function approximation
- reinforcement learning algorithms
- state space
- optimal policy
- robotic control
- model free
- multi agent
- dynamic programming
- markov decision processes
- machine learning
- learning algorithm
- temporal difference
- direct policy search
- optimal control
- temporal difference learning
- supervised learning
- learning problems
- transfer learning
- data sets
- clustering algorithm
- computer vision
- information retrieval
- data mining
- real world