Keyphrases
- reinforcement learning
- function approximation
- model-free
- reinforcement learning algorithms
- robotic control
- learning algorithm
- optimal control
- Markov decision processes
- state space
- optimal policy
- supervised learning
- action selection
- reinforcement learning methods
- multi-agent
- temporal difference
- artificial intelligence
- machine learning
- direct policy search
- social networks
- robot control
- action space
- temporal difference learning
- continuous state