Keyphrases
- reinforcement learning
- state space
- function approximation
- reinforcement learning algorithms
- robotic control
- temporal difference learning
- control problems
- learning process
- dynamic programming
- markov decision processes
- direct policy search
- machine learning
- multi agent reinforcement learning
- temporal difference
- model free
- optimal control
- monte carlo
- supervised learning
- multi agent
- artificial intelligence