Keyphrases
- reinforcement learning
- function approximation
- learning algorithm
- reinforcement learning algorithms
- model free
- multi agent
- state space
- machine learning
- artificial intelligence
- robotic control
- temporal difference learning
- control problems
- temporal difference
- policy search
- fitted q iteration
- transition model
- partially observable domains
- stochastic approximation
- database
- markov decision process
- learning capabilities
- optimal control
- radial basis function
- information retrieval
- data mining