Keyphrases
- reinforcement learning
- function approximation
- state space
- control problems
- multi agent
- dynamic programming
- optimal policy
- function approximators
- markov decision processes
- policy search
- neural network
- continuous state
- temporal difference
- direct policy search
- action selection
- multi agent reinforcement learning
- reinforcement learning methods
- learning agents
- markov decision process
- real robot
- data sets
- sufficient conditions
- machine learning
- databases