Keyphrases
- reinforcement learning
- function approximation
- cooperative
- multi agent
- learning algorithm
- state space
- learning rate
- optimal policy
- temporal difference learning
- stochastic approximation
- action selection
- model free
- reinforcement learning algorithms
- bucket brigade
- multiagent learning
- multi agent reinforcement learning
- stochastic shortest path
- potential field
- policy iteration
- temporal difference
- markov decision processes
- artificial intelligence
- information retrieval
- continuous state spaces
- hierarchical reinforcement learning
- machine learning
- credit assignment
- real world