Keyphrases
- reinforcement learning
- function approximation
- multi-agent
- cooperative
- state space
- learning algorithm
- optimal policy
- reinforcement learning algorithms
- model-free
- action selection
- learning rate
- stochastic approximation
- multi-agent reinforcement learning
- temporal difference
- potential field
- dynamic programming
- temporal difference learning
- search algorithm
- relational reinforcement learning
- data sets
- least squares
- reward function
- stochastic shortest path