Keyphrases
- reinforcement learning
- cooperative
- state space
- function approximation
- multi agent
- learning algorithm
- stochastic approximation
- model free
- action selection
- reinforcement learning algorithms
- case study
- temporal difference learning
- learning rate
- optimal policy
- dynamic programming
- temporal difference
- video sequences
- td learning
- bucket brigade