Keyphrases
- reinforcement learning
- function approximation
- sufficient conditions
- cooperative
- multi agent
- state space
- stochastic approximation
- learning algorithm
- reinforcement learning algorithms
- optimal policy
- multi agent reinforcement learning
- action selection
- model free
- bucket brigade
- temporal difference learning
- database
- state action
- reinforcement learning methods
- stochastic shortest path
- stationary points
- approximation methods
- learning rate
- dynamical systems
- evolutionary algorithm
- objective function