Keyphrases
- state space
- reinforcement learning
- Markov chain
- function approximation
- cooperative
- Markov processes
- optimal control
- multi-agent
- learning algorithm
- optimal policy
- stochastic approximation
- dynamic programming
- multi-agent reinforcement learning
- stochastic processes
- iterative learning control
- bucket brigade
- action selection
- model-free
- Markov decision processes
- dynamical systems
- learning rate
- potential field
- reinforcement learning algorithms
- continuous state spaces
- relational reinforcement learning
- machine learning
- Markov decision process
- learning agent
- Markov process
- dynamic environments
- search space
- multi-agent systems