Keyphrases
- state space
- reinforcement learning
- cooperative
- markov chain
- function approximation
- multi agent
- dynamical systems
- markov processes
- learning algorithm
- stochastic approximation
- action selection
- optimal policy
- optimal control
- model free
- iterative learning control
- temporal difference learning
- learning rate
- dynamic programming
- markov decision processes
- multi agent reinforcement learning
- monte carlo
- continuous state spaces
- bucket brigade
- continuous time bayesian networks
- sufficient conditions
- stochastic processes
- infinite horizon
- data sets
- knowledge base
- non stationary
- continuous state and action spaces
- stochastic shortest path