Keyphrases
- natural gradient
- learning rate
- independent component analysis
- learning algorithm
- blind source separation
- convergence rate
- function approximation
- policy gradient
- cooperative
- reinforcement learning
- convergence speed
- fixed point
- state space
- multi agent
- optimal policy
- neural network
- model free
- markov decision processes
- pattern recognition
- mixing matrix