Successive Over Relaxation Q-Learning.
Chandramouli KamanchiRaghuram Bharadwaj DiddigiShalabh BhatnagarPublished in: CoRR (2019)
Keyphrases
- reinforcement learning
- cooperative
- function approximation
- stochastic approximation
- multi agent
- state space
- probabilistic relaxation
- reinforcement learning algorithms
- learning algorithm
- model free
- optimal policy
- lognormal distribution
- dynamic programming
- action selection
- iterative algorithms
- multi agent reinforcement learning
- bucket brigade
- learning rate
- support vector
- machine learning
- monte carlo
- artificial neural networks
- relational reinforcement learning