Successive Over Relaxation Q-Learning.

Chandramouli Kamanchi Raghuram Bharadwaj Diddigi Shalabh Bhatnagar

Published in: CoRR (2019)

Keyphrases

reinforcement learning
cooperative
function approximation
stochastic approximation
multi agent
state space
probabilistic relaxation
reinforcement learning algorithms
learning algorithm
model free
optimal policy
lognormal distribution
dynamic programming
action selection
iterative algorithms
multi agent reinforcement learning
bucket brigade
learning rate
support vector
machine learning
monte carlo
artificial neural networks
relational reinforcement learning