Zap Q-Learning.

Adithya M. Devraj Sean P. Meyn

Published in: NIPS (2017)

Keyphrases

reinforcement learning
function approximation
cooperative
multi agent
learning algorithm
state space
learning rate
optimal policy
temporal difference learning
stochastic approximation
action selection
model free
reinforcement learning algorithms
bucket brigade
multiagent learning
multi agent reinforcement learning
stochastic shortest path
potential field
policy iteration
temporal difference
markov decision processes
artificial intelligence
information retrieval
continuous state spaces
hierarchical reinforcement learning
machine learning
credit assignment
real world