Boundedness of iterates in Q-Learning.

Published in: Syst. Control. Lett. (2006)

Keyphrases

reinforcement learning
function approximation
sufficient conditions
cooperative
multi agent
state space
stochastic approximation
learning algorithm
reinforcement learning algorithms
optimal policy
multi agent reinforcement learning
action selection
model free
bucket brigade
temporal difference learning
database
state action
reinforcement learning methods
stochastic shortest path
stationary points
approximation methods
learning rate
dynamical systems
evolutionary algorithm
objective function