Periodic Q-Learning.

Donghwan Lee, Niao He

Published in: L4DC (2020)

Keyphrases

reinforcement learning
cooperative
state space
function approximation
multi agent
learning algorithm
stochastic approximation
model free
action selection
reinforcement learning algorithms
case study
temporal difference learning
learning rate
optimal policy
dynamic programming
temporal difference
video sequences
TD learning
bucket brigade