Why Target Networks Stabilise Temporal Difference Methods

Mattie Fellows, Matthew J. A. Smith, Shimon Whiteson

Published in: CoRR (2023)

Keyphrases

temporal difference methods
function approximation
artificial neural networks
reinforcement learning
linear combination
evolutionary methods