Why Target Networks Stabilise Temporal Difference Methods.
Mattie Fellows
Matthew J. A. Smith
Shimon Whiteson
Published in:
CoRR (2023)
Keyphrases
temporal difference methods
function approximation
artificial neural networks
reinforcement learning
linear combination
evolutionary methods