Target Network and Truncation Overcome The Deadly triad in Q-Learning.

Zaiwei Chen John-Paul Clarke Siva Theja Maguluri

Published in: CoRR (2022)

Keyphrases

complex networks
reinforcement learning
multi agent
cooperative
computer networks
function approximation
network model
neural network
state space
communication networks
link prediction
learning rate
model free