Login / Signup
Provable distributed adaptive temporal-difference learning over time-varying networks.
Junlong Zhu
Bing Li
Lin Wang
Mingchuan Zhang
Ling Xing
Jiangtao Xi
Qingtao Wu
Published in:
Expert Syst. Appl. (2023)
Keyphrases
</>
temporal difference learning
function approximation
fixed point
evaluation function
game playing
reinforcement learning
approximate value iteration
temporal difference
multi agent
decision making
semi supervised
linear combination