Towards a Better Understanding of Representation Dynamics under TD-learning.

Yunhao Tang Rémi Munos

Published in: CoRR (2023)

Keyphrases

td learning
temporal difference
function approximation
evaluation function
learning algorithm
decision making
feature extraction