Towards a Better Understanding of Representation Dynamics under TD-learning.

Yunhao Tang, Rémi Munos
Published in: CoRR (2023)
Keyphrases
  • td learning
  • temporal difference
  • function approximation
  • evaluation function
  • learning algorithm
  • decision making
  • feature extraction