Simplifying Deep Temporal Difference Learning.
Matteo Gallici
Mattie Fellows
Benjamin Ellis
Bartomeu Pou
Ivan Masmitja
Jakob Nicolaus Foerster
Mario Martin
Published in:
CoRR (2024)
Keyphrases
temporal difference learning
function approximation
fixed point
evaluation function
reinforcement learning
game playing
temporal difference
approximate value iteration
reinforcement learning algorithms
Markov decision process
Monte Carlo
neural network
machine learning
semi supervised