Temporal Difference Learning as Gradient Splitting.

Rui Liu Alex Olshevsky

Published in: ICML (2021)

Keyphrases

temporal difference learning
function approximation
fixed point
evaluation function
game playing
reinforcement learning
approximate value iteration
temporal difference
markov decision process
reinforcement learning algorithms
artificial neural networks
markov random field
sufficient conditions
monte carlo
real valued