On the Performance of Temporal Difference Learning With Neural Networks.

Haoxing Tian Ioannis Ch. Paschalidis Alex Olshevsky

Published in: ICLR (2023)

Keyphrases

temporal difference learning
neural network
function approximation
fixed point
function approximators
reinforcement learning
evaluation function
game playing
approximate value iteration
temporal difference
artificial neural networks
markov decision process
pattern recognition
reinforcement learning algorithms
monte carlo
genetic algorithm
radial basis function
random walk
probabilistic model
active learning