On the Performance of Temporal Difference Learning With Neural Networks.
Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky
Published in: ICLR (2023)
Keyphrases
- temporal difference learning
- neural network
- function approximation
- fixed point
- function approximators
- reinforcement learning
- evaluation function
- game playing
- approximate value iteration
- temporal difference
- artificial neural networks
- Markov decision process
- pattern recognition
- reinforcement learning algorithms
- Monte Carlo
- genetic algorithm
- radial basis function
- random walk
- probabilistic model
- active learning