Implicit Temporal Differences.
Aviv Tamar, Panos Toulis, Shie Mannor, Edoardo M. Airoldi
Published in: CoRR (2014)
Keyphrases
- temporal difference
- reinforcement learning
- function approximation
- evaluation function
- TD learning
- Monte Carlo
- action selection
- step size
- model free
- policy evaluation
- reinforcement learning algorithms
- policy iteration
- temporal difference methods
- decision making
- data sets
- radial basis function
- optimization algorithm
- artificial neural networks
- similarity measure
- feature extraction
- neural network