Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation.
Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup
Published in: AISTATS (2023)
Keyphrases
- function approximation
- temporal difference learning
- temporal difference learning algorithms
- reinforcement learning
- function approximators
- radial basis function
- model free
- fixed point
- temporal difference
- reinforcement learning algorithms
- model selection
- learning tasks
- evaluation function
- game playing
- neural network
- search space