A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning.
Eloïse Berthier, Ziad Kobeissi, Francis R. Bach
Published in: NeurIPS (2022)
Keyphrases
- asymptotic analysis
- temporal difference learning
- function approximation
- fixed point
- reinforcement learning
- evaluation function
- game playing
- fluid model
- Gaussian process
- temporal difference
- Monte Carlo
- reinforcement learning algorithms
- Markov decision process
- machine learning
- cost function
- state space
- neural network