A Concentration Bound for TD(0) with Function Approximation.
Siddharth ChandakVivek S. BorkarPublished in: CoRR (2023)
Keyphrases
- function approximation
- temporal difference
- td learning
- temporal difference learning
- reinforcement learning
- radial basis function
- td methods
- reinforcement learning algorithms
- function approximators
- temporal difference methods
- learning tasks
- temporal difference learning algorithms
- model free
- policy evaluation
- data sets
- least squares