Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation.
Uri ShermanTomer KorenYishay MansourPublished in: CoRR (2023)
Keyphrases
- function approximation
- reinforcement learning
- function approximators
- temporal difference learning algorithms
- online learning
- temporal difference
- temporal difference learning
- learning tasks
- reinforcement learning algorithms
- radial basis function
- model free
- markov decision processes
- state space
- mountain car
- temporal difference methods
- supervised learning
- td learning
- transfer learning
- machine learning
- least squares
- pattern recognition
- support vector
- genetic algorithm