Prior-dependent analysis of posterior sampling reinforcement learning with function approximation.
Yingru LiZhi-Quan LuoPublished in: AISTATS (2024)
Keyphrases
- function approximation
- reinforcement learning
- radial basis function
- reinforcement learning algorithms
- temporal difference learning
- function approximators
- learning tasks
- temporal difference
- model free
- temporal difference learning algorithms
- td learning
- prior knowledge
- artificial neural networks
- feature selection
- graph cuts
- state space
- reinforcement learning methods
- mountain car
- learning process