Prior-dependent analysis of posterior sampling reinforcement learning with function approximation.
Yingru LiZhi-Quan LuoPublished in: CoRR (2024)
Keyphrases
- function approximation
- reinforcement learning
- temporal difference learning
- radial basis function
- temporal difference
- state space
- probability distribution
- function approximators
- temporal difference learning algorithms
- learning tasks
- neural network
- artificial neural networks
- model free
- mountain car
- reinforcement learning algorithms
- multi agent
- prior knowledge
- small number