Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael Bowling
Published in: CoRR (2012)
Keyphrases
- function approximation
- temporal difference learning algorithms
- function approximators
- reinforcement learning
- temporal difference learning
- reinforcement learning problems
- radial basis function
- temporal difference
- learning tasks
- model-free
- TD learning
- temporal difference methods
- pattern recognition
- semi-supervised learning