Toward Off-Policy Learning Control with Function Approximation.
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton
Published in: ICML (2010)
Keyphrases
- function approximation
- reinforcement learning
- learning tasks
- temporal difference learning
- td learning
- learning algorithm
- supervised learning
- actor critic
- learning process
- prior knowledge
- machine learning
- adaptive control
- radial basis function
- function approximators
- control system
- neural network
- temporal difference learning algorithms