Generalised Task Planning with First-Order Function Approximation.
Jun Hao Alvin NgRonald P. A. PetrickPublished in: CoRL (2021)
Keyphrases
- function approximation
- reinforcement learning
- radial basis function
- temporal difference learning algorithms
- temporal difference learning
- model free
- learning tasks
- planning problems
- reinforcement learning problems
- higher order
- temporal difference
- neural network
- dynamic programming
- td learning
- data sets
- small number
- machine learning