Generalised Task Planning with First-Order Function Approximation.

Jun Hao Alvin Ng Ronald P. A. Petrick

Published in: CoRL (2021)

Keyphrases

function approximation
reinforcement learning
radial basis function
temporal difference learning algorithms
temporal difference learning
model free
learning tasks
planning problems
reinforcement learning problems
higher order
temporal difference
neural network
dynamic programming
td learning
data sets
small number
machine learning