Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling.
Prashanth L. A.
Nathaniel Korda
Rémi Munos
Published in:
Mach. Learn. (2021)
Keyphrases
function approximation
temporal difference learning
temporal difference learning algorithms
data sets
training data
reinforcement learning
data points
temporal difference
fixed point
computer vision
objective function
prior knowledge
learning tasks
radial basis function
function approximators