Login / Signup
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation.
Gandharv Patil
Prashanth L. A.
Dheeraj Nagaraj
Doina Precup
Published in:
CoRR (2022)
Keyphrases
</>
function approximation
temporal difference learning
temporal difference learning algorithms
reinforcement learning
function approximators
temporal difference
fixed point
evaluation function
game playing
learning tasks
radial basis function
model free
data mining
model selection
learning experience
finite number