Finite Sample Analysis for TD(0) with Linear Function Approximation.
Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
Published in:
CoRR (2017)
Keyphrases
function approximation
reinforcement learning
temporal difference
temporal difference learning algorithms
td learning
function approximators
finite sample
temporal difference learning
temporal difference methods
data sets
sample size
learning tasks
model free