Finite Sample Analysis for TD(0) with Linear Function Approximation.

Gal Dalal Balázs Szörényi Gugan Thoppe Shie Mannor

Published in: CoRR (2017)

Keyphrases

function approximation
reinforcement learning
temporal difference
temporal difference learning algorithms
td learning
function approximators
finite sample
temporal difference learning
temporal difference methods
data sets
sample size
learning tasks
model free