A Concentration Bound for TD(0) with Function Approximation.

Siddharth Chandak Vivek S. Borkar

Published in: CoRR (2023)

Keyphrases

function approximation
temporal difference
td learning
temporal difference learning
reinforcement learning
radial basis function
td methods
reinforcement learning algorithms
function approximators
temporal difference methods
learning tasks
temporal difference learning algorithms
model free
policy evaluation
data sets
least squares