Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise.

Yue Wang Shaofeng Zou

Published in: UAI (2020)

Keyphrases

function approximation
reinforcement learning
temporal difference learning algorithms
finite sample
data sets
feature selection
convergence rate