Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise.
Yue Wang
Shaofeng Zou
Published in:
UAI (2020)
Keyphrases
function approximation
reinforcement learning
temporal difference learning algorithms
finite sample
data sets
feature selection
convergence rate