A finite-sample analysis of multi-step temporal difference estimates.
Yaqi Duan
Martin J. Wainwright
Published in:
L4DC (2023)
Keyphrases
multi step
temporal difference
td learning
evaluation function
finite sample
data sets
reinforcement learning
neural network
feature extraction
pattern recognition
evolutionary algorithm
supervised learning
convergence rate