Login / Signup
Finite Sample Analysis of the GTD Policy Evaluation Algorithms in Markov Setting.
Yue Wang
Wei Chen
Yuting Liu
Zhi-Ming Ma
Tie-Yan Liu
Published in:
CoRR (2018)
Keyphrases
</>
learning algorithm
computational complexity
error bounds
finite sample
policy evaluation
least squares
worst case
statistical methods