Login / Signup
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators.
Zaiwei Chen
Siva Theja Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
Published in:
CoRR (2021)
Keyphrases
</>
finite sample
data sets
feature extraction
machine learning algorithms
statistical learning theory
td learning