Login / Signup
Finite Sample Analysis of LSTD with Random Projections and Eligibility Traces.
Haifang Li
Yingce Xia
Wensheng Zhang
Published in:
CoRR (2018)
Keyphrases
</>
random projections
reinforcement learning
least squares
eligibility traces
neural network
policy evaluation
support vector
machine learning
feature extraction
feature vectors
hash functions
temporal difference
reinforcement learning algorithms