Login / Signup
Off-policy TD( l) with a true online equivalence.
Hado van Hasselt
Ashique Rupam Mahmood
Richard S. Sutton
Published in:
UAI (2014)
Keyphrases
</>
online learning
real time
learning algorithm
information retrieval
cross cultural
online algorithms
computer vision
data structure
information technology