Login / Signup
Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient.
Xiang Li
Wenhao Yang
Zhihua Zhang
Michael I. Jordan
Published in:
CoRR (2021)
Keyphrases
</>
data sets
learning algorithm
machine learning
decision making
case study
image sequences
feature extraction
high quality
pattern recognition
natural language
preprocessing
cost effective