PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method.
Ziwei Guan
Tengyu Xu
Yingbin Liang
Published in:
CoRR (2021)
Keyphrases
objective function
cost function
dynamic programming
support vector machine
machine learning
artificial neural networks
support vector machine svm
temporal difference learning