Login / Signup
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method.
Ziwei Guan
Tengyu Xu
Yingbin Liang
Published in:
ICLR (2022)
Keyphrases
</>
support vector machine
objective function
support vector machine svm
cost function
dynamic programming
temporal difference learning