On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation.
Yuheng Zhang
Nan Jiang
Published in: CoRR (2024)
Keyphrases
machine learning
least squares
learning algorithm
support vector machine
policy evaluation