Login / Signup

On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation.

Yuheng ZhangNan Jiang
Published in: CoRR (2024)
Keyphrases
  • machine learning
  • least squares
  • learning algorithm
  • support vector machine
  • policy evaluation