Login / Signup
Long-term Off-Policy Evaluation and Learning.
Yuta Saito
Himan Abdollahpouri
Jesse Anderton
Ben Carterette
Mounia Lalmas
Published in:
CoRR (2024)
Keyphrases
</>
long term
learning algorithm
reinforcement learning
learning process
least squares
learning tasks
statistical learning
machine learning