Long-term Off-Policy Evaluation and Learning.
Yuta Saito
Himan Abdollahpouri
Jesse Anderton
Ben Carterette
Mounia Lalmas
Published in:
WWW (2024)
Keyphrases
long term
learning process
learning algorithm
active learning
neural network
reinforcement learning
learning tasks
statistical learning
training data
td learning