Login / Signup

Long-term Off-Policy Evaluation and Learning.

Yuta SaitoHiman AbdollahpouriJesse AndertonBen CarteretteMounia Lalmas
Published in: WWW (2024)
Keyphrases
  • long term
  • learning process
  • learning algorithm
  • active learning
  • neural network
  • reinforcement learning
  • learning tasks
  • statistical learning
  • training data
  • td learning