Login / Signup

Learning Action Embeddings for Off-Policy Evaluation.

Matej CiefJacek GolebiowskiPhilipp SchmidtZiawasch AbedjanArtur Bekasov
Published in: ECIR (1) (2024)
Keyphrases
  • learning algorithm
  • reinforcement learning
  • learning tasks
  • active learning
  • monte carlo
  • supervised learning
  • markov decision processes
  • action selection