Learning Action Embeddings for Off-Policy Evaluation.

Matej Cief, Jacek Golebiowski, Philipp Schmidt, Ziawasch Abedjan, Artur Bekasov
Published in: CoRR (2023)
Keyphrases
  • learning algorithm
  • learning tasks
  • reinforcement learning
  • active learning
  • least squares
  • action selection