Learning Action Embeddings for Off-Policy Evaluation.
Matej Cief
Jacek Golebiowski
Philipp Schmidt
Ziawasch Abedjan
Artur Bekasov
Published in:
CoRR (2023)
Keyphrases
learning algorithm
learning tasks
reinforcement learning
active learning
least squares
action selection