Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting.

Published in: AISTATS (2021)

Keyphrases