Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting.

Published in: CoRR (2020)

Keyphrases