Login / Signup
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits.
Ruohan Zhan
Vitor Hadad
David A. Hirshberg
Susan Athey
Published in:
CoRR (2021)
Keyphrases
</>
training data
probability distribution
statistical methods
statistical inference