Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits.
Aaron David Tucker
Thorsten Joachims
Published in:
CoRR (2022)
Keyphrases
evaluation method
dynamic programming
conditional expectation
case study
optimal solution
evaluation metrics
neural network
web pages
least squares
closed form
context sensitive
variance reduction