Confidence Intervals for Policy Evaluation in Adaptive Experiments.
Vitor Hadad
David A. Hirshberg
Ruohan Zhan
Stefan Wager
Susan Athey
Published in:
CoRR (2019)
Keyphrases
confidence intervals
variance reduction
policy evaluation
Monte Carlo
sample size
least squares
Markov chain
temporal difference
reinforcement learning
model free
conditional probabilities
dynamic programming
text classification
test set
machine learning
ROC curve
policy iteration