Login / Signup
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits.
Muhammad Faaiz Taufiq
Arnaud Doucet
Rob Cornish
Jean-Francois Ton
Published in:
NeurIPS (2023)
Keyphrases
</>
density ratio
policy evaluation
least squares
semi parametric
density ratio estimation
density estimation
linear regression
linear model
statistical inference
monte carlo
model free
parametric models
policy iteration
temporal difference
machine learning
test set
probability distribution
optical flow