Login / Signup
Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits.
Yihong Guo
Hao Liu
Yisong Yue
Anqi Liu
Published in:
Trans. Mach. Learn. Res. (2024)
Keyphrases
</>
policy evaluation
covariate shift
least squares
data mining
dynamic programming
monte carlo
temporal difference