Off-Policy Evaluation and Learning for External Validity under a Covariate Shift.
Masatoshi Uehara
Masahiro Kato
Shota Yasui
Published in:
NeurIPS (2020)
Keyphrases
covariate shift
learning algorithm
active learning
reinforcement learning
learning process
high dimensional
supervised learning
domain specific
semi supervised learning
learning tasks
learning problems
markov decision processes
model free