Login / Signup
Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation.
Liyuan Xu
Heishiro Kanagawa
Arthur Gretton
Published in:
CoRR (2021)
Keyphrases
</>
learning algorithm
reinforcement learning
active learning
learning tasks
least squares
text classification
fixed point