Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation.

Liyuan Xu, Heishiro Kanagawa, Arthur Gretton

Published in: CoRR (2021)

Keyphrases

learning algorithm
reinforcement learning
active learning
learning tasks
least squares
text classification
fixed point