Login / Signup
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning.
Nathan Kallus
Xiaojie Mao
Kaiwen Wang
Zhengyuan Zhou
Published in:
CoRR (2022)
Keyphrases
</>
learning algorithm
supervised learning
training data
reinforcement learning
active learning
least squares