Login / Signup
Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits.
Nian Si
Fan Zhang
Zhengyuan Zhou
Jose H. Blanchet
Published in:
ICML (2020)
Keyphrases
</>
learning algorithm
reinforcement learning
learning tasks
least squares
supervised learning
active learning