Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits.

Nian Si, Fan Zhang, Zhengyuan Zhou, Jose H. Blanchet

Published in: ICML (2020)

Keyphrases

learning algorithm
reinforcement learning
learning tasks
least squares
supervised learning
active learning