Distributionally Robust Policy Gradient for Offline Contextual Bandits.

Published in: AISTATS (2023)

Keyphrases