Distributionally Robust Policy Gradient for Offline Contextual Bandits.
Zhouhao Yang
Yihong Guo
Pan Xu
Anqi Liu
Animashree Anandkumar
Published in:
AISTATS (2023)
Keyphrases
policy gradient
robust optimization
multi agent