Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits.
Yi Shen
Pan Xu
Michael M. Zavlanos
Published in:
CoRR (2023)
Keyphrases
learning algorithm
reinforcement learning
least squares
learning tasks
Markov decision processes