Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning.

Published in: AISTATS (2021)

Keyphrases