Sign in

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage.

Jose BlanchetMiao LuTong ZhangHan Zhong
Published in: CoRR (2023)
Keyphrases