Achieving the Asymptotically Minimax Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach.

Published in: Trans. Mach. Learn. Res. (2024)

Keyphrases