Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning.

Published in: CoRR (2024)

Keyphrases