Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning.

Published in: Inf. Sci. (2024)

Keyphrases