Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning.

Published in: CoRR (2024)

Keyphrases