Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets.

Published in: CoRR (2022)

Keyphrases