Unified PAC-Bayesian Study of Pessimism for Offline Policy Learning with Regularized Importance Sampling.

Published in: CoRR (2024)

Keyphrases