PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting.

Published in: CoRR (2023)