PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting.

Published in: SIGIR (2023)

Keyphrases