Unsupervised Vision-and-Language Pretraining via Retrieval-based Multi-Granular Alignment.

Mingyang Zhou, Licheng Yu, Amanpreet Singh, Mengjiao Wang, Zhou Yu, Ning Zhang
Published in: CVPR (2022)