Sign in

MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling.

Zijia ZhaoLongteng GuoXingjian HeShuai ShaoZehuan YuanJing Liu
Published in: SIGIR (2023)
Keyphrases