MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning.

Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu
Published in: CoRR (2022)