Login / Signup

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.

Haowei LiuYaya ShiHaiyang XuChunfeng YuanQinghao YeChenliang LiMing YanJi ZhangFei HuangBing LiWeiming Hu
Published in: CoRR (2024)
Keyphrases