Sign in

Leveraging per Image-Token Consistency for Vision-Language Pre-training.

Yunhao GouTom KoHansi YangJames T. KwokYu ZhangMingxuan Wang
Published in: CVPR (2023)
Keyphrases