Sign in

Leveraging per Image-Token Consistency for Vision-Language Pre-training.

Yunhao GouTom KoHansi YangJames KwokYu ZhangMingxuan Wang
Published in: CoRR (2022)
Keyphrases