Login / Signup

ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation.

Weihan WangZhen YangBin XuJuanzi LiYankui Sun
Published in: CoRR (2023)
Keyphrases