Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models.

Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu
Published in: CoRR (2023)