COPA: Efficient Vision-Language Pre-training Through Collaborative Object- and Patch-Text Alignment.
Chaoya Jiang
Haiyang Xu
Wei Ye
Qinghao Ye
Chenliang Li
Ming Yan
Bin Bi
Shikun Zhang
Ji Zhang
Fei Huang
Published in:
CoRR (2023)
Keyphrases
programming language
english text
computer vision
language learning
text mining
vision system
text retrieval
complex objects
computational linguistics
english language
information retrieval
natural language
supervised learning
training examples
text documents
object model