Login / Signup

COPA: Efficient Vision-Language Pre-training Through Collaborative Object- and Patch-Text Alignment.

Chaoya JiangHaiyang XuWei YeQinghao YeChenliang LiMing YanBin BiShikun ZhangJi ZhangFei Huang
Published in: CoRR (2023)
Keyphrases