COPA: Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment.

Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Ji Zhang
Published in: ACM Multimedia (2023)
Keyphrases