Sign in

VLDeformer: Vision-Language Decomposed Transformer for fast cross-modal retrieval.

Lisai ZhangHongfa WuQingcai ChenYimeng DengJoanna SiebertZhonghua LiYunpeng HanDejiang KongZhao Cao
Published in: Knowl. Based Syst. (2022)
Keyphrases