Login / Signup
Structure-CLIP: Enhance Multi-modal Language Representations with Structure Knowledge.
Yufeng Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
Weijie Chen
Zeng Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
Published in:
CoRR (2023)
Keyphrases
</>
high level
multi modal
low level
multimedia