Login / Signup
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations.
Yufeng Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
Weijie Chen
Zeng Zhao
Zhou Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
Published in:
AAAI (2024)
Keyphrases
</>
multi modal
structured representations
knowledge base
multi modality
video sequences
cross modal
random walk
high dimensional
uni modal
machine learning
information extraction
plan recognition
single modality