FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues.
Shuang Li
Jiahua Wang
Lijie Wen
Published in:
CoRR (2024)
Keyphrases
multi modal
cross modal
video search
visual representations
single modality
multi modality
visual features
high dimensional
image features
low level
multimedia
audio visual
image annotation
auto annotation
keywords
information processing
feature space
fusing multiple