Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval.
Xuri Ge, Fuhai Chen, Joemon M. Jose, Zhilong Ji, Zhongqin Wu, Xiao Liu
Published in: CoRR (2021)
Keyphrases
- multi modal
- image features
- sentence retrieval
- image data
- multi modality
- uni modal
- image content
- image retrieval
- image classification
- audio visual
- video search
- low level
- image annotation
- image collections
- single modality
- multiple modalities
- machine translation
- novelty detection
- visual information
- metadata
- image representation
- feature vectors
- similarity measure