Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images.
Zhongwei Xie, Ling Liu, Lin Li, Luo Zhong
Published in: CoRR (2021)
Keyphrases
- cross modal
- perceptual information
- multi modal
- image retrieval
- visual similarity
- image database
- image data
- learning tasks
- multiple modalities
- image classification
- image annotation
- multimedia retrieval
- visual features
- image regions
- information retrieval
- visual data
- e learning
- image understanding
- image set
- content based retrieval
- multimedia information retrieval
- image features
- object recognition
- similarity measure