Mixer: Image to Multi-Modal Retrieval Learning for Industrial Application.
Zida ChengShuai XiaoZhonghua ZhaiXiaoyi ZengWeilin HuangPublished in: CoRR (2023)
Keyphrases
- multi modal
- auto annotation
- industrial applications
- image retrieval
- multi modality
- image classification
- image data
- multiscale
- information retrieval
- cross modal
- image content
- fusing multiple
- video search
- uni modal
- image analysis
- input image
- low level
- image segmentation
- segmentation method
- single modality
- high level
- multiple modalities
- high dimensional
- feature space
- audio visual
- image representation
- medical images