MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer.
Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie Zhou, Hongsheng Li, Yu Qiao, Jifeng Dai
Published in: CoRR (2024)
Keyphrases
- multi modal
- image features
- auto annotation
- multiple modalities
- web images
- input image
- image annotation
- uni modal
- image data
- fusing multiple
- video search
- image analysis
- image representation
- image content
- image classification
- audio visual
- multi modality
- multiscale
- image regions
- edge detection
- cross modal
- segmentation method
- low level
- single modality
- image retrieval
- image segmentation
- image search
- feature vectors
- image collections
- segmentation algorithm
- keywords