IMAD: IMage-Augmented multi-modal Dialogue.
Viktor MoskvoretskiiAnton FrolovDenis KuznetsovPublished in: CoRR (2023)
Keyphrases
- multi modal
- input image
- uni modal
- image data
- auto annotation
- fusing multiple
- image classification
- multiple modalities
- high resolution
- image features
- image representation
- image retrieval
- image segmentation
- multiscale
- image analysis
- image content
- cross modal
- low level
- image regions
- segmentation method
- visual cues
- web images
- audio visual
- single modality
- edge detection
- high dimensional
- image processing