Multi-Modal Image Captioning for the Visually Impaired.
Hiba AhsanNikita BhallaDaivat BhattKaivankumar ShahPublished in: CoRR (2021)
Keyphrases
- multi modal
- auto annotation
- image features
- input image
- uni modal
- multi modality
- image analysis
- image data
- fusing multiple
- high resolution
- image content
- image classification
- image representation
- image retrieval
- multiscale
- image segmentation
- image regions
- audio visual
- multiple modalities
- high dimensional
- low level
- image collections
- cross modal
- mutual information
- segmentation method
- medical imaging
- semantic concepts
- similarity measure