Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing.
Philipp HarzigDan ZechaRainer LienhartCarolin KaiserRené SchallnerPublished in: CoRR (2019)
Keyphrases
- multi modal
- input image
- auto annotation
- image features
- fusing multiple
- image data
- multiscale
- segmentation method
- uni modal
- image analysis
- image representation
- image segmentation
- image regions
- high resolution
- image retrieval
- edge detection
- image content
- single modality
- image classification
- low level
- cross modal
- multi modality
- image annotation
- audio visual
- high dimensional
- high level
- feature extraction
- contrast enhancement
- multiple modalities