Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech.
Xinyuan QianWei XueQiquan ZhangRuijie TaoHaizhou LiPublished in: IEEE Trans. Multim. (2024)
Keyphrases
- cross modal
- image retrieval
- visual similarity
- image data
- multi modal
- spatial information
- image content
- image features
- spatial relationships
- visual data
- multiscale
- perceptual information
- image collections
- multimedia retrieval
- image classification
- image database
- image representation
- image regions
- test images
- retrieval systems
- information retrieval
- low level
- image set
- spatial relations
- visual information
- multimedia databases
- content based retrieval
- spatial data
- web images
- visual features
- high dimensional
- feature extraction