Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study.
Mariya Hendriksen, Svitlana Vakulenko, Ernst Kuiper, Maarten de Rijke
Published in: CoRR (2023)
Keyphrases
- cross modal
- image retrieval
- input image
- image regions
- visual data
- information retrieval
- visual similarity
- image data
- spatial relations
- image features
- image content
- image representation
- scene classification
- image set
- multi modal
- text retrieval
- keypoints
- bounding box
- multimedia retrieval
- web images
- visual words
- moving objects
- image collections
- object parts
- spatial relationships
- document retrieval
- query expansion
- image classification
- video sequences
- keywords
- test images
- visual features
- dissimilarity measure
- relevance feedback
- visual recognition
- low level
- image sequences
- high level