BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues.
Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
Published in: CoRR (2024)
Keyphrases
- visual cues
- low level
- visual information
- image content
- image features
- mid level
- input image
- depth cues
- image classification
- multiple cues
- image retrieval
- high level
- image representation
- image matching
- similarity measure
- image segmentation
- multiple visual cues
- single image
- saliency detection
- displacement field
- position and orientation
- vector field
- computer vision
- feature points
- image processing