Automatic Findings Generation for Distress Images Using In-Context Few-Shot Learning of Visual Language Model Based on Image Similarity and Text Diversity.
Yuto WatanabeNaoki OgawaKeisuke MaedaTakahiro OgawaMiki HaseyamaPublished in: J. Robotics Mechatronics (2024)
Keyphrases
- image similarity
- image database
- image retrieval
- visual features
- image data
- image features
- input image
- image regions
- image matching
- contextual information
- object recognition
- web images
- image collections
- image content
- information retrieval
- image classification
- similarity measure
- multiscale
- image understanding
- spatial relationships
- human perception
- similarity metric
- keywords
- visual similarity
- feature extraction