Improving Multimodal Datasets with Image Captioning.
Thao Nguyen, Samir Yitzhak Gadre, Gabriel Ilharco, Sewoong Oh, Ludwig Schmidt
Published in: CoRR (2023)
Keyphrases
- image data
- image classification
- single image
- image analysis
- image features
- input image
- image content
- image segmentation
- multiscale
- high resolution
- similarity measure
- image representation
- template matching
- low level
- image dataset
- feature points
- grey level
- image collections
- image pixels
- data sets
- test images
- vector field
- million images
- segmentation method
- post processing
- image retrieval
- image matching
- bag of words
- spatial information
- image regions
- region of interest
- energy function
- active contours
- feature selection