CapText: Large Language Model-based Caption Generation From Image Context and Description.
Shinjini Ghosh, Sagnik Anupam
Published in: CoRR (2023)
Keyphrases
- input image
- image data
- image pixels
- image features
- single image
- image representation
- template matching
- image content
- contextual information
- multiscale
- image retrieval
- test images
- low level
- image matching
- image classification
- pixel values
- context dependent
- edge detection
- image description
- image analysis
- bounding box
- image segmentation
- caption text
- image structure
- keypoints
- lighting conditions
- Hough transform
- segmentation method
- high resolution
- computer vision