CapText: Large Language Model-based Caption Generation From Image Context and Description.

Shinjini Ghosh Sagnik Anupam

Published in: CoRR (2023)

Keyphrases

input image
image data
image pixels
image features
single image
image representation
template matching
image content
contextual information
multiscale
image retrieval
test images
low level
image matching
image classification
pixel values
context dependent
edge detection
image description
image analysis
bounding box
image segmentation
caption text
image structure
keypoints
lighting conditions
hough transform
segmentation method
high resolution
computer vision