CITE: Compact Interactive TransformEr for Multilingual Image Captioning.
Yueyuan XuZhenzhen HuYuanen ZhouShijie HaoRichang HongPublished in: ICIGP (2023)
Keyphrases
- image data
- input image
- single image
- image features
- multiscale
- image content
- image analysis
- image collections
- image regions
- high resolution
- image classification
- image representation
- test images
- template matching
- image matching
- region of interest
- image pixels
- pixel values
- image processing
- fuzzy logic
- low level
- image retrieval
- image segmentation
- spatial information
- segmentation method
- energy function
- user interaction
- edge detection
- lighting conditions
- image structure